Posts

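Paper Notes: Designing Access Methods: The RUM Conjecture

The paper conjectures that data structures face a trilemma among read (R), update (U), and memory (M) overheads: for any data structure, optimizing two of them worsens the overhead of the remaining one. The authors named this the RUM Conjecture after the initials of Read, Update, and Memory.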
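As a rough illustration of the read/update tension in the conjecture, the sketch below counts how many elements two toy structures touch per operation. The class names `AppendLog` and `SortedArray` and the "elements touched" cost model are assumptions made for this note, not something taken from the paper.

```python
import bisect


class AppendLog:
    """Unsorted append-only list: cheap updates, expensive reads."""

    def __init__(self):
        self.items = []

    def insert(self, key):
        self.items.append(key)
        return 1  # one element touched

    def contains(self, key):
        touched = 0
        for item in self.items:  # linear scan
            touched += 1
            if item == key:
                return True, touched
        return False, touched


class SortedArray:
    """Sorted list: cheap reads via binary search, expensive updates."""

    def __init__(self):
        self.items = []

    def insert(self, key):
        pos = bisect.bisect_left(self.items, key)
        shifted = len(self.items) - pos  # elements moved to make room
        self.items.insert(pos, key)
        return shifted + 1

    def contains(self, key):
        lo, hi, touched = 0, len(self.items), 0
        while lo < hi:
            mid = (lo + hi) // 2
            touched += 1
            if self.items[mid] == key:
                return True, touched
            if self.items[mid] < key:
                lo = mid + 1
            else:
                hi = mid
        return False, touched
```

Inserting keys in descending order makes the contrast visible: the log touches one element per insert but scans everything on lookup, while the sorted array shifts every existing element on insert but needs only a handful of probes to find a key.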

April 26, 2021

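Paper Notes: A comparison of Fractal Trees to Log-Structured Merge (LSM) Trees

A Fractal Tree is a data structure that adds buffers to the root and internal nodes of a B+ tree. The paper compares the amplification of Fractal Trees with that of B+ trees and LSM Trees. Three kinds of amplification are discussed: write, read, and space. Write amplification is the amount of data actually written to storage relative to the amount of data the application writes. Read amplification is the number of I/Os needed to execute a query, and space amplification covers the fragmentation and temporary copies of data that the design cannot avoid.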
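The definition of write amplification above can be written as a simple ratio. The function name and the numbers in the usage note (a 100-byte row and a 4096-byte page) are hypothetical values picked for illustration, not figures from the paper.

```python
def write_amplification(app_bytes, storage_bytes):
    """Bytes actually written to storage per byte written by the application."""
    if app_bytes <= 0:
        raise ValueError("app_bytes must be positive")
    return storage_bytes / app_bytes
```

For instance, if an application updates a 100-byte row and the storage engine rewrites a whole 4096-byte page to persist it, `write_amplification(100, 4096)` comes out at roughly 41.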

April 19, 2021

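Paper Notes: Fast Intersection Algorithms for Sorted Sequences

The paper presents Double Binary Search, an algorithm that quickly computes the intersection of sorted sequences. Given two sequences \(D\) and \(Q\) with \(|D| = n\), \(|Q| = m\), and \(n \geq m\), the average and worst-case time complexities are \(\mathcal{O}(m\log(n/m))\) and \(m\), respectively. The algorithm was developed so that Web search engines can quickly intersect large sequences.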
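The following is a minimal sketch of one reading of the idea: take the median of the smaller sequence, binary-search it in the larger one, and recurse on the two halves. It is not the paper's exact pseudocode. The function names are made up for this note, and the inputs are assumed to be strictly increasing Python lists.

```python
from bisect import bisect_left


def intersect(d, q):
    """Return the intersection of two strictly increasing lists, in order."""
    result = []
    _intersect(d, 0, len(d), q, 0, len(q), result)
    return result


def _intersect(a, a_lo, a_hi, b, b_lo, b_hi, out):
    # Always take the median from the smaller range.
    if a_hi - a_lo > b_hi - b_lo:
        a, a_lo, a_hi, b, b_lo, b_hi = b, b_lo, b_hi, a, a_lo, a_hi
    if a_lo >= a_hi:
        return  # the smaller range is empty, so nothing can match

    mid = (a_lo + a_hi) // 2
    pivot = a[mid]
    # Binary search for the pivot inside the larger range.
    pos = bisect_left(b, pivot, b_lo, b_hi)

    # Elements smaller than the pivot on both sides.
    _intersect(a, a_lo, mid, b, b_lo, pos, out)

    if pos < b_hi and b[pos] == pivot:
        out.append(pivot)
        pos += 1  # skip the matched element on the right side

    # Elements larger than the pivot on both sides.
    _intersect(a, mid + 1, a_hi, b, pos, b_hi, out)
```

Because the left half is processed before the pivot and the right half after it, the result comes out sorted without a separate sorting step.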

April 12, 2021

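Paper Notes: The Log-Structured Merge-Tree (LSM-Tree)

The LSM-Tree is an index structure suited to workloads with more inserts and deletes than searches, such as history tables and log storage. It consists of one tree in memory and one or more trees on disk, and recent inserts and deletes are handled by the in-memory tree. When the in-memory tree grows past a threshold, its leaves are moved to a tree on disk. During the move, the leaves of the on-disk tree and the in-memory leaves are sorted in the manner of a merge sort, and the sorted leaves are written to a new contiguous region on disk. Writing to a contiguous region reduces arm movement and disk rotation, which makes inserts and deletes fast. Searches, on the other hand, are slower than with a B-tree, which builds the index as a single tree, because multiple trees have to be searched.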
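The flow described above can be sketched with one in-memory component and one on-disk component. This is a toy model under several assumptions: a `dict` stands in for the in-memory tree, two sorted Python lists stand in for the on-disk leaves, and the names `TinyLSM`, `TOMBSTONE`, and `threshold` are invented for this note.

```python
from bisect import bisect_left

TOMBSTONE = object()  # marker for a deleted key (an assumption of this sketch)


class TinyLSM:
    def __init__(self, threshold=4):
        self.threshold = threshold
        self.memtable = {}   # stands in for the in-memory tree
        self.disk_keys = []  # stands in for the sorted on-disk leaves
        self.disk_vals = []

    def put(self, key, value):
        self.memtable[key] = value
        if len(self.memtable) >= self.threshold:
            self._merge()

    def delete(self, key):
        self.put(key, TOMBSTONE)

    def get(self, key):
        # A search has to consult both components, newest first.
        if key in self.memtable:
            value = self.memtable[key]
            return None if value is TOMBSTONE else value
        i = bisect_left(self.disk_keys, key)
        if i < len(self.disk_keys) and self.disk_keys[i] == key:
            return self.disk_vals[i]
        return None

    def _merge(self):
        # Merge-sort style pass over both sorted inputs into new lists,
        # mimicking a write to a fresh contiguous region.
        mem_keys = sorted(self.memtable)
        new_keys, new_vals = [], []
        i = j = 0
        while i < len(mem_keys) or j < len(self.disk_keys):
            take_mem = j >= len(self.disk_keys) or (
                i < len(mem_keys) and mem_keys[i] <= self.disk_keys[j]
            )
            if take_mem:
                key = mem_keys[i]
                value = self.memtable[key]
                if j < len(self.disk_keys) and self.disk_keys[j] == key:
                    j += 1  # the in-memory entry is newer and wins
                i += 1
            else:
                key, value = self.disk_keys[j], self.disk_vals[j]
                j += 1
            if value is not TOMBSTONE:
                new_keys.append(key)
                new_vals.append(value)
        self.disk_keys, self.disk_vals = new_keys, new_vals
        self.memtable = {}
```

Dropping tombstones during the merge is only safe here because there is a single on-disk component; with several components a deleted key could still exist in an older one.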

April 3, 2021

#Bloom Filter

February 20, 2021


Posts

Paper Notes: Designing Access Methods: The RUM Conjecture

Paper Notes: A comparison of Fractal Trees to Log-Structured Merge (LSM) Trees

Paper Notes: Fast Intersection Algorithms for Sorted Sequences

Paper Notes: The Log-Structured Merge-Tree (LSM-Tree)

Paper Notes: The Design and Implementation of a Log-Structured File System

Paper Notes: ARIES/IM: An Efficient and High Concurrency Index Management Method Using Write-Ahead Logging

Paper Notes: The Ubiquitous B-Tree

Paper Notes: Robust Random Cut Forest Based Anomaly Detection On Streams

Paper Notes: Organization and Maintenance of Large Ordered indexes

Space/Time Trade-offs in Hash Coding with Allowable Errors