Text algorithms by M. Crochemore and W. Rytter

string algorithm book

http://web.njit.edu/~rytter/teaching/texts/book.html 少し古い(1994)文字列アルゴリズムの本。

2007-06-04

Skewed Binary Search Trees

algorithm net

http://www.brics.dk/~gerth/papers/esa06skew.pdf via okamoto7 先生平衡しない二分探索木は、平衡した二分探索木より平均深さが深くて、そのために平均の枝をたどる数が多く、探索により長い時間がかかる、というのが伝統的な見解。この論文は、右の子の…

2007-05-01

もっとも近い点を探す

algorithm ir

離散値（あるいは有限精度の実数値）の高次元空間を考える。サイト集合と問い合わせ点が与えられたとき、問い合わせ点にもっともちかく、サイトのひとつである点を出力せよ。近さはユークリッド距離で定義する。ただし、他の距離で高速な手法があればその…

2007-01-23

Dynamic programming and board games -- A survey

algorithm game net

http://www.citeulike.org/user/float/article/917508

2006-12-20

The k-means range algorithm for personalized data clustering in e-commerce

algorithm net

http://dx.doi.org/10.1016/j.ejor.2005.04.011 range tree というデータ構造を使うと、 K-means clustering が高速にできるというはなしだとおもうけど、読んでません。

2006-09-14

階層型クラスタリングの性能保証

algorithm net

http://www.cse.ucsd.edu/~dasgupta/papers/hier-jcss.ps Algorithmic statistics の Dasgupta 先生らによる。

2006-09-09

Dasgupta, Papadimitriou and Vazirani, "Algorithms"

book algorithm net

http://www.cse.ucsd.edu/~dasgupta/algorithms/ 暗号、線形計画や量子アルゴリズムまで含む教科書。具体例中心という感じなのかな。

2006-08-31

Finding Frequent Items in Data Streams - Charikar, Chen, Farach-Colton (2002)

ir algorithm net

http://citeseer.ist.psu.edu/charikar02finding.html 頻度の近似計算？

2006-06-19

INSERTION SORT is O(n log n) (ResearchIndex)

algorithm net

http://citeseer.ist.psu.edu/bender04insertion.html 未使用領域つき挿入ソートの挿入が、高い確率で O(n log n)

2006-06-01

Li -- Using Sketches to Estimate Two-way and Multi-way Associations - Google Scholar

math algorithm stat net

http://scholar.google.com/scholar?hl=en&lr=&cluster=4804964007952417849 Locality Sensitive Hashing 関連。

2006-06-01

Moses Charikar's Home Page

people math algorithm stat net

http://www.cs.princeton.edu/~moses/ Similarity Estimation Techniques from Rounging Algorithmsなど。近似アルゴリズムの専門家。 Locality Sensitive Hashing は、 2つのオブジェクトのハッシュ値が一致する確率が、 2つのオブジェクトの類似度と同じ…

2006-05-19

Katsurada Labo. Text

math algorithm book net

http://www.math.meiji.ac.jp/~mk/labo/text/ 線形計算とか、関数解析などのノート、というか本に近い。

2006-05-19

Bayesian Hierarchical Clustering (2004, ICML)

lm ir algorithm net

http://citeseer.ist.psu.edu/735303.html ボトムアップクラスタリングの距離評価に、 Bayesian なモデルによる確率を使う。

2006-05-17

SVDPACK

math algorithm nlp net

http://www.netlib.org/svdpack/ 潜在意味解析 Latent Semantic Analysis に使う特異値分解 Singular Value Decomposition のパッケージ。論文相当の User's guide 付き。特異値は非正方行列に対して定義される固有値のようなもので、 A trans(A)の固有値…

2006-05-09

Data Clustering -- A Review - Jain, Murty, Flynn (1999)

nlp algorithm net

http://citeseer.ist.psu.edu/jain99data.html Clustering のサーベイ。

2006-05-07

Parsing Techniques - Second Edition

algorithm book net

http://www.cs.vu.nl/~dick/pt2ed.html （プログラミング言語を中心にした）構文解析の本。

2006-04-14

岡野原さんによる定数時間検索の解説

algorithm net

http://hillbig.cocolog-nifty.com/do/files/2005-12-compind.pdf Suffix Array を使うと、 4NのメモリとO(log N)の時間で検索ができる …というのはもう古い。メモリはCSAでN/2程度、時間はwavelet tree でO(m)

2006-04-12

The Personal Page of Gonzalo Navarro

string people algorithm nlp net

http://www.dcc.uchile.cl/~gnavarro/eindex.html 自然言語の全文字列索引付けとか、あいまい検索とか。

2006-04-07

Complexity Zoo - Qwiki

algorithm net

http://qwiki.caltech.edu/wiki/complexity_zoo 計算量クラスの一覧。 PTAS(Polynomial-Time Approximation Scheme)

2006-04-07

Computer Science Special Course (進化計算)

algorithm net

http://www.iba.k.u-tokyo.ac.jp/~iba/cs/

2006-03-26

Bayesian Methods for Machine Learning

math algorithm stat net

http://www.gatsby.ucl.ac.uk/~zoubin/icml04-tutorial.html ベイズとMCMCのチュートリアル。

2006-03-15

Algorithm Database

math algorithm programming net

http://www-or.amp.i.kyoto-u.ac.jp/algo-eng/db/index.html 最小カット、スケジューリング、Voronoi図などのデモ。

2006-03-10

IBM RD 50-1 | An approximation to the greedy algorithm for differential compression

string algorithm net

http://www.research.ibm.com/journal/rd/501/agarwal.html Differential compression arose as part of the string-to-string correction problem [2], finding the minimum cost of representing one string in terms of another. 問題：差分圧縮：参照フ…

2006-02-02

A View Of The Em Algorithm That Justifies Incremental, Sparse, And Other Variants - Neal, Hinton (1998)

stat algorithm net

http://citeseer.ist.psu.edu/neal98view.html EM アルゴリズム。1997年の長めの論文。EM の高速化手法。 The EM Algorithm -- an Old Folk-song Sung to a Fast New Tune

2006-01-26

MDL site - Reading

algorithm learning net

http://www.mdl-research.org/reading.html 新しいチュートリアルがあった。 NEW: P.Gr?nwald, A Tutorial introduction to the minimum description length principle. In: Advances in Minimum Description Length: Theory and Applications (edited by P.…

2006-01-19

Manindra Agrawal, Neeraj Kayal, Nitin Saxena, "PRIMES is in P."

math algorithm net

http://www.cse.iitk.ac.in/users/manindra/primality_v6.pdf 素数判定が多項式時間。 via Favorite Theorems Recap

2006-01-06

Piotr Indyk

string algorithm geom people ir net

http://theory.lcs.mit.edu/~indyk/ 雑音のあるパターンマッチング。 FFTをツールとして、というのが面白そう。

2005-12-17

岡野原大輔『汎用的データにおける確率的言語モデルの抽出とその応用』

segmentation lm algorithm

非自然言語データの上で確率的言語モデルを構築し、圧縮する。単語分割（WX法）とクラス推定を、教師なし学習により行う。まず、単語の最大長 n を与えた上で、 suffix array の一致数（辞書順ソートでの隣接接尾辞の最長共通接頭辞の長さ）の切り替わりを…

2005-12-06

R. セジウィック, "アルゴリズム第2巻＝探索・文字列・計算幾何"

algorithm book

2005-12-04

Suffix Tree から Sparse Suffix Tree への変換

string algorithm segmentation

Sparse Suffix Tree とは、Krisztian さんの論文[2005-11-02-1]で紹介されている、単語単位 Suffix Tree のようなもの。形式的には、文字列と文字列中でのすべての単語の開始位置（区切り記号の位置 -1）が与えられたとき、単語の開始位置から始まる suff…