Comments on The Geomblog: SODA report: ALENEX...

Hi, A few more points:- from my experience, kd-tr...

2006-01-23T15:32:00.000-07:00

Hi,

A few more points:

- from my experience, kd-trees and related techniques are doing surprisingly well even in high dimensions, if you use those algorithms "beyond" the provable bounds. E.g., you can stop the search after a fixed number of steps, or you can set the approximation factor to a very high value. Amazingly, the actual points reported are often of pretty good quality (e.g., you get the actual nearest neighbor 50% the time). Of course, no guarantees, but in many applications you can live without them.

- there are techniques that one can use if your data is nominally high-dimensional, but "really" the points live on a "low-dimensional manifold". See the following survey for more info.

Posted by Piotr

Ken: well I paraphrased, but that is basically wh...

2006-01-22T15:38:00.000-07:00

Ken:
well I paraphrased, but that is basically what he said :). Essentially he argued that it's the most successful technique, but he did point out that LSH doesn't exist for all metrics, and even important metrics like the edit distance.

Ingo: a good starting point is the web page at
http://web.mit.edu/andoni/www/LSH/index.html

Posted by Suresh

I'd love to learn more about locality-sensitive ha...

2006-01-22T13:25:00.000-07:00

I'd love to learn more about locality-sensitive hashing! My exposition to hashing was from cryptography, which is the anti-thesis of locality sensitivity ;-) Did David mention anything in particular that would be good introductory reading? What is state-of-the-art there?

Posted by Ingo

I'm sorry I missed David's talk.-kd-trees are even...

2006-01-21T14:37:00.000-07:00

I'm sorry I missed David's talk.

-kd-trees are even better, if you want the best bounds

You mean "performance" here, right? I'd pretty much agree in that case, for low enough dimension.

-In high dimensions, only locality sensitive hashing can save us.

Well, there are some other techniques that are sometimes helpful. Surely there's nothing that's *always* helpful, is there?

Posted by Ken Clarkson