| Similarity-based Retrieval (cont)
Similarity-based retrieval requires a distance measure dist(x,y), where x and y are two objects (in the database)
dist(x,y) ∈ 0..1,     dist(x,x) = 0,      dist(x,y) = dist(y,x)
 
Note: distance calculation often requires substantial computational effort
 How to restrict solution set to only the "most similar" objects:
 threshold dmax
	  (only objects t such that dist(t,q) ≤ dmax)
 count k
	  (k closest objects (k nearest neighbours))
 
BUT both above methods require knowing distance between query object and all objects in DB
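
The two restrictions can be sketched as a naive linear scan, which makes the cost visible: one distance computation per object in the DB. This is only an illustrative sketch; the names range_query and knn_query, the toy distance abs(x - y), and the sample data are assumptions, not part of the original notes.

```python
import heapq
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


def range_query(db: Sequence[T], q: T, dist: Callable[[T, T], float], dmax: float) -> List[T]:
    """Threshold query: all objects t with dist(t, q) <= dmax."""
    # Computes dist(t, q) for every object in the DB.
    return [t for t in db if dist(t, q) <= dmax]


def knn_query(db: Sequence[T], q: T, dist: Callable[[T, T], float], k: int) -> List[T]:
    """Count query: the k objects closest to q, nearest first."""
    # Also computes dist(t, q) for every object in the DB.
    return heapq.nsmallest(k, db, key=lambda t: dist(t, q))


def dist(x: float, y: float) -> float:
    """Toy distance (assumption) for numbers in 0..1.

    Satisfies dist(x,x) = 0, dist(x,y) = dist(y,x), and stays within 0..1.
    """
    return abs(x - y)


db = [0.10, 0.35, 0.40, 0.80, 0.95]  # hypothetical objects
q = 0.42                             # hypothetical query object

within = range_query(db, q, dist, dmax=0.1)
nearest = knn_query(db, q, dist, k=2)
```

Both functions touch every object in the DB, so when a single distance calculation is expensive, the scan is what dominates the query cost.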
 |