I suppose a combination of methods could work for a similarity search engine, perhaps it wasn't the best choice to base similarity ratings on review analysis. I guess so far it would be more effective to collect data from users and then statistically process it to get more or less reliable results. For example, the
SimilarSiteSearch engine works based on a combination of sources, including user ratings as well as data obtained from the sites themselves.