LATENT SEMANTIC INDEX APPLICATIONS

A problem of big data-sets is relevancy of search results. The best majority of information engines do better when processing small datasets, while losing their relevancy as the data-set becomes increasingly bigger. Fortunately, LSI works in a reverse fashion. The more it knows about a given body of text, the easier it will find similar ones with similar properties. On a LSI based information engine, there could be a facility to store a group of useful results, and then refine the search further by results that are more closely related to previously served queries. It adds a unique relevancy feedback to the engine from the user and this interaction increases the refining process, in which engines return more relevant results to original queries.

For semantic relationships, Latent Semantic Indexing can analyze degrees topical coherence with a body of text. These results correlate very well with readability and comprehension, and therefore LSI would be a good fit with readability metrics on a feedback system for writing instructions. On another end, Spam control. Yes, LSI is very powerful, and is also customizable for this purpose. It could be also much more efficient than traditional keyword-based tools. It could be progressively trained and adjusted on a user’s mailbox. And lets not forget bulletin boards, newsgroups, forums, and anything that shares a common and basic structure type.

Latent Semantic Indexing could be used for structuring data, including all human generated taxonomies. LSI can do a partial structuring out of unstructured data. This would allow archivists to use their time much more efficiently by just doing enhancing, labeling and then correcting LSI-generated categories rather than having to do the indexing of documents themselves. This, combined with data visualization tools currently available could be used with LSI to create more sophisticated and interactive applications for archivists usage.

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Shout it
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter
Google Buzz (aka. Google Reader)

If you enjoyed this post, please consider to leave a comment or subscribe to the feed and get future articles delivered to your feed reader.

Comments

[...] unknown wrote an interesting post today onLATENT SEMANTIC INDEX APPLICATIONS | Technical SEO ConsultantsHere’s a quick excerptOn a LSI based information engine, there could be a facility to store a group of useful results, and then refine the search futher by results that are more closely related to previously served queries. It adss a unique relevancy … [...]

Leave a comment

(required)

(required)