Click here to navigate to the LSI Tool download page.

Generates related keywords from a corpus bundled together into topic areas. 10 keywords are generated per topic. A single text file is uploaded and each line is treated as a separate document.

Baylor University Libraries: LSI Topic Model

Implements the Latent Semantic Index

From Wikipedia “Latent semantic indexing (LSI) is an indexing and retrieval method that uses a mathematical technique called singular value decomposition (SVD) to identify patterns in the relationships between the terms and concepts contained in an unstructured collection of text. LSI is based on the principle that words that are used in the same contexts tend to have similar meanings.”

This Python application was built by the Baylor University Libraries to assist researchers to implement unsupervised topic modelling on 1-line documents, such as Twitter social media.

Baylor Libraries Latent Semantic Index (LSI)

Leave a Reply

Your email address will not be published. Required fields are marked *