This research aimed to develop a tool to group bioassays from PubChem based on experimental parameters extracted from narratives using natural language processing (NLP). The researchers used Latent Semantic Indexing (LSI) to identify topics in over 2000 bioassay narratives from Pubmed abstracts. LSI was able to group assays without supervision but was sensitive to the number of tokens and concepts used, focusing on either species or chemical compounds. While encouraging, additional studies are needed to better control LSI's effectiveness for chemical modeling applications.