References to Papers on LSI


[LSI HOME | EXEC SUMMARY | DEMOS | PAPERS | CONTACT US]

Latent Semantic Indexing (LSI) is a novel information retrieval method developed at Telcordia that improves your ability to find relevant information. Using a powerful and fully automatic statistical algorithms LSI can retrieve relevant documents even when they do not share any words with your query -- concepts replace keywords to improve retrieval.

  • For a quick overview, see the LSI Executive Summary.
  • For more details, see the papers below.
  • And, to try it out go to the LSI search demos page.

    General Theory and Applications.

  • Deerwester, S., Dumais, S. T., Landauer, T. K., Furnas, G. W. and Harshman, R. A. (1990) - no figures, "Indexing by latent semantic analysis." Journal of the Society for Information Science, 41(6), 391-407. --- first technical LSI paper; good background. Click here for the PDF version.
  • Dumais, S. T., Furnas, G. W., Landauer, T. K. and Deerwester, S. (1988), "Using latent semantic analysis to improve information retrieval." In Proceedings of CHI'88: Conference on Human Factors in Computing, New York: ACM, 281-285.
  • Dumais, S. T. (1991), "Improving the retrieval of information from external sources." Behavior Research Methods, Instruments and Computers, 23(2), 229-236. --- enhancements using differential term weighting and relevance feedback
  • Dumais, S. T. and Schmitt, D. G. (1991), "Iterative searching in an online database." In Proceedings of Human Factors Society 35th Annual Meeting , 398-402.
  • Dumais, S. T. and Nielsen, J. (1992), "Automating the assignment of submitted manuscripts to reviewers." In N. Belkin, P. Ingwersen, and A. M. Pejtersen (Eds.), SIGIR'92: Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, pp.233-244. --- using LSI to match reviewers and papers
  • Foltz, P. W. and Dumais, S. T. (1992) - html, "Personalized information delivery: An analysis of information filtering methods." Communications of the ACM, 35(12), 51-60. --- experiment using LSI for information filtering
  • Dumais, S. T. (1993), "LSI meets TREC: A status report." In: D. Harman (Ed.), The First Text REtrieval Conference (TREC1), National Institute of Standards and Technology Special Publication 500-207 , pp. 137-152.
  • Dumais, S. T. (1994), "Latent Semantic Indexing (LSI) and TREC-2." In: D. Harman (Ed.), The Second Text REtrieval Conference (TREC2), National Institute of Standards and Technology Special Publication 500-215 , pp. 105-116.
  • Dumais, S. T. (1995), "Using LSI for information filtering: TREC-3 experiments." In: D. Harman (Ed.), The Third Text REtrieval Conference (TREC3) National Institute of Standards and Technology Special Publication , in press 1995.
  • Berry, M. W., Dumais, S. T., and O'Brien, G. W. (1995). "Using linear algebra for intelligent information retrieval." SIAM Review, 37(4), 1995, 573-595.
  • Caid, W. R., Dumais, S. T. and Gallant, S. I. (1995), "Learned vector space models for information retrieval." Information Processing and Management, 31(3), 419-429.
  • Dumais, S. T. (1996), "Combining evidence for effective information filtering." In AAAI Spring Symposium on Machine Learning and Information Retrieval, Tech Report SS-96-07, AAAI Press, March 1996.
  • Rosenstein, M. and Lochbaum, C. (2000) "Recommending from Content: Preliminary Results from an E-Commerce Experiment." In Proceedings of CHI'00: Conference on Human Factors in Computing, The Hague, The Netherlands: ACM.
  • Chen, C., Stoffel, N., Post, N., Basu, C., Bassu, D. and Behrens, C. (2001) "Telcordia LSI Engine: Implementation and Scalability Issues." In Proceedings of the 11th Int. Workshop on Research Issues in Data Engineering (RIDE 2001): Document Management for Data Intensive Business and Scientific Applications, Heidelberg, Germany, Apr. 1-2, 2001.
  • Bassu, D. and Behrens, C. (2003) "Distributed LSI: Scalable Concept-based Information Retrieval with High Semantic Resolution." In Proceedings of the 3rd SIAM International Conference on Data Mining (Text Mining Workshop), San Francisco, CA, May 3, 2003.
  • Cross-Language Retrieval.

  • Landauer, T. K. and Littman, M. L. (1990) "Fully automatic cross-language document retrieval using latent semantic indexing." In Proceedings of the Sixth Annual Conference of the UW Centre for the New Oxford English Dictionary and Text Research, pp. 31-38. UW Centre for the New OED and Text Research, Waterloo Ontario, October 1990. --- neat application to cross-language retrieval. Click here for the PDF version.
  • Dumais, S. T., Landauer, T. K. and Littman, M. L. (1996) "Automatic cross-linguistic information retrieval using Latent Semantic Indexing." In SIGIR'96 - Workshop on Cross-Linguistic Information Retrieval, pp. 16-23, August 1996. The Viewgraphs for this talk, which include a few new results, are also available. Click here for the PDF version of the viewgraphs.
    An extended version of the paper will be published in G. Gredenstette (Ed.) Cross Language Information Retrieval . Click here for the PDF version of the extended paper.
  • Dumais, S. T., Letsche, T. A., Littman, M. L. and Landauer, T. K. (1997) "Automatic cross-language retrieval using Latent Semantic Indexing." In AAAI Spring Symposuim on Cross-Language Text and Speech Retrieval , March 1997. The Viewgraphs for this talk are also available.
  • M. L. Littman, and G. A. Keim (1997) "Cross-language text retrieval with three Languages". Submitted to NIPS'97.

    AMIT and LSI.

  • Wittenburg, K. and Sigman, E. "Integration of Browsing, Searching, and Filtering in an Applet for Web Information Access." CHI'97 short paper.
  • Modeling Human Memory.

  • Landauer, T. K. and Dumais, S. T. (1977) - html only, "Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction and Representation of Knowledge." Psychological Review, 1997, 104 (2), 211-240.
  • Dumais, S. T. (1997) - pdf format "Using LSI for Information Retrieval, Information Filtering, and Other Things". Talk at Cognitive Technology Workshop, April 4-5, 1997.
  • Patents.

  • Patent : "Computer information retrieval using latent semantic structure". U. S. Patent No. 4,839,853, Jun 13, 1989.
  • Patent : "Computerized cross-language document retrieval using latent semantic indexing". U. S. Patent No. 5,301,109, Apr 5, 1994.
  • Additional LSI papers and sites.

  • Story, R. E. (1996) "An explanation of the effectiveness of latent semantic indexing by means of a Bayesian regression model". Information Processing & Management, 32(03), pp. 329-344.
  • Foltz, P. W. (1990) "Using Latent Semantic Indexing for Information Filtering". In R. B. Allen (Ed.) Proceedings of the Conference on Office Information Systems, Cambridge, MA, 40-47.
  • Tom Landauer's LSA page at Colorado.
  • Mike Berry's LSI page at Tennessee.


  • For more information about LSI, please contact us at: lsi@research.telcordia.com.


    [LSI HOME | EXEC SUMMARY | DEMOS | PAPERS | CONTACT US]

    Copyright:© 1997-2005 Telcordia Technologies, Inc. All Rights reserved.