https://store-images.s-microsoft.com/image/apps.6332.e9c81557-c44c-4e0c-b3c8-b763a1068900.c4645fe5-2a1c-46e9-99fa-4b4c09f89b04.2c7ac694-08fb-4929-bb8d-71b7e1c11d24

Gensim

bCloud LLC

Gensim

bCloud LLC

Version 4.3.3 + Free with Support on Ubuntu 24.04

Gensim is an open-source library for unsupervised machine learning, natural language processing (NLP), and topic modeling. It specializes in efficiently processing large text corpora and extracting semantic meanings. Gensim is widely used for building scalable vector space models, topic models, and document similarity queries.

Features of Gensim:

  • Efficient algorithms for topic modeling, document similarity, and word embeddings.
  • Supports popular models like LDA (Latent Dirichlet Allocation), Word2Vec, Doc2Vec, and FastText.
  • Built for scalability to handle large datasets in-memory and on disk.
  • Fast, memory-efficient algorithms suitable for large text corpora.
  • Works seamlessly with other libraries and tools like Numpy, Scipy, and Pandas.

To check Gensim version: python -c "import gensim; print(gensim.__version__)"

Disclaimer: Gensim is an independent open-source project maintained by the community. It is not affiliated with, endorsed by, or sponsored by any specific corporation or entity. It is provided "as is," without any warranty, express or implied. Users are responsible for understanding and complying with licensing terms, data policies, and regulations while using Gensim.