dsdlt/mongodb-scalable-document-embeddings

Generate embeddings at scale using MongoDB Atlas Stream Processing and MongoDB Atlas Vector Search

/ 100

Experimental

This project helps data engineers or platform teams process vast amounts of unstructured text data, like song lyrics or articles, as it arrives. It takes raw text documents, generates numerical representations (embeddings) that capture their meaning, and stores them in a MongoDB database. This enables powerful semantic search and analysis of the documents.

No commits in the last 6 months.

Use this if you need to continuously process and embed large, streaming volumes of text documents in real-time, making them instantly searchable by meaning.

Not ideal if you only have a small, static set of documents to embed or primarily need simple keyword search rather than semantic understanding.

data-engineering real-time-analytics text-processing semantic-search document-management

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

—

Higher-rated alternatives

typesense/typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch...

Agrover112/awesome-semantic-search

A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.

vectara/react-search

UI widget for adding semantic search to your React UI in just a few lines of code

frutik/awesome-search

Awesome Search - this is all about the (e-commerce, but not only) search and its awesomeness

askaitools/askaitools-community-edition

A cutting-edge search engine project tailored specifically for the AI product

Explore Embedding Tools

All categories Trending Embeddings directory Insights