Vector Database
tags :
Summary #





Weaviate #
URL The AI Native Vector Database
Weaviate is an Open Source vector database. It allows you to store data objects and vector embeddings from your favorite ML-models, and scale seamlessly into billions of data objects.
milvus #
URL Vector database built for scalable similarity search Open Source, highly scalable, and blazing fast.
pinecone #
Transform your business with high-performance AI applications. Pinecone’s vector Database is fully-managed, developer-friendly, and easily scalable.
FAISS #
A library for efficient similarity search and clustering of dense vectors. By Meta
Chroma #
URL the AI-native open-source embedding database github
HeatWave #
Elasticsearch #

Key Features
- Combine text and vector search for optimal relevance and accuracy.
- Elasticsearch includes a complete vector database, supporting text, sparse and dense vectors, and hybrid retrieval.
- Capture meaning, context, and associations with the flexibility to pick embedding models.
- Assign granular role-based access controls with document and field-level security.
- Leverage filters and faceting capabilities for refined vector search.
OpenSearch #
OpenSearch merges classical search, analytics, and vector search into a single solution. Its vector database features enhance AI application development, providing seamless integration of models, vectors, and information for vector, lexical, and hybrid search.
Key Features #
- Vector search for various purposes
- Multimodal, semantic, visual search, and gen AI agents
- Creating product and user embeddings
- Similarity search for data quality operations
- Apache 2.0-licensed vector database
pgvector #
sqlite-vss #
How does it work? #

Use Cases #
At intuit #
AWS re:Invent 2023 - Improve your search with vector capabilities in OpenSearch Service







OCR of Images #
2024-04-16_12-14-06_screenshot.png #

Example of Vector Databases Pinecone Milvus - SCaNN Weaviate Vespa - - FAISS Chroma Qdrant PgVector Redis Annoy
2024-04-16_12-14-11_screenshot.png #

Vector Database A specialized type of database designed to efficiently store and manipulate high-dimensional vector data. Unstructured data 80% of the data out there are unstructured and cannot fit into a relational database. (Gartner) MP4 PDF Non relational databases are not solving the ultimate problem of vector data. - Vector databases come into the picture to solve the problem by efficiently storing and indexing to make query fast.
2024-01-03_10-54-37_screenshot.png #

: - HUMMV - Vectors need a new kind of database Key-Value Jocument Graph Vector T - - - - - E - - - - - - - - L in a - - Pinecone
2024-01-03_10-55-08_screenshot.png #

Examples of Vector Database Unstructured data Embedding model Vector database Large Language Models LLM Embeddings OpenAI Face Hugging Bard OpenAI redis ME cohere Stanford OMeta Alpaca LOG Pinecone Ask questions Create Get relevant documents from db Construct prompts Fmbeddings Query LLM and Ret answers Long lerm memory User query Create Embed histary Embeddings Get response Store in LTM Query LLM: and get answers - Ask questions Coche lodkup Construct prompts, if not in cache Result: storei in Cache Create Embeddings Return Query LLM and get answers CheckCache for similar queries &answers Sers vd by LUM Served by Vector db
2024-01-03_10-55-59_screenshot.png #

Dedicated vector databases Databases that support vector search chroma marqo J OpenSearch I - - ClickHouse vespa drant LanceDB PostgreSQL )> Milvus cassandra elasticsearch redis Weaviate ROCKSET] 8 1 Pinecone SingleStore
2024-01-03_10-37-49_screenshot.png #

Elastic Platform Indices k-NN Search Search Results Embedding (with HNSW) Inference API Query Data - Your Data Embedding Model Legend OPyTorch A. Generation of Embeddings B. Execution of Vector Search Elasticsearch Architecture
2024-01-03_10-54-06_screenshot.png #

Audio Audio model Audio vector embeddings ENTERPRISE lext lexts model lext vector embeddings Similarity Search Videos Videos model Video vector embeddings
2024-05-01_16-36-06_screenshot.png #

Vector database use cases Intuit aims to enhance the search experience by leveraging advanced techniques to identify user intent, rather than relying solely on traditional text-based matching. This approach will enable us to deliver more relevant and personalized search results, ultimatelyimproving the overall userjourney. 14 Generative QA Build retrieval enhanced generative QA systems Entity resolution identity resolution Document discovery processing to retrieve information ent Product matching, entity matching, Finding similar documents embedded and later aws o 2023, L Web Services itsa lates AU relnvent
2024-05-01_16-36-31_screenshot.png #

Generative QA Paved path for building a knowledge base using a RAG model architecture Query Knowledge Base Create Knowledge Base New session Home Use cases QRS tracker Explore with Data Catalog GenAIS Studio Model Daz rivate What data Knon edge oises General public info Addnew What problem are you solving today? Get started by browsing different capabilities offered by Al Marketplace. Generative Al is here now! 0: Examples 4 Capabilities A Limitations Take command of the future with Generative Al- tryi it outt to power up your workflow. 'Explain quantumo computingin Remembers wat user ad May occasionalyg generate smolel lerms" earlerin the conversation incorrectinformation 'Gota an/c creatve icAs fora 210 Aloust usert provde lollon. May occasional produce year old's birthdry" "Hcn dol mrakea HTTP Tranedt to decirer rapprocrate request Unaserptt. (0 vocorectons harmtul instructions based content requests Lintedi Arowedge cl: xord dard events after 2021 D: Finet tuning Fine-tune large languager models Knowledge base Embed k knowledge base for QBA aws o 2023, Amazon Web Services, Inc. or Its affillates. All rights reserved.
2024-05-01_16-36-39_screenshot.png #

Use case and data flow Documents 1 uploaded to Amazon S3 by user Chunks are sent to a Kafka topic Relevant results are 7 sent to LLM along with prompt 4 Mention the path 2 of documents and start the process Vector search 5 ingestor persists the data into OpenSearch Service Optimal 8 sent back response from the LLM to the user is ent Al service breaks 3 down documents into chunks E Users now can query the knowledge base aws 02 2023, Amazon Web Services, Inc. its fillates All hts reserve ed AWS rezlnvent
2024-05-01_16-37-00_screenshot.png #

Architecture diagram Generated response User query User query as embedding Client app T L Amazon OpenSearch Service domain Embedding model Documents from S3 Embedding generation Metadata fields and embeddings Al Service Kafka topic OpenSearch ingestor WS o 2023, zon Web Servic ce Inc. V ed relnvent
2024-05-01_16-37-09_screenshot.png #

Metrics 7 use cases onboarded to production Exact k-NN, approximate k-NN, approximate k-NN with pre-filtering queries are in use No P1, P2 incidents after launch 46 use cases in pipeline nvent 2 customers are experimenting with product quantization compression aws 02023, Amazont Web Services, Inc. lts affillates. AUD nu reserved. AWS rezlnvent
2024-05-01_16-37-44_screenshot.png #

Learnings from adopting Amazon OpenSearch Service as a vector store Leverage monitoring capabilities of OpenSearch Scoring disparity during hybrid search Service - Support higher dimensions on Leverage data operations such as reindexing, backup, index Lucene engine Higher latency when documents management scale to billions aws D2023, Amazo Web Services, Inc tes ed AWS re:Invent
2024-05-01_16-37-56_screenshot.png #

Desired features from OpenSearch Service Easier configuration to enable different compression techniques Reduce memory footprint by storing subset ofthe vectors to the disk Support model-serving framework from OpenSearch open Ivent source project relnvent
OCR of Images #
2024-04-16_12-14-06_screenshot.png #

Example of Vector Databases Pinecone Milvus - SCaNN Weaviate Vespa - - FAISS Chroma Qdrant PgVector Redis Annoy
2024-04-16_12-14-11_screenshot.png #

Vector Database A specialized type of database designed to efficiently store and manipulate high-dimensional vector data. Unstructured data 80% of the data out there are unstructured and cannot fit into a relational database. (Gartner) MP4 PDF Non relational databases are not solving the ultimate problem of vector data. - Vector databases come into the picture to solve the problem by efficiently storing and indexing to make query fast.
2024-01-03_10-54-37_screenshot.png #

: - HUMMV - Vectors need a new kind of database Key-Value Jocument Graph Vector T - - - - - E - - - - - - - - L in a - - Pinecone
2024-01-03_10-55-08_screenshot.png #

Examples of Vector Database Unstructured data Embedding model Vector database Large Language Models LLM Embeddings OpenAI Face Hugging Bard OpenAI redis ME cohere Stanford OMeta Alpaca LOG Pinecone Ask questions Create Get relevant documents from db Construct prompts Fmbeddings Query LLM and Ret answers Long lerm memory User query Create Embed histary Embeddings Get response Store in LTM Query LLM: and get answers - Ask questions Coche lodkup Construct prompts, if not in cache Result: storei in Cache Create Embeddings Return Query LLM and get answers CheckCache for similar queries &answers Sers vd by LUM Served by Vector db
2024-01-03_10-55-59_screenshot.png #

Dedicated vector databases Databases that support vector search chroma marqo J OpenSearch I - - ClickHouse vespa drant LanceDB PostgreSQL )> Milvus cassandra elasticsearch redis Weaviate ROCKSET] 8 1 Pinecone SingleStore
2024-01-03_10-37-49_screenshot.png #

Elastic Platform Indices k-NN Search Search Results Embedding (with HNSW) Inference API Query Data - Your Data Embedding Model Legend OPyTorch A. Generation of Embeddings B. Execution of Vector Search Elasticsearch Architecture
2024-01-03_10-54-06_screenshot.png #

Audio Audio model Audio vector embeddings ENTERPRISE lext lexts model lext vector embeddings Similarity Search Videos Videos model Video vector embeddings
2024-05-01_16-36-06_screenshot.png #

Vector database use cases Intuit aims to enhance the search experience by leveraging advanced techniques to identify user intent, rather than relying solely on traditional text-based matching. This approach will enable us to deliver more relevant and personalized search results, ultimatelyimproving the overall userjourney. 14 Generative QA Build retrieval enhanced generative QA systems Entity resolution identity resolution Document discovery processing to retrieve information ent Product matching, entity matching, Finding similar documents embedded and later aws o 2023, L Web Services itsa lates AU relnvent
2024-05-01_16-36-31_screenshot.png #

Generative QA Paved path for building a knowledge base using a RAG model architecture Query Knowledge Base Create Knowledge Base New session Home Use cases QRS tracker Explore with Data Catalog GenAIS Studio Model Daz rivate What data Knon edge oises General public info Addnew What problem are you solving today? Get started by browsing different capabilities offered by Al Marketplace. Generative Al is here now! 0: Examples 4 Capabilities A Limitations Take command of the future with Generative Al- tryi it outt to power up your workflow. 'Explain quantumo computingin Remembers wat user ad May occasionalyg generate smolel lerms" earlerin the conversation incorrectinformation 'Gota an/c creatve icAs fora 210 Aloust usert provde lollon. May occasional produce year old's birthdry" "Hcn dol mrakea HTTP Tranedt to decirer rapprocrate request Unaserptt. (0 vocorectons harmtul instructions based content requests Lintedi Arowedge cl: xord dard events after 2021 D: Finet tuning Fine-tune large languager models Knowledge base Embed k knowledge base for QBA aws o 2023, Amazon Web Services, Inc. or Its affillates. All rights reserved.
2024-05-01_16-36-39_screenshot.png #

Use case and data flow Documents 1 uploaded to Amazon S3 by user Chunks are sent to a Kafka topic Relevant results are 7 sent to LLM along with prompt 4 Mention the path 2 of documents and start the process Vector search 5 ingestor persists the data into OpenSearch Service Optimal 8 sent back response from the LLM to the user is ent Al service breaks 3 down documents into chunks E Users now can query the knowledge base aws 02 2023, Amazon Web Services, Inc. its fillates All hts reserve ed AWS rezlnvent
2024-05-01_16-37-00_screenshot.png #

Architecture diagram Generated response User query User query as embedding Client app T L Amazon OpenSearch Service domain Embedding model Documents from S3 Embedding generation Metadata fields and embeddings Al Service Kafka topic OpenSearch ingestor WS o 2023, zon Web Servic ce Inc. V ed relnvent
2024-05-01_16-37-09_screenshot.png #

Metrics 7 use cases onboarded to production Exact k-NN, approximate k-NN, approximate k-NN with pre-filtering queries are in use No P1, P2 incidents after launch 46 use cases in pipeline nvent 2 customers are experimenting with product quantization compression aws 02023, Amazont Web Services, Inc. lts affillates. AUD nu reserved. AWS rezlnvent
2024-05-01_16-37-44_screenshot.png #

Learnings from adopting Amazon OpenSearch Service as a vector store Leverage monitoring capabilities of OpenSearch Scoring disparity during hybrid search Service - Support higher dimensions on Leverage data operations such as reindexing, backup, index Lucene engine Higher latency when documents management scale to billions aws D2023, Amazo Web Services, Inc tes ed AWS re:Invent
2024-05-01_16-37-56_screenshot.png #

Desired features from OpenSearch Service Easier configuration to enable different compression techniques Reduce memory footprint by storing subset ofthe vectors to the disk Support model-serving framework from OpenSearch open Ivent source project relnvent
OCR of Images #
2024-04-16_12-14-06_screenshot.png #

Example of Vector Databases Pinecone Milvus - SCaNN Weaviate Vespa - - FAISS Chroma Qdrant PgVector Redis Annoy
2024-04-16_12-14-11_screenshot.png #

Vector Database A specialized type of database designed to efficiently store and manipulate high-dimensional vector data. Unstructured data 80% of the data out there are unstructured and cannot fit into a relational database. (Gartner) MP4 PDF Non relational databases are not solving the ultimate problem of vector data. - Vector databases come into the picture to solve the problem by efficiently storing and indexing to make query fast.
2024-01-03_10-54-37_screenshot.png #

: - HUMMV - Vectors need a new kind of database Key-Value Jocument Graph Vector T - - - - - E - - - - - - - - L in a - - Pinecone
2024-01-03_10-55-08_screenshot.png #

Examples of Vector Database Unstructured data Embedding model Vector database Large Language Models LLM Embeddings OpenAI Face Hugging Bard OpenAI redis ME cohere Stanford OMeta Alpaca LOG Pinecone Ask questions Create Get relevant documents from db Construct prompts Fmbeddings Query LLM and Ret answers Long lerm memory User query Create Embed histary Embeddings Get response Store in LTM Query LLM: and get answers - Ask questions Coche lodkup Construct prompts, if not in cache Result: storei in Cache Create Embeddings Return Query LLM and get answers CheckCache for similar queries &answers Sers vd by LUM Served by Vector db
2024-01-03_10-55-59_screenshot.png #

Dedicated vector databases Databases that support vector search chroma marqo J OpenSearch I - - ClickHouse vespa drant LanceDB PostgreSQL )> Milvus cassandra elasticsearch redis Weaviate ROCKSET] 8 1 Pinecone SingleStore
2024-01-03_10-37-49_screenshot.png #

Elastic Platform Indices k-NN Search Search Results Embedding (with HNSW) Inference API Query Data - Your Data Embedding Model Legend OPyTorch A. Generation of Embeddings B. Execution of Vector Search Elasticsearch Architecture
2024-01-03_10-54-06_screenshot.png #

Audio Audio model Audio vector embeddings ENTERPRISE lext lexts model lext vector embeddings Similarity Search Videos Videos model Video vector embeddings
2024-05-01_16-36-06_screenshot.png #

Vector database use cases Intuit aims to enhance the search experience by leveraging advanced techniques to identify user intent, rather than relying solely on traditional text-based matching. This approach will enable us to deliver more relevant and personalized search results, ultimatelyimproving the overall userjourney. 14 Generative QA Build retrieval enhanced generative QA systems Entity resolution identity resolution Document discovery processing to retrieve information ent Product matching, entity matching, Finding similar documents embedded and later aws o 2023, L Web Services itsa lates AU relnvent
2024-05-01_16-36-31_screenshot.png #

Generative QA Paved path for building a knowledge base using a RAG model architecture Query Knowledge Base Create Knowledge Base New session Home Use cases QRS tracker Explore with Data Catalog GenAIS Studio Model Daz rivate What data Knon edge oises General public info Addnew What problem are you solving today? Get started by browsing different capabilities offered by Al Marketplace. Generative Al is here now! 0: Examples 4 Capabilities A Limitations Take command of the future with Generative Al- tryi it outt to power up your workflow. 'Explain quantumo computingin Remembers wat user ad May occasionalyg generate smolel lerms" earlerin the conversation incorrectinformation 'Gota an/c creatve icAs fora 210 Aloust usert provde lollon. May occasional produce year old's birthdry" "Hcn dol mrakea HTTP Tranedt to decirer rapprocrate request Unaserptt. (0 vocorectons harmtul instructions based content requests Lintedi Arowedge cl: xord dard events after 2021 D: Finet tuning Fine-tune large languager models Knowledge base Embed k knowledge base for QBA aws o 2023, Amazon Web Services, Inc. or Its affillates. All rights reserved.
2024-05-01_16-36-39_screenshot.png #

Use case and data flow Documents 1 uploaded to Amazon S3 by user Chunks are sent to a Kafka topic Relevant results are 7 sent to LLM along with prompt 4 Mention the path 2 of documents and start the process Vector search 5 ingestor persists the data into OpenSearch Service Optimal 8 sent back response from the LLM to the user is ent Al service breaks 3 down documents into chunks E Users now can query the knowledge base aws 02 2023, Amazon Web Services, Inc. its fillates All hts reserve ed AWS rezlnvent
2024-05-01_16-37-00_screenshot.png #

Architecture diagram Generated response User query User query as embedding Client app T L Amazon OpenSearch Service domain Embedding model Documents from S3 Embedding generation Metadata fields and embeddings Al Service Kafka topic OpenSearch ingestor WS o 2023, zon Web Servic ce Inc. V ed relnvent
2024-05-01_16-37-09_screenshot.png #

Metrics 7 use cases onboarded to production Exact k-NN, approximate k-NN, approximate k-NN with pre-filtering queries are in use No P1, P2 incidents after launch 46 use cases in pipeline nvent 2 customers are experimenting with product quantization compression aws 02023, Amazont Web Services, Inc. lts affillates. AUD nu reserved. AWS rezlnvent
2024-05-01_16-37-44_screenshot.png #

Learnings from adopting Amazon OpenSearch Service as a vector store Leverage monitoring capabilities of OpenSearch Scoring disparity during hybrid search Service - Support higher dimensions on Leverage data operations such as reindexing, backup, index Lucene engine Higher latency when documents management scale to billions aws D2023, Amazo Web Services, Inc tes ed AWS re:Invent
2024-05-01_16-37-56_screenshot.png #

Desired features from OpenSearch Service Easier configuration to enable different compression techniques Reduce memory footprint by storing subset ofthe vectors to the disk Support model-serving framework from OpenSearch open Ivent source project relnvent