midroid / VisionRAG Public

forked from dame-cell/VisionRAG

Notifications You must be signed in to change notification settings
Fork 0
Star 0

A new novel multi-modality (Vision) RAG architecture

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
colpali.jpeg		colpali.jpeg

Repository files navigation

VisionRAG

VisionRAG is an implementation of MULTI-MODALITY-RAG which uses the new novel approach ColPali: Efficient Document Retrieval with Vision Language Models

Overview of ColPali Paper:

Direct embedding of document screenshots
No need for OCR or complex preprocessing
Handles multi-modal content (text, images, charts, tables)
Streamlined retrieval and ranking process
Built on ColPali 2's efficient embedding technique

This project aims to demonstrate how visual-based embedding can simplify and enhance RAG systems, making them more versatile and easier to implement for a wide range of document types.

About

A new novel multi-modality (Vision) RAG architecture

Readme

MIT license

Activity

0 stars

0 watching

0 forks

Report repository

Releases

No releases published

Packages

No packages published

Languages

Jupyter Notebook 100.0%

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

VisionRAG

Overview of ColPali Paper:

About

Uh oh!

Releases

Packages

Languages

License

midroid/VisionRAG

Folders and files

Latest commit

History

Repository files navigation

VisionRAG

Overview of ColPali Paper:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages