Exploring ColBERT with RAGatouille
Date : 2024-01-27
Description
This summary was drafted with mixtral-8x7b-instruct-v0.1.Q5_K_M.gguf
Simon Willison delves into the workings of ColBERT, a retrieval model designed for scalable BERT-based search over extensive text collections. He explains how ColBERT differs from regular embedding models and how it provides more information than traditional embedding search by showing which words in the document are most relevant. Willison then proceeds to use RAGatouille, a library that makes working with ColBERT easier, to create an index of his blog's content. He also demonstrates querying the index and implementing a basic question-answering mechanism using LLM. The article further explores re-ranking queries without building an index first.
Read article here
Recently on :
Artificial Intelligence
Information Processing | Computing
WEB - 2024-12-30
Fine-tune ModernBERT for text classification using synthetic data
David Berenstein explains how to finetune a ModernBERT model for text classification on a synthetic dataset generated from argi...
WEB - 2024-12-25
Fine-tune classifier with ModernBERT in 2025
In this blog post Philipp Schmid explains how to fine-tune ModernBERT, a refreshed version of BERT models, with 8192 token cont...
WEB - 2024-12-18
MordernBERT, finally a replacement for BERT
6 years after the release of BERT, answer.ai introduce ModernBERT, bringing modern model optimizations to encoder-only models a...
PITTI - 2024-09-19
A bubble in AI?
Bubble or true technological revolution? While the path forward isn't without obstacles, the value being created by AI extends ...
PITTI - 2024-09-08
Artificial Intelligence : what everyone can agree on
Artificial Intelligence is a divisive subject that sparks numerous debates about both its potential and its limitations. Howeve...