
Shift happens: we compared 5 methods to detect drift in ML embeddings
Date : 2023-05-17
Presentation
Monitoring embedding drift is relevant for the production use of LLM and NLP models. EvidentlyAI ran experiments to compare different drift detection methods. They implemented them in an open-source library and recommend model-based drift detection as a good default.
What you’ll find in the blog:
- Experiment design. We created artificial shifts on three datasets and chose five embedding drift detection methods to test.
- Comparison of drift detectors. We summarize how they work and react to varying data changes. The goal is to help shape the intuition of the behavior of different methods.
- Colab notebooks with all the code. You can repeat the comparisons on your data by introducing artificial shifts with the same approach as we did.
- Open-source library to detect embedding drift. We implemented our findings in Evidently, an open-source Python library to evaluate, test and monitor ML models.
Read blog post here
Evaluation of Sports Performance: Cognitive Biases, Vectors an...
What Tech tells us about corporate culture
Recently on :
Artificial Intelligence
Research
WEB - 2025-11-13
Measuring political bias in Claude
Anthropic gives insights into their evaluation methods to measure political bias in models.
WEB - 2025-10-09
Defining and evaluating political bias in LLMs
OpenAI created a political bias evaluation that mirrors real-world usage to stress-test their models’ ability to remain objecti...
WEB - 2025-07-23
Preventing Woke AI In Federal Government
Citing concerns that ideological agendas like Diversity, Equity, and Inclusion (DEI) are compromising accuracy, this executive ...
WEB - 2025-07-10
America’s AI Action Plan
To win the global race for technological dominance, the US outlined a bold national strategy for unleashing innovation, buildin...
WEB - 2024-12-30
Fine-tune ModernBERT for text classification using synthetic data
David Berenstein explains how to finetune a ModernBERT model for text classification on a synthetic dataset generated from argi...