Stella Biderman's Directory of LLMs
Date : 2023-05-31
Description
Stella Biderman shares resources where she documents key features of LLMs.
- The LLM directory linked here and at the bottom of the page provides a detailed chronology of the evolution of large language models from GPT-2's introduction in February 2019 to the present. The list includes models such as Megatron-BERT, T5, Meena, and GPT-3, among others, and records their organizations, accessibility, and language support.
- The Common LLM Settings spreadsheet documents a wide variety of hyperparameters on a per-model basis and provides her recommendations for setting up an LLM architecture.
Find the directory here