PITTI - Article - Textbooks Are All You Need

Textbooks Are All You Need

Artificial Intelligence,Research

Date : 2023-06-20

Abstract

We introduce phi-1, a new large language model for code, with significantly smaller size than competing models: phi-1 is a Transformer-based model with 1.3B parameters, trained for 4 days on 8 A100s, using a selection of ``textbook quality" data from the web (6B tokens) and synthetically generated textbooks and exercises with GPT-3.5 (1B tokens). Despite this small scale, phi-1 attains pass@1 accuracy 50.6% on HumanEval and 55.5% on MBPP. It also displays surprising emergent properties compared to phi-1-base, our model before our finetuning stage on a dataset of coding exercises, and phi-1-small, a smaller model with 350M parameters trained with the same pipeline as phi-1 that still achieves 45% on HumanEval.

Paper from Microsoft Research available here

How hard does Art need to be ?

Evaluation of Sports Performance: Cognitive Biases, Vectors an...

Recently on :

Artificial Intelligence

Research

WEB - 2025-11-13

Measuring political bias in Claude

Anthropic gives insights into their evaluation methods to measure political bias in models.

WEB - 2025-10-09

Defining and evaluating political bias in LLMs

OpenAI created a political bias evaluation that mirrors real-world usage to stress-test their models’ ability to remain objecti...

WEB - 2025-07-23

Preventing Woke AI In Federal Government

Citing concerns that ideological agendas like Diversity, Equity, and Inclusion (DEI) are compromising accuracy, this executive ...

WEB - 2025-07-10

America’s AI Action Plan

To win the global race for technological dominance, the US outlined a bold national strategy for unleashing innovation, buildin...

WEB - 2024-12-30

Fine-tune ModernBERT for text classification using synthetic data

David Berenstein explains how to finetune a ModernBERT model for text classification on a synthetic dataset generated from argi...

more articles on
-
Artificial Intelligence

We care about your privacy so we do not store nor use any cookie unless it is stricly necessary to make the website to work

Got it

Learn more