PITTI - Articles

On the Dwarkesh Podcast, former chief economist of the IMF Ken Rogoff explains why China is on the brink of a debt crisis. But is his argument really about China?

Existential Economic Questions for Generations X, Y and Z

2025-02-02

Economy | Economics,

Social Science | Society

Generations X, Y and Z face a number of existential economic questions for the rest of the century. From the looming demographic crisis in developed nations to the precarious state of sovereign debt, and the escalating economic warfare between the US and China over AI and space technology, here is a sobering picture ...

Fine-tune ModernBERT for text classification using synthetic data

2024-12-30

Artificial Intelligence,

Information Processing | Computing

David Berenstein explains how to fine-tune a ModernBERT model for text classification on a synthetic dataset generated with argilla's synthetic-data-generator.

Fine-tune classifier with ModernBERT in 2025

2024-12-25

Artificial Intelligence,

Information Processing | Computing

In this blog post, Philipp Schmid explains how to fine-tune ModernBERT, a refreshed version of the BERT models with an 8192-token context length, to classify user prompts and implement an intelligent LLM router.

ModernBERT, finally a replacement for BERT

2024-12-18

Artificial Intelligence,

Information Processing | Computing,

Research

Six years after the release of BERT, answer.ai introduces ModernBERT, bringing modern model optimizations to encoder-only models and representing a major Pareto improvement over older encoders.

What happens when a "normal" person tries to wrap their head around one of humanity's most elusive concepts? This article is as much about overcoming imposter syndrome as it is about the metaphysics of time.

A bubble in AI?

2024-09-19

Artificial Intelligence,

Business

Bubble or true technological revolution? While the path forward isn't without obstacles, the value being created by AI extends far beyond speculative investments, touching every sector of the economy and fundamentally altering the nature of work itself.

China Is Rapidly Becoming a Leading Innovator in Advanced Industries

2024-09-16

Regulations | Policy,

Business

There may be no more important question for the West’s competitive position in advanced industries than whether China is becoming a rival innovator. While the evidence suggests it hasn’t yet taken the overall lead, it has pulled ahead in certain areas, and in many others Chinese firms will likely equal or surpass Western firm...

Artificial Intelligence : what everyone can agree on

2024-09-08

Business,

Artificial Intelligence

Artificial Intelligence is a divisive subject that sparks numerous debates about both its potential and its limitations. However, there's one aspect everyone can agree on: current commercial practices demonstrate that the sector is still far from maturity.

TikTok is urging users to call Congress about a looming ban | The Verge

2024-03-07

Regulations | Policy,

Design | Culture,

Social Science | Society

The Verge - TikTok, owned by Chinese company ByteDance, sent users in the US a push notification on Wednesday, warning that 'Congress is planning a total ban of TikTok' that would 'strip 170 million Americans of their Constitutional right to free expression.'

Nvidia bans using translation layers for CUDA software | Tom's Hardware

2024-03-04

Artificial Intelligence,

Information Processing | Computing

Tom's Hardware - Nvidia has banned running CUDA-based software on other hardware platforms using translation layers in its licensing terms, which previously wasn't included in the documentation placed on a host system during the installation process. The restriction appears to be designed to prevent initiatives like ZLUDA and...

A Quantum Trick Implied Eternal Stability. Now the Idea May Be Falling Apart | Quantamagazine

2024-02-26

Physics

Quantamagazine - Physicists have discovered that quantum phenomena may not be as fragile as previously thought, with certain quantum systems appearing to remain stable for eons. However, recent research suggests that these systems may not be eternal after all, challenging the foundations of thermodynamics and statistical mech...

Groq Inference Tokenomics: Speed, But At What Cost? | Semianalysis

2024-02-21

Artificial Intelligence,

Infrastructure,

Information Processing | Computing

Semianalysis - Groq, an AI hardware startup, has been making waves with their impressive demos showcasing Mistral Mixtral 8x7b on their inference API. They are achieving up to 4x the throughput of other inference services while also charging less than 1/3 that of Mistral themselves. However, speed is only one part of the equa...

Retell AI : conversational speech engine

2024-02-21

Artificial Intelligence

Retell tackles the challenge of real-time conversations with voice AI.

Let's build the GPT Tokenizer

2024-02-20

Artificial Intelligence,

Information Processing | Computing

VIDEO | Andrej Karpathy builds from scratch the tokenizer used in OpenAI's GPT series, showing that many weird behaviors and problems of LLMs trace back to tokenization and discussing why tokenization is at fault.

An Empirical Evaluation of LLMs for Solving Offensive Security Challenges

2024-02-19

Artificial Intelligence,

Research,

Security | Surveillance | Privacy

Minghao Shao, Boyuan Chen, Sofija Jancheska, Brendan Dolan-Gavitt, Siddharth Garg, Ramesh Karri and Muhammad Shafique found that Large Language Models (LLMs) achieved a higher success rate than an average human participant in solving Capture The Flag (CTF) challenges. This research highlights the potential of LLMs in cyberse...

My benchmark for large language models

2024-02-19

Artificial Intelligence

Nicholas Carlini introduces his new benchmark for large language models, which includes nearly 100 tests extracted from actual conversation history with various LLMs.

Major-TOM Datasets: Core-S2L2A

2024-02-19

Data Visualization,

Datasets

The Major-TOM Core-S2L2A dataset provides global coverage of Sentinel-2 (Level 2A) patches, each of size 1,068 x 1,068 pixels. The dataset includes 2,245,886 patches and over 2.564 trillion pixels.

Nexus - 3D semantic graph of hacker interests

2024-02-17

Information Processing | Computing,

Data Visualization

Freeman Jiang introduces Nexus, a 3D data visualization of hacker interests.

SPAR: Personalized Content-Based Recommendation via Long Engagement Attention

2024-02-16

Artificial Intelligence,

Research,

Information Processing | Computing

Chiyu Zhang, Yifei Sun, Jun Chen, Jie Lei, Muhammad Abdul-Mageed, Sinong Wang, Rong Jin, Sem Park, Ning Yao and Bo Long use pretrained language models (PLMs) to encode user histories and candidate items, treating the task as textual semantic matching. SPAR addresses long user engagement history and insufficient user-item i...

GRIT : Generative Representational Instruction Tuning

2024-02-15

Research,

Information Processing | Computing

Niklas Muennighoff, Hongjin Su, Liang Wang, Nan Yang, Furu Wei, Tao Yu, Amanpreet Singh and Douwe Kiela introduce generative representational instruction tuning (GRIT) whereby a large language model is trained to handle both generative and embedding tasks by distinguishing between them through instructions.

New study suggests the Atlantic overturning circulation AMOC “is on tipping course”

2024-02-09

Environment

In RealClimate, Stefan reports on a new study published in Science Advances suggesting that the Atlantic overturning circulation (AMOC) is on a 'tipping course' and could collapse if the northern Atlantic Ocean is diluted with freshwater, reducing its salinity and density. The study confirms past concerns that climate models sy...

The CHIPS Act treats the symptoms, but not the causes

2024-02-07

Artificial Intelligence,

Infrastructure,

Regulations | Policy

Cory Doctorow examines the CHIPS Act and its implications for America's high-tech industry. He argues that while the Act may address some of the issues caused by monopolies in the sector, it fails to tackle the underlying problems.

Inside the Underground Site Where ‘Neural Networks’ Churn Out Fake IDs

2024-02-05

Artificial Intelligence,

Security | Surveillance | Privacy

In 404Media, Joseph Cox reports that an underground website, OnlyFake, is using 'neural networks' to generate realistic-looking fake IDs for just $15. This technology threatens to streamline everything from bank fraud to money laundering and has implications for cybersecurity.

What Your Brain Is Doing When You’re Not Doing Anything | Quantamagazine

2024-02-05

Biology,

Health | Healthcare

Quantamagazine - The discovery of the default mode network, a collection of seemingly unrelated areas of the brain that activate when you’re not doing much at all, has offered insights into how the brain functions outside of well-defined tasks. It is active during rest, when we turn mentally inward, and uses more energy than ...