PITTI

Explore
Articles
Projects
Blogs
en

MENU
X
Explore
Articles
Projects
Blogs
English

Copyright © All rights reserved

a
@PITTI_DATA
@PITTI_FI
@SorarePITTI

We care about your privacy so we do not store nor use any cookie unless it is stricly necessary to make the website to work

Got it

Learn more

Safe RLHF: Safe Reinforcement Learning from Human Feedback

Artificial Intelligence

Josef Dai, Xuehai Pan, Ruiyang Sun, Jiaming Ji, Xinbo Xu, Mickel Liu, Yizhou Wang and Yaodong Yang propose Safe RLHF, a novel algorithm for human value alignment in large language models (LLMs). Experimental results show improved mitigation of harmful responses and enhanced model performance.

Brain decoding: toward real-time reconstruction of visual perception

Artificial Intelligence,

Brain-Computer Interface

Yohann Benchetrit, Hubert Banville and Jean-Rémi King propose a new approach for real-time brain activity decoding using magnetoencephalography (MEG) in their paper 'Brain decoding: toward real-time reconstruction of visual perception.

One Year Into Musk’s Ownership, X (Twitter) Down By Every Measure | Similarweb

Similarweb - Estimates of web and app engagement on X (Twitter) trending down for everything... except Elon Musk’s profile

SmoothLLM: Defending LLMs Against Jailbreaking Attacks

Artificial Intelligence,

Security | Surveillance | Privacy

Alex Robey, Eric Wong, Hamed Hassani and George J. Pappas discuss the history of adversarial attacks on language models and introduces SmoothLLM, a randomized defense for LLMs against jailbreaking attacks.

Fuyu-8B: A Multimodal Architecture for AI Agents

Artificial Intelligence,

Information Processing | Computing

Adept AI released Fuyu-8B, a multimodal model designed for digital agents with a simpler architecture and faster response time. It performs well on standard image understanding benchmarks such as visual question-answering and natural-image-captioning.

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Artificial Intelligence,

Information Processing | Computing

Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi introduce Self-Reflective Retrieval-Augmented Generation (Self-RAG), a new framework that enhances an LM's quality and factuality through retrieval and self-reflection. Self-RAG trains a single arbitrary LM to adaptively retrieve passages on-demand, gener...

Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model

Artificial Intelligence,

Haikang Deng and Colin Raffel introduce Reward-Augmented Decoding (RAD), a text generation procedure that uses a small unidirectional reward model to encourage language models to generate text with certain properties. RAD matches the performance of state-of-the-art methods that involve re-training the language model while inc...

State of AI Report 2023

Artificial Intelligence

The 2023 State of AI Report by Nathan Benaich highlights the advancements and implications of Large Language Models (LLMs) on research, industry dynamics, and geopolitics. The competition for compute power has led to new fault lines in community norms around openness, with Meta AI emerging as the champion of open(ish) AI

Large Language Models Are Zero-Shot Time Series Forecasters

Artificial Intelligence,

Information Processing | Computing

Nate Gruver, Marc Finzi, Shikai Qiu and Andrew Gordon Wilson find that large language models (LLMs) such as GPT-3 and LLaMA-2 can surprisingly zero-shot extrapolate time series at a level comparable to or exceeding the performance of purpose-built time series models trained on the downstream tasks

Beyond Memorization: Violating Privacy Via Inference with Large Language Models

Artificial Intelligence,

Information Processing | Computing,

Security | Surveillance | Privacy

Robin Staab, Mark Vero, Mislav Balunović and Martin Vechev explore the potential of large language models (LLMs) to infer personal attributes from text, finding that current LLMs can achieve high accuracy at a fraction of the cost and time required by humans. They also discuss the emergence of privacy-invasive chatbots and th...

Kaggle AI Report 2023

Information Processing | Computing,

Artificial Intelligence

Essays and insights from the world's largest data science and machine learning community

Novo Nordisk stops Ozempic kidney trial after early signs of success | Reuters

Health | Healthcare

Reuters - Novo Nordisk stops a trial studying Ozempic to treat kidney failure in diabetes patients ahead of schedule because it was clear from an interim analysis that the treatment would succeed.

Unknown and Unknowable

Artificial Intelligence,

On his blog, Lewis Enterprises, Hunter describes the art of investing in the Unknown and Unknowable. And how AI-powered asset management would fare in this segment.

Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading

Artificial Intelligence,

Information Processing | Computing,

Howard Chen, Ramakanth Pasunuru, Jason Weston and Asli Celikyilmaz introduce MemWalker, a method that first processes the long context into a tree of summary nodes. Upon receiving a query, the model navigates this tree in search of relevant information, and responds once it gathers sufficient information.

Could Earth be the only planet with intelligent life? | Big Think

Space | Astrophysics,

DNA | Evolution

Big Think - Ethan Siegel explains what the science has to say about the possibility that planet Earth is home to the only instance of intelligent life in the entire universe.

China's First 28nm Lithography Tool to Be Delivered This Year | Tom's Hardware

Information Processing | Computing

Tom's Hardware - Chinese plans to deliver scanners capable of producing chips on a 28nm-class fabrication process by the end of the year.

LLaVa : Improved Baselines with Visual Instruction Tuning

Artificial Intelligence,

Information Processing | Computing,

Haotian Liu, Chunyuan Li, Yuheng Li and Yong Jae Lee show that the fully-connected vision-language cross-modal connector in LLaVA is surprisingly powerful and data-efficient.

DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

Artificial Intelligence,

Information Processing | Computing

Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia and Christopher Potts introduce DSPy, a programming model that optimizes large language models (LMs) pipelines by abstracting them as text...

Decoding speech perception from non-invasive brain recordings | Nature

Artificial Intelligence,

Brain-Computer Interface,

Nature - Alexandre Défossez, Charlotte Caucheteux, Jérémy Rapin, Ori Kabeli and Jean-Rémi King introduce a model trained with contrastive learning to decode self-supervised representations of perceived speech from the non-invasive recordings of a large cohort of healthy individuals.

UK to examine Amazon and Microsoft's cloud dominance | Reuters

Infrastructure,

Regulations | Policy

Reuters - Britain's media regulator asked the country's antitrust authority to investigate U.S. tech giants Amazon and Microsoft's dominance of the UK cloud market.

6 Crucial Steps in Semiconductor Manufacturing

Artificial Intelligence,

Infrastructure,

Information Processing | Computing

Microchips, the small but powerful components in digital devices, are made through a complex process involving several key steps. This article discusses six of them: deposition, photoresist coating, lithography, etch, ion implantation, and packaging. Each step plays a crucial role in determining the chip's final functionality

Love-GPT: How “single ladies” looking for your data upped their game with ChatGPT

Artificial Intelligence,

Security | Surveillance | Privacy

Avast's Threat Intelligence Team look under the hood of a tool designed to scam users on dating apps.

Towards Monosemanticity: Decomposing Language Models With Dictionary Learning

Artificial Intelligence,

Anthropic use a weak dictionary learning algorithm called a sparse autoencoder to generate learned features from a trained model that offer a more monosemantic unit of analysis than the model's neurons themselves.

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Artificial Intelligence,

Google-Deepmind and partners from 33 academic labs have pooled data from 22 different robot types to create the Open X-Embodiment dataset and RT-X model

SlimTrainer and Adalite

Artificial Intelligence,

Information Processing | Computing

Joshua Pritsker explores optimization techniques to allow for full parameter 16-bit finetuning of language models up to 7B on a single 24GB GPU