Berkeley

Items tagged with “Berkeley”.

Apr 11, 2025 bair.berkeley.edu

Defending Against Prompt Injection with StruQ and SecAlign

Overview of StruQ and SecAlign defenses to mitigate prompt injection in LLM-powered apps, with Secure Front-End concepts and evaluation results.

Berkeley LLM Research

Apr 11, 2025 bair.berkeley.edu

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks against them. Prompt injection attack is listed as the #1 threat by OWASP to LLM-integrated applications, where an LLM input contains a trusted prompt (ins

Berkeley LLM

Apr 08, 2025 bair.berkeley.edu

PLAID: Repurposing Protein Folding Models for Latent-Diffusion Generated Multimodal Proteins

PLAID enables simultaneous generation of protein sequences and 3D structures by sampling the latent space of folding models, leveraging large sequence databases and diffusion on embeddings.

Berkeley Diffusion

Apr 08, 2025 bair.berkeley.edu

PLAID: Multimodal protein generation via latent diffusion

PLAID jointly generates protein 1D sequences and 3D structures by learning the latent space of protein folding models. It enables function- and organism-guided prompts and decodes structure with frozen folding-model weights.

Berkeley Diffusion Training

Mar 25, 2025 bair.berkeley.edu

Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment

Berkeley researchers deployed 100 RL-controlled vehicles on a live highway to dampen stop-and-go waves, improving traffic flow and cutting energy use for all drivers.

Berkeley RL

Mar 25, 2025 bair.berkeley.edu

Scaling RL for Traffic Smoothing: 100-AV Highway Deployment

100 RL-controlled cars deployed on I-24 during rush hour to dampen stop-and-go waves, improve throughput, and reduce fuel use for all road users. Decentralized controllers rely on basic radar sensors and local observations.

Berkeley RL Training

Nov 12, 2024 bair.berkeley.edu

Anthology: Conditioning LLMs with Rich Backstories to Create Virtual Personas

Anthology conditions language models on richly detailed backstories to simulate representative, consistent, and diverse virtual personas for surveys and social science research.

Berkeley LLM Privacy

Nov 12, 2024 bair.berkeley.edu

Anthology: Conditioning LLMs with Rich Backstories to Create Virtual Personas

A method to steer LLMs toward representative, consistent virtual personas by generating naturalistic backstories and using them as conditioning context, enabling individualized simulations and scalable user studies.

Berkeley LLM

Sep 20, 2024 bair.berkeley.edu

Linguistic Bias in ChatGPT: How Models Reinforce Dialect Discrimination

A Berkeley AI study finds ChatGPT favors Standard American English, shows poorer comprehension and more stereotyping for non‑standard English varieties, and can amplify dialect discrimination in GPT‑3.5 and GPT‑4.

Berkeley

Sep 20, 2024 bair.berkeley.edu

Linguistic Bias in ChatGPT: Dialect Discrimination Across English Varieties

Analysis of how ChatGPT responds to different English dialects, highlighting biases against non-standard varieties and implications for global users.

Berkeley Open Source Research

Aug 28, 2024 bair.berkeley.edu

How StrongREJECT Improves Jailbreak Evaluation for Frontier LLMs

StrongREJECT advances jailbreak evaluation by pairing a high-quality forbidden-prompt dataset with automated evaluators aligned to human judgments, delivering more reliable measurements of jailbreak effectiveness against frontier LLMs.

Berkeley LLM Benchmark

Aug 28, 2024 bair.berkeley.edu

StrongREJECT: A robust benchmark for evaluating jailbreak methods in LLMs

Overview of a high-quality jailbreak benchmark with dual automated evaluators, a 313-prompt dataset, and findings that many jailbreaks underperform claims from earlier work.

Berkeley LLM Benchmark

Jul 20, 2024 bair.berkeley.edu

Visual Haystacks benchmark exposes limits of multi-image reasoning in LMMs

A new MIQA benchmark tests Large Multimodal Models on visual retrieval and reasoning across 1–10K images, revealing key limitations and introducing MIRAGE, a single-stage approach to scale LMMs.

Berkeley Benchmark Open Source

Jul 20, 2024 bair.berkeley.edu

Visual Haystacks (VHs): Benchmark for Visual Multi-Image Reasoning

Benchmark for long-context visual reasoning across large, uncorrelated image sets; introduces MIRAGE to extend LMMs beyond single-image VQA.

Berkeley Benchmark Open Source

May 29, 2024 bair.berkeley.edu

TinyAgent: Enabling Function Calling and Edge Agent Workflows with Small Language Models

TinyAgent shows small language models can be fine-tuned for reliable function calling and edge deployment, using curated synthetic data, an LLMCompiler planner, and a Tool RAG approach to power private, low-latency agentic workflows.

Berkeley RAG Inference

May 29, 2024 bair.berkeley.edu

TinyAgent: Edge Function Calling for Small Language Models

A study demonstrating how small language models can perform accurate function calling at the edge using a curated data pipeline, an LLMCompiler-based planner, and on-device execution with macOS integrations.

Berkeley Inference Privacy

Mar 21, 2024 bair.berkeley.edu

Modeling Extremely Large Images End-to-End with xT: Nested Tokenization and Long-Context Vision

xT enables end-to-end modeling of gigapixel-scale images on modern GPUs using nested tokenization, region encoders, and long-context vision, delivering high fidelity and context on images up to 29,000×25,000 pixels.

Berkeley Transformers GPU

Mar 21, 2024 bair.berkeley.edu

xT: End-to-end Modeling of Extremely Large Images on GPUs

End-to-end modeling of extremely large images on contemporary GPUs via nested tokenization and region/context encoders, delivering richer context with lower memory footprints.

Berkeley Transformers GPU

Mar 11, 2024 bair.berkeley.edu

2024 BAIR Graduate Directory: Profiles of Berkeley AI PhD Graduates

Overview of BAIR Lab's 2024 AI PhD graduates, their research areas, advisors, and contact links, with profiles, research blurbs, and URLs for recruiting and collaboration.

Berkeley LLM NLP

Mar 11, 2024 bair.berkeley.edu

BAIR 2024 Graduate Directory – PhD Profiles & Contact Info

Directory of BAIR Lab PhD graduates featuring research interests, advisor(s), and contact details to facilitate collaboration and recruitment.

Berkeley NLP CV