Own your
Artificial Intelligence

We have designed a new chip that dramatically lowers the barrier to create frontier artificial intelligence

a simpler chip

−1 · 0 · +1

for the densest possible intelligence

The Problem

Today's silicon runs AI at 4–32 bits
Most of them are wasted

FP32

32 bit

FP16 / BF16

16 bit

INT8

8 bit

NVFP4

4 bit

1-BIT

1 bit

No silicon today natively runs in 1-bit
So we built one.

The Chip

1-bit native silicon
Training and Inference

−10+1

Per-MAC silicon

Traditional MAC

Multiplier · Carry-save tree · Adder

Ternary MAC

Mux · Adder

A chip for the era of 1-bit neural networks

Formats

W1A4
W1.58A4
W1.58A6

Workload

Training
Inference

Software

PyTorch

Status

Live on FPGA

How it works

model =

"meta-llama/Llama-3.1-8B"

# or any torch.nn.Module

Pick a model

point to a hugging-face model id, or bring your own pytorch module

baud.compile(model)

Compile

our compiler converts your model and weights to a format our hardware understands

model.generate()

model.train()

Accelerate training and inference

get accelerated training, rl fine-tuning and inference on our hardware

Live Now

Deployed on FPGAs

Our architecture is running on FPGAs today. Accessible to developers right now.

Models

Up to 32B parameters

Pretraining

Supported

RL fine-tuning

Supported

Inference

Supported

ASIC Deployment

Architecture verified on GlobalFoundries 12nm process, first silicon in progress

Models

1T+ parameters

Pretraining

Accelerated

RL fine-tuning

Accelerated

Inference

Accelerated

Availability

Coming soon

Book Capacity

Join us to

Work on something exciting

Open Roles Say Hello

Own yourArtificial Intelligence

Today's silicon runs AI at 4–32 bitsMost of them are wasted

1-bit native siliconTraining and Inference

Deployed on FPGAs

ASIC Deployment

Work on something exciting

Own your
Artificial Intelligence

Today's silicon runs AI at 4–32 bits
Most of them are wasted

1-bit native silicon
Training and Inference