Alloy (Auto Model Routing) - Factory Documentation

Alloy (Auto Model Routing) is currently in Research Preview.

Alloy (Auto Model Routing) automatically chooses the best model for each Droid task. Instead of manually picking a single model for every session, Alloy evaluates the task and routes work to the model that balances quality, latency, and cost. Alloy is available in the Factory CLI and App and works with the same enterprise controls as other models.

Why use Alloy?

Alloy is designed for teams that want strong results without manually changing model selection for every workflow.

Best performance at lower cost

In Factory evaluations, Alloy maintained frontier-level performance on challenging benchmarks while reducing costs roughly 20–25% compared with always using top-tier models.

Improved model selection

Droid can route routine steps to faster or lower-cost models and reserve stronger models for work that needs deeper reasoning.

Enterprise-ready controls

Alloy respects the same org, project, and user-level model controls used across the Factory platform.

Model Options

Alloy routes to one of the following models. The same pricing applies depending on which model Alloy routes the request to.

Model	Multiplier
Claude Opus 4.7	2×
Kimi K2.6	0.4×
MiniMax M2.7	0.12×

Availability

Alloy is available in the Factory CLI and App model selector for select customers as Alloy (Research Preview).

Enterprise controls

Enterprise admins can manage Alloy through the same model governance system used for other Factory-supported models, including allowing or restricting Alloy at the organization level like any other model. For more information on enterprise control of models, see Models, LLM Gateways & Integrations.

FAQ

How does model routing work?

Alloy is a per-task router that uses a mix of session and per-request routing. It strongly considers prompt cache maintenance and savings in its optimization.

Are we able to configure it?

Alloy can be enabled/disabled like any other model today. We also offer Enterprise Controls for Alloy routing guidance.

Do we have benchmarks?

Yes. Alloy maintains frontier-level performance on challenging benchmarks like Terminal Bench 2 and Legacy Bench, and provides the best balance of quality, latency, and cost.

Do we have references on cost/performance?

Alloy maintains frontier-level performance while reducing costs roughly 20–25% compared with always using top-tier models.

Quickstart

Droid Exec (Headless)

Documentation Index

​Why use Alloy?

Best performance at lower cost

Improved model selection

Enterprise-ready controls

​Model Options

​Availability

​Enterprise controls

​FAQ

​How does model routing work?

​Are we able to configure it?

​Do we have benchmarks?

​Do we have references on cost/performance?

Why use Alloy?

Model Options

Availability

Enterprise controls

FAQ

How does model routing work?

Are we able to configure it?

Do we have benchmarks?

Do we have references on cost/performance?