Factory's latest news
Factory is bringing autonomy to software engineering. We’re shaping the future of technology, transforming how software is built, tested, and delivered.
Read how Factory’s customers are using Droids to accelerate software development.
Start building
Engineering
Agent Native Development

With a score of 58.75%, Droid sets the new state-of-the-art on Terminal-Bench. Agent design, not just choice of model, is the decisive factor as we achieve leading performance on every model.
Learn more
New
Engineering

How we built a closed-loop system for recursive self-improvement—where the agent detects its own failures and implements fixes automatically.
Learn more
New
Product

A framework for measuring and improving how well your codebase supports autonomous development. Evaluate repositories across eight technical pillars and five maturity levels.
Learn more
Engineering
Research

Droid analyzed 780,000 of its own web searches. It wrote the queries, built the pipeline, and generated the visualizations.
Learn more
Engineering
Research

We built an evaluation framework to measure how much context different compression strategies preserve. After testing three approaches on real-world, long-running agent sessions spanning debugging, code review, and feature implementation, we found that structured summarization retains more useful information than alternatives from OpenAI and Anthropic.
Learn more
Engineering
Agent Native Development

Agents write the code; linters write the law. We’re moving from “developers writing code with AI” to “developers orchestrating agents that build software.” The old guardrails (code review, conventions, tribal memory) aren’t enough. Agents need clear rules they can concretely verify. They need lint rules, not suggestions.
Learn more
Engineering

Large language models have limited context windows of approximately 1 million tokens. In contrast, a typical enterprise monorepo can span thousands of files and several million tokens.
Learn more
Case Study
Groq leverages Factory's model-agnostic software development agents to accelerate day-0 launches
Learn more