Agent Readiness Report

Codex

TypeScript

openai/codex

Pass Rate

53%

OpenAI Codex CLI tool

L1100%

L2100%

L365%

L40%

L50%

Strong Build System

Codex reaches Level 3 with 75% Build System pass rate. Currently becoming autonomous-capable with 30/57 criteria passing (53%). Key areas for improvement include the opportunities listed below.

Strengths

Formatter

Prettier configured (.prettierrc.toml) for TypeScript, rustfmt configured (rustfmt.toml) for Rust

Lint Config

ESLint configured for TypeScript SDK/MCP (eslint.config.js), Clippy configured for Rust (clippy.toml with custom rules)

Naming Consistency

Clippy has disallowed-methods for naming conventions, ESLint has @typescript-eslint rules for naming

Opportunities

Cyclomatic Complexity

Add complexity analysis to identify and refactor overly complex functions.

Feature Flag Infrastructure

Add feature flags to enable safer deployments and gradual rollouts.

Integration Tests Exist

Add integration tests to verify component interactions and catch issues unit tests miss.

All Criteria

Style & Validation5/11 (45%)

—code_modularization—/—Skipped: Both applications are relatively small and focused. Rust's module system provides natural boundaries.

✗cyclomatic_complexity0/2No explicit cyclomatic complexity analysis configured in CI or linter configs

✗dead_code_detection1/2Clippy detects unused code in Rust (enabled by default), but no dead code detection for TypeScript (no knip, ts-prune, or unimported)

✗duplicate_code_detection0/2No duplicate code detection tools found (no jscpd, SonarQube, or similar)

✓formatter2/2Prettier configured (.prettierrc.toml) for TypeScript, rustfmt configured (rustfmt.toml) for Rust

✗large_file_detection0/1No large file detection tools found (no git hooks, CI checks, LFS config, or linter rules for file size)

✓lint_config2/2ESLint configured for TypeScript SDK/MCP (eslint.config.js), Clippy configured for Rust (clippy.toml with custom rules)

—n_plus_one_detection—/—Skipped: Applications do not have database/ORM usage requiring N+1 query detection

✓naming_consistency2/2Clippy has disallowed-methods for naming conventions, ESLint has @typescript-eslint rules for naming

✗pre_commit_hooks0/2No pre-commit hooks found (no husky, lint-staged, or .pre-commit-config.yaml)

✓strict_typing2/2TypeScript has strict mode enabled, Rust is strongly typed by default

✗tech_debt_tracking0/1No tech debt tracking found (no TODO/FIXME scanner in CI, no linter rules enforcing issue links, no SonarQube)

✓type_check2/2TypeScript has strict: true in tsconfig.json, Rust has compiler type checking by default

Build System9/12 (75%)

✗agentic_development0/1No evidence of agent co-authorship in recent 100 commits (no factory-droid, Claude, or other agent signatures)

—automated_pr_review—/—Skipped: gh CLI is available but would require checking recent PRs for automated review comments, which is time-intensive for OSS evaluation

✓build_cmd_doc1/1README documents build commands: npm install -g @openai/codex or brew install --cask codex, AGENTS.md has just fmt and cargo commands

—build_performance_tracking—/—Skipped: While sccache is used for build caching, no explicit build duration metrics or optimization tracking found

—dead_feature_flag_detection—/—Skipped: Prerequisite feature_flag_infrastructure failed (no feature flag system exists)

✓deployment_frequency1/1Frequent deployments: 5+ releases on Jan 22, 2026 alone (rust-v0.89.0-alpha.1 through alpha.5), multiple releases daily

✓deps_pinned1/1Dependencies pinned: pnpm-lock.yaml and Cargo.lock are committed to repository

✓fast_ci_feedback1/1CI feedback is fast: most checks complete in 0-9 minutes (analyzed from recent merged PRs), well under 10-minute threshold

✗feature_flag_infrastructure0/1No feature flag infrastructure found (no LaunchDarkly, Statsig, Unleash, or GrowthBook configs). Statsig is only used for metrics via OpenTelemetry.

—heavy_dependency_detection—/—Skipped: CLI applications (non-bundled). Bundle size analysis not applicable to backend services or CLI tools.

✓monorepo_tooling1/1Monorepo tooling configured: pnpm workspaces (pnpm-workspace.yaml) for TypeScript, Cargo workspace for Rust (codex-rs/Cargo.toml)

—progressive_rollout—/—Skipped: Not applicable for CLI application (not an infrastructure/service deployment)

✓release_automation1/1Release automation via rust-release.yml workflow: automated builds, signing, GitHub release creation, and npm publishing on tag push

✓release_notes_automation1/1git-cliff is configured (cliff.toml) and used in rust-release.yml workflow for automated changelog generation

—rollback_automation—/—Skipped: Not applicable for CLI application (not an infrastructure/service deployment)

✓single_command_setup1/1README documents single command setup: npm install -g @openai/codex followed by codex, or brew install --cask codex

✗unused_dependencies_detection1/2cargo-shear configured in CI (rust-ci.yml) for Rust dependencies, but no depcheck or similar for TypeScript packages

✓vcs_cli_tools1/1GitHub CLI (gh) is installed and authenticated successfully (verified with gh auth status)

—version_drift_detection—/—Skipped: No version drift detection tools found (no syncpack or manypkg for monorepo). Not critical for this monorepo structure.

Testing3/6 (50%)

—flaky_test_detection—/—Skipped: No evidence of flaky test management (no test retry configs, no BuildPulse, no quarantine mechanisms)

✗integration_tests_exist1/2Rust has integration tests (login_tests, chatgpt_tests, app-server tests), but no integration tests found for TypeScript apps

✗test_coverage_thresholds0/2No coverage thresholds enforced: jest.config.cjs has no coverageThreshold, no pytest --cov-fail-under, no CI coverage gates

✓test_isolation2/2Tests run in isolation: Jest runs parallel by default, Rust cargo test and nextest run tests in parallel

✓test_naming_conventions2/2Test naming enforced: Jest testMatch pattern for TypeScript (**/tests/**/*.test.ts), Rust follows *_test.rs and tests/ conventions

✗test_performance_tracking0/2No test performance tracking found (no timing output in CI, no test analytics platforms, no --durations flags in configs)

✓unit_tests_exist2/2Unit tests exist: TypeScript SDK/MCP have tests/**/*.test.ts files, Rust has extensive tests in codex-rs/**/tests/

—unit_tests_runnable—/—Skipped: OSS evaluation mode - requires fully configured dev environment to verify test execution

Documentation3/7 (43%)

✓agents_md1/1AGENTS.md exists at repository root with Rust/codex-rs instructions, formatting rules, test commands, and TUI conventions

✗agents_md_validation0/1No AGENTS.md validation in CI (no automated checks that commands still work, no doc testing, no link checking)

—api_schema_docs—/—Skipped: No API schema files found (no OpenAPI/Swagger/GraphQL schemas). Applications are CLI tools, not API services.

✗automated_doc_generation0/1No automated documentation generation found (no API doc generators, no changelog automation visible in workflows)

✓documentation_freshness1/1Documentation is fresh: AGENTS.md was modified within last 180 days (git log shows recent updates)

✓readme1/1README.md exists with installation instructions, quickstart guide, and documentation links

✗service_flow_documented0/1No architecture documentation found (no .mermaid, .puml files, no docs/architecture or docs/diagrams directories)

✗skills0/1No skills directory found (.factory/skills/, .skills/, or .claude/skills/ do not exist)

Dev Environment1/2 (50%)

—database_schema—/—Skipped: Applications do not use databases (no Prisma schema, TypeORM entities, SQLAlchemy models, or SQL schemas)

✓devcontainer1/1.devcontainer/devcontainer.json exists with Rust configuration, rust-analyzer and TOML extensions

—devcontainer_runnable—/—Skipped: devcontainer CLI not installed, cannot verify container builds and runs successfully

✗env_template0/1No environment template found (no .env.example, .env.template, or env.example file)

—local_services_setup—/—Skipped: No external service dependencies requiring docker-compose.yml or local setup documentation

Debugging & Observability2/7 (29%)

✗alerting_configured0/2No alerting infrastructure found (no PagerDuty, OpsGenie, or custom alerting rules in code or configs)

—circuit_breakers—/—Skipped: Applications do not have external service dependencies requiring circuit breaker patterns

—code_quality_metrics—/—Skipped: OSS evaluation mode - requires admin API access to check code scanning analyses and coverage bots

✗deployment_observability0/2No deployment observability documentation (no dashboard links in docs, no deploy notification integrations evident)

✓distributed_tracing2/2OpenTelemetry configured for distributed tracing (codex-rs/otel/ with trace_exporter, X-Request-ID propagation evident)

✗error_tracking_contextualized1/2Sentry configured in Rust (codex-feedback crate uses sentry crate 0.46), but no error tracking for TypeScript packages

—health_checks—/—Skipped: CLI applications do not require health check endpoints (not deployed services with load balancers)

✓metrics_collection2/2Metrics collection via OpenTelemetry configured with Statsig as metrics exporter (STATSIG_OTLP_HTTP_ENDPOINT in otel config)

—profiling_instrumentation—/—Skipped: Performance profiling not applicable for CLI applications (no APM tools, continuous profiling, or flame graph generation)

✗runbooks_documented0/1No runbooks documentation found (no references to Notion, Confluence, wiki, or runbooks/ directory in README/AGENTS.md/docs)

✗structured_logging1/2Rust uses tracing crate for structured logging (configured in multiple crates), but TypeScript packages have no logging library

Security4/6 (67%)

—automated_security_review—/—Skipped: OSS evaluation mode - requires admin API access to check code-scanning/alerts for SAST tools

✓branch_protection1/1Branch protection via rulesets: 'alpha', 'CI must pass to merge to main', and 'Rust Release' rulesets exist (gh api rulesets)

✗codeowners0/1No CODEOWNERS file found in repository root or .github/ directory

—dast_scanning—/—Skipped: Applications are not web services requiring DAST scanning (CLI tools, not deployed web applications)

✓dependency_update_automation1/1Dependabot configured (.github/dependabot.yaml) for cargo, github-actions, devcontainers, docker, rust-toolchain, and bun

✓gitignore_comprehensive1/1.gitignore properly excludes .env* (not .env.example), node_modules, build artifacts, IDE configs, and OS files (.DS_Store)

✗log_scrubbing1/2Rust tracing crate has filtering/redaction capabilities configured, but TypeScript packages lack log sanitization

—pii_handling—/—Skipped: Applications do not process personal data requiring PII detection/handling (developer tools, no user data collection)

—privacy_compliance—/—Skipped: CLI application without end-user data collection (no consent management, no GDPR/CCPA handling needed)

—secret_scanning—/—Skipped: OSS evaluation mode - gh api secret-scanning/alerts returned 404 (not enabled or access denied)

✓secrets_management1/1.env* files properly gitignored (not .env.example), GitHub Actions uses secrets.* references, no hardcoded secrets found

Task Discovery3/4 (75%)

✓backlog_health1/1Excellent backlog health: 100% of 50 open issues have labels, 0 issues older than 365 days

✗issue_labeling_system0/1Limited labeling system: only 'bug' label found from standard categories, missing priority (P0-P3), type (feature/chore), and area labels

✓issue_templates1/1.github/ISSUE_TEMPLATE/ directory exists with structured templates (bug-report, feature-request, docs-issue, vs-code-extension)

✓pr_templates1/1.github/pull_request_template.md exists with sections for description and testing

Product & Analytics0/2 (0%)

✗error_to_insight_pipeline0/2No error-to-insight pipeline found (no Sentry-GitHub integration, no automated issue creation from errors)

✗product_analytics_instrumentation1/2Statsig metrics instrumentation via OpenTelemetry (used for telemetry), but no full product analytics like Mixpanel/Amplitude

Codex

Strong Build System

Strengths

Opportunities

All Criteria

Ready to build the software of the future?