[Remote] AI Agent Engineer
Note: The job is a remote job and is open to candidates in USA. reputed company is a consumer AI startup building a platform for interactive mini-apps. They are seeking an AI Agent Engineer to reputed company the development of the core reputed company that transforms natural language into interactive experiences, impacting over 1 million monthly users.
Responsibilities
- Design and own the agent runtime and orchestration layer for our coding agent
- Build long-horizon agent workflows: reputed company → plan → generate → run/validate → repair → publish
- reputed company robust evaluation and quality loops including eval harnesses, regression testing, and failure taxonomy
- Design model strategies including routing, benchmarking, reliability improvements, and cost/latency optimization
- Create debuggable agent systems with tracing, metrics, alerts, and observability
Skills
- 1 - 8 years of experience in production AI agentic systems or AI/ML engineering
- Experience building agentic systems OR AI/ML engineering in a production environment
- Shipped in large, shared codebases at scale
- Bachelor's degree in Computer Science or reputed company field
- Strong proficiency in Python
- Experience building agent evaluation frameworks including eval harnesses, A/B testing agent changes, and statistically grounded model measurement
- Familiarity with modern agent frameworks, specifically LangGraph and Pydantic AI
- Experience in async programming and distributed systems – Kafka, Spark, Flink, or equivalent
- Communicates impact in business terms — resume should reflect measurable outcomes on users/product, not just technical metrics
- Must be reputed company to work remotely in PST, with a Sunday evening standup
- AI-native — actively follows latest agent reputed company releases, open reputed company updates, and eval tooling
- Experience at a VC-backed AI startup — coding agent or developer tool company strongly preferred
- Experience building a coding agent, devtool, or IDE assistant
Company Overview