Senior Software Engineer, Agentic AI
About this role
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
We are looking for an experienced Software Engineer to develop our core libraries for Agentic Applications. Are you excited by the prospect of building foundational technology that powers the next generation of agents? Have you ever dreamed of elevating engineering practices to enable the safe and scalable deployment of autonomous systems? As a global leader in AI and deep learning, NVIDIA is redefining industries, and our work in agentic computing is at the forefront of this revolution. Join us in building the scalable agentic capabilities, reusable building blocks, and high-quality libraries that will accelerate developer productivity, ensure agent quality, and provide critical acceleration and optimization for highly performant and efficient agents. Work directly with a community of creative engineers and collaborate on projects that push the boundaries of what's possible in the world of autonomous agents!
What you'll be doing:
• Track and understand evolving agent development patterns across NVIDIA and the broader ecosystem, maintaining current knowledge of both research and commercial products.
• Develop open-source libraries and tools that accelerate and optimize agent harnesses and frameworks, ensuring top-tier performance, accuracy, quality, and stability.
• Benchmark the latest agents to identify bottlenecks and build creative solutions to increase performance, reduce cost, and improve latency.
• Work closely with teams building high-performance data pipelines, RAG systems, vector databases, and GPU-optimized training and inference workflows to deliver best-in-class agentic applications.
• Identify gaps and friction in current agent architectures, and translate insights into agentic tools that boost developer velocity and agent quality, backed by evaluations, benchmarking, and feedback loops.
What we need to see:
• Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Applied Math, or a related field, or equivalent experience.
• 8+ years of experience in at least one of Rust, Python, Go, or Node.js, with working familiarity in at least one more.
• Solid understanding of asynchronous programming, callbacks, request lifecycles, and event-driven systems.
• Hands-on experience with evolving agent architectures, multiple agent frameworks, and agent harnesses.
• Proficiency in LLM applications, agent workflows, tool calls, and model-provider APIs.
• Ability to design or extend cross-language APIs with attention to consistency, usability, stability, and backwards compatibility.
• Systems-level debugging and performance intuition: able to trace execution from high-level API calls through runtime internals, language bindings, callbacks, serialization, and event emission to understand overhead and optimize hot paths.
• Strong interpersonal skills, enabling direct contact and effective collaboration with the open source community.
Ways to stand out from the crowd:
• Experience building evaluation/benchmarking systems for agent workflows (metrics, regression, feedback loops).
• Rust systems work, especially async Rust, Tokio, serde, API design, or runtime state management, and/or Python native extension experience with PyO3, maturin, or Python/Rust bindings.
• Instrumenting third-party frameworks without changing user-visible behavior. Knowledge of OpenTelemetry, tracing, structured events, exporters, or observability pipelines is a plus.
• Experience with middleware, plugin systems, guardrails, policy engines, or request/response interception, and with maintaining open-source libraries, SDKs, or internal developer platforms.
• Profiling or optimizing runtime/library overhead across language boundaries, async execution, native bindings, serialization, tracing, or middleware pipelines.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until May 10, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.