AI Research Engineer - Reinforcement Learning

Jobgether
📍 UK 📅 Posted April 20, 2026

About this role

Accountabilities:

• Design and implement advanced reinforcement learning algorithms to improve decision-making, policy optimization, and system performance across simulated and real-world environments

• Run controlled experiments, track performance metrics, evaluate outcomes against benchmarks, and iterate on model improvements through empirical analysis

• Develop and curate high-quality simulation environments and training datasets aligned with domain-specific requirements and learning objectives

• Debug and optimize RL pipelines, addressing challenges such as exploration strategy, reward stability, sample efficiency, and training convergence

• Collaborate with engineering and research teams to integrate RL agents into production systems and ensure measurable real-world performance gains

• Define evaluation frameworks and continuously monitor deployed systems to support robustness, scalability, and domain adaptation

Requirements:

• Advanced degree in Computer Science, Machine Learning, or a related field; PhD preferred, with a strong academic research background and publications at top-tier conferences

• Proven experience running large-scale reinforcement learning projects, including modern online RL techniques such as policy optimization methods and actor-critic frameworks

• Deep understanding of reinforcement learning theory and practice, including policy gradients, exploration-exploitation trade-offs, and optimization strategies for stability and efficiency

• Strong hands-on expertise with PyTorch and RL frameworks, including building full pipelines from simulation to training and deployment

• Demonstrated ability to solve complex RL challenges such as sample inefficiency, reward noise, and training instability through empirical and algorithmic innovation

• Strong analytical mindset with the ability to design robust experiments, interpret results, and continuously improve model performance

Benefits:

• Fully remote work environment with global team collaboration

• Opportunity to work on cutting-edge AI and reinforcement learning research at scale

• High-impact role influencing production-level AI systems and real-world applications

• Competitive compensation aligned with experience and expertise

• Exposure to advanced research, multimodal AI systems, and state-of-the-art infrastructure

• Flexible working culture supporting autonomy and innovation

How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

This listing was aggregated by Perik.ai from Jobgether’s public job board.