Perik.ai See who’s hiring. Apply before everyone else.
← Back to all jobs

AI Specialist (AI Engineering)

Hyphenconnect
📍 San Francisco Bay Area, USA 📅 Posted April 24, 2026
Apply on Hyphenconnect’s website →

About this role

We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.

Responsibilities:

• Compress and optimize large language and vision models for on-device inference.

• Develop pipelines for model distillation and hardware-specific compilation.

• Benchmark performance across various NPU/GPU architectures.

Qualifications:

• Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.

• Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.

• Strong C++ and Python skills.

This listing was aggregated by Perik.ai from Hyphenconnect’s public job board. Click the button above to view the full job description and apply directly.
Explore more jobs
More from Hyphenconnect Browse all AI & tech jobs

Perik.ai is an AI & tech job board that aggregates the latest openings from top companies — updated daily so you can apply before everyone else.

About FAQ Privacy Policy Terms of Service Contact