AI Specialist (AI Engineering)
Hyphenconnect
Apply on Hyphenconnect’s website →
About this role
We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.
Responsibilities:
• Compress and optimize large language and vision models for on-device inference.
• Develop pipelines for model distillation and hardware-specific compilation.
• Benchmark performance across various NPU/GPU architectures.
Qualifications:
• Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
• Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
• Strong C++ and Python skills.
This listing was aggregated by Perik.ai from Hyphenconnect’s public job board.
Click the button above to view the full job description and apply directly.
Explore more jobs