Loading...
Please wait a moment
Founded by passionate advocates of learning and innovation, Learni set out to make professional training accessible to everyone, everywhere in the world. Our team works in the largest cities such as Paris, Lyon, Marseille, and internationally, to support talents and organizations in their skills development.
Which format do you prefer?
30 free minutes with a training advisor — no commitment.
Loading available slots...
Artificial Intelligence training in Raleigh in June 2026 with Learni. Certified, expert trainers, eligible for employer funding. Free quote.
Cybersecurity training in Brighton in July 2026 with Learni. Certified, expert trainers, eligible for employer funding. Free quote.
Discover Learni's comprehensive Excel training program launching in May 2026, guiding learners from basic spreadsheets to advanced data mastery for career success.
Comprehensive guide to Figma training in 2025, covering essentials to sophisticated prototyping. Ideal for designers preparing for professional growth.
The Training ONNX Runtime - Accelerate ML Inference in Production training is delivered in-person or remotely (blended-learning, e-learning, virtual classroom, remote in-person). At Learni, a Qualiopi-certified training organization, each program is designed to maximize skills acquisition, regardless of the training mode chosen.
The trainer alternates between demonstrative, interrogative, and active methods (through practical exercises and/or real-world scenarios). This pedagogical approach ensures concrete and directly applicable learning in the workplace.
To ensure the quality of the Training ONNX Runtime - Accelerate ML Inference in Production training, Learni provides the following teaching resources:
For in-house training at a location external to Learni, the client ensures and commits to having all necessary teaching materials (IT equipment, internet connection...) for the proper conduct of the training action in accordance with the prerequisites indicated in the communicated training program.
The assessment of skills acquired during the Training ONNX Runtime - Accelerate ML Inference in Production training is carried out through:
Learni is committed to the accessibility of its professional training programs. All our training programs are accessible to people with disabilities. Our teams are available to adapt teaching methods to your specific needs. Do not hesitate to contact us for any accommodation request.
Learni training programs are available for inter-company and intra-company settings, both in-person and remote. Registration is possible up to 48 business hours before the start of training. Our programs are eligible for OPCO, Pôle emploi, and FNE-Formation funding. Contact us to discuss your training project and funding possibilities.
Quick installation of ONNX Runtime on various environments, hands-on with Python and C++ APIs to load your first ONNX models, execution of basic inferences on real datasets like ImageNet, practical exercises to convert TensorFlow or PyTorch models to ONNX, analysis of performance logs and creation of your first professional inference script, with code review by the trainer to validate enterprise best practices.
Exploration of ONNX Runtime providers for CPU, GPU CUDA, and DirectML, application of automatic and manual graph optimizations on your models, exercises to reduce inference latency by 50% on concrete enterprise cases, use of tools like ONNX Optimizer and Model Profiler, development of custom scripts for quantization and pruning, comparative tests before/after with precise metrics, deliverable: optimized model ready for production.
Integration of ONNX Runtime into Python Flask applications for inference APIs, development of a scalable web service with session management and batching, exercises on edge deployment with ONNX Runtime Mobile for Android/iOS, connection to databases for end-to-end pipelines, practical cases inspired by production such as real-time image recognition, advanced debugging and monitoring with integrated tools, production of a functional prototype usable immediately in the enterprise.
Containerization of models with Docker and official ONNX Runtime containers, orchestration with Kubernetes for auto-scaling, implementation of security (authentication, input encryption), setup of monitoring with Prometheus and Grafana for latency/throughput, exercises on error handling and rollbacks in production, multi-hardware benchmarks and final optimization, deliverable: complete deployment pipeline with monitoring dashboard, ready for your critical business challenges.
Target audience
Data scientists, ML engineers, AI developers seeking to optimize inference in the enterprise to enhance their skills
Prerequisites
Knowledge of Python, basics in Machine Learning, and handling of ONNX models
Loading...
Please wait a moment





























