Inference Optimization: For PyTorch Users
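Minimal, hedged code sketches illustrating each topic follow the list below.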
How to find the acceleration method with the lowest latency using InferenceOptimizer
How to accelerate a PyTorch inference pipeline through ONNXRuntime
How to accelerate a PyTorch inference pipeline through OpenVINO
How to accelerate a PyTorch inference pipeline through JIT/IPEX
How to quantize your PyTorch model to INT8 for inference using Intel Neural Compressor
How to enable automatic context management for PyTorch inference on Nano-optimized models
How to accelerate a PyTorch inference pipeline through multiple instances
How to accelerate a PyTorch inference pipeline using an Intel Arc series dGPU
How to accelerate PyTorch inference using an async multi-stage pipeline
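For the first topic, a minimal sketch of the `InferenceOptimizer` search flow, assuming a working `bigdl-nano` installation; the toy model, random data, and the `thread_num`/`latency_sample_num` values are stand-ins for your own pipeline:

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

from bigdl.nano.pytorch import InferenceOptimizer

# Toy model and calibration data; swap in your own model and DataLoader.
model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU(),
                      nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                      nn.Linear(8, 10)).eval()
data = TensorDataset(torch.rand(16, 3, 32, 32), torch.randint(0, 10, (16,)))
loader = DataLoader(data, batch_size=1)

optimizer = InferenceOptimizer()
# Benchmark every applicable acceleration method on the calibration data.
optimizer.optimize(model=model,
                   training_data=loader,
                   thread_num=4,           # pin benchmarking to 4 threads
                   latency_sample_num=30)  # samples per latency measurement
optimizer.summary()                        # per-method latency/status table
best_model, option = optimizer.get_best_model()
print("lowest-latency option:", option)
```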
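For the ONNXRuntime guide, the core call is `InferenceOptimizer.trace` with `accelerator="onnxruntime"`. A sketch, assuming the ONNXRuntime extra dependencies are installed and using the same toy model shape as above:

```python
import torch
from torch import nn
from bigdl.nano.pytorch import InferenceOptimizer

model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU(), nn.Flatten()).eval()
x = torch.rand(1, 3, 32, 32)

# Convert to an ONNXRuntime-backed module; input_sample fixes the input shape.
ort_model = InferenceOptimizer.trace(model,
                                     accelerator="onnxruntime",
                                     input_sample=x)
with InferenceOptimizer.get_context(ort_model):  # apply Nano's inference context
    y = ort_model(x)
```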
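The OpenVINO path is the same `trace` call with a different backend, assuming the OpenVINO toolkit dependencies are present:

```python
import torch
from torch import nn
from bigdl.nano.pytorch import InferenceOptimizer

model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU(), nn.Flatten()).eval()
x = torch.rand(1, 3, 32, 32)

# Convert the PyTorch module into an OpenVINO-backed one.
ov_model = InferenceOptimizer.trace(model,
                                    accelerator="openvino",
                                    input_sample=x)
with InferenceOptimizer.get_context(ov_model):
    y = ov_model(x)
```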
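For JIT/IPEX, a sketch that layers Intel Extension for PyTorch optimizations on top of TorchScript tracing; the `use_ipex=True` flag is how Nano exposes this combination in my understanding of the API:

```python
import torch
from torch import nn
from bigdl.nano.pytorch import InferenceOptimizer

model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU(), nn.Flatten()).eval()
x = torch.rand(1, 3, 32, 32)

# TorchScript tracing ("jit") with IPEX optimizations applied on top.
jit_model = InferenceOptimizer.trace(model,
                                     accelerator="jit",
                                     use_ipex=True,
                                     input_sample=x)
with InferenceOptimizer.get_context(jit_model):
    y = jit_model(x)
```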
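For INT8 quantization via Intel Neural Compressor, Nano wraps the post-training flow behind `InferenceOptimizer.quantize`. A sketch, with the caveat that the calibration-data keyword is an assumption here (recent releases use `calib_data`; older ones may name it differently):

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

from bigdl.nano.pytorch import InferenceOptimizer

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10)).eval()
data = TensorDataset(torch.rand(16, 3, 32, 32), torch.randint(0, 10, (16,)))
loader = DataLoader(data, batch_size=1)

# Post-training INT8 quantization; INC is the default backend when no
# accelerator is specified.
q_model = InferenceOptimizer.quantize(model,
                                      precision="int8",
                                      calib_data=loader)  # keyword: an assumption
with InferenceOptimizer.get_context(q_model):
    y = q_model(torch.rand(1, 3, 32, 32))
```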
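Automatic context management is the `InferenceOptimizer.get_context` pattern already used above: the returned context manager applies the runtime settings (thread number and similar) recorded in a Nano-optimized model, so callers need not configure them by hand. A sketch:

```python
import torch
from torch import nn
from bigdl.nano.pytorch import InferenceOptimizer

model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU(), nn.Flatten()).eval()
x = torch.rand(1, 3, 32, 32)

opt_model = InferenceOptimizer.trace(model, accelerator="jit", input_sample=x)
# The context restores whatever runtime configuration the optimized
# model was built with, automatically.
with InferenceOptimizer.get_context(opt_model):
    y = opt_model(x)
```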
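Multi-instance acceleration spreads independent input batches across several worker processes. A sketch using `to_multi_instance`, which in my reading of the API takes a list of inputs and returns a list of outputs:

```python
import torch
from torch import nn
from bigdl.nano.pytorch import InferenceOptimizer

if __name__ == "__main__":  # guard: this spawns worker processes
    model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10)).eval()
    # Wrap the model so calls fan out across two processes.
    multi_model = InferenceOptimizer.to_multi_instance(model, num_processes=2)
    x_list = [torch.rand(1, 3, 32, 32) for _ in range(8)]
    y_list = multi_model(x_list)  # batches are distributed across processes
```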
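For the Intel Arc dGPU topic, the sketch below uses Intel Extension for PyTorch's XPU backend directly rather than Nano's own Arc-specific API, which I would confirm against the guide itself; it shows the general shape of running inference on an `xpu` device:

```python
import torch
import intel_extension_for_pytorch as ipex  # registers the "xpu" device
from torch import nn

# Generic IPEX XPU flow; an assumption, not necessarily Nano's Arc API.
model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU(),
                      nn.Flatten()).eval().to("xpu")
model = ipex.optimize(model)       # apply IPEX kernel/graph optimizations
x = torch.rand(1, 3, 32, 32).to("xpu")
with torch.no_grad():
    y = model(x)
```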
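Finally, the async multi-stage idea is to overlap preprocessing with model execution so neither stage idles. The sketch below uses stdlib threads and queues to illustrate the concept; it is a generic illustration, not Nano's own pipeline API, whose exact signature the linked guide covers:

```python
import queue
import threading
import torch
from torch import nn

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 8, 10)).eval()
raw_q, tensor_q, out_q = queue.Queue(4), queue.Queue(4), queue.Queue()

def preprocess_stage():
    # Stage 1: normalize raw uint8 frames; a None sentinel shuts the stage down.
    while (item := raw_q.get()) is not None:
        tensor_q.put(item.float() / 255)
    tensor_q.put(None)

def inference_stage():
    # Stage 2: run the model on preprocessed tensors as they arrive.
    with torch.no_grad():
        while (x := tensor_q.get()) is not None:
            out_q.put(model(x))
    out_q.put(None)

threads = [threading.Thread(target=preprocess_stage),
           threading.Thread(target=inference_stage)]
for t in threads:
    t.start()
for _ in range(8):  # feed 8 fake camera frames
    raw_q.put(torch.randint(0, 256, (1, 3, 8, 8), dtype=torch.uint8))
raw_q.put(None)     # sentinel propagates through both stages

results = []
while (y := out_q.get()) is not None:
    results.append(y)
for t in threads:
    t.join()
print(len(results), "outputs")
```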