TensorFlow Serving

A production-grade serving system for machine learning models. It provides high-performance inference over gRPC and REST APIs, with model versioning and request batching, for TensorFlow models and models from other ML frameworks.
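As a rough illustration of the REST API and model versioning mentioned above, the sketch below builds a prediction request using only the Python standard library. It assumes the commonly documented REST layout (`/v1/models/<name>[/versions/<n>]:predict` with an `instances` JSON body) and the conventional REST port 8501. The host, the model name `my_model`, the version number, and the input shape are hypothetical placeholders, not values taken from this page.

```python
import json
from typing import Any, List, Optional
from urllib import request


def predict_url(host: str, model: str, version: Optional[int] = None,
                port: int = 8501) -> str:
    """Build the REST predict URL; port 8501 is an assumed default."""
    url = f"http://{host}:{port}/v1/models/{model}"
    if version is not None:
        # Pin a specific model version instead of whichever one the server picks.
        url += f"/versions/{version}"
    return url + ":predict"


def build_predict_request(url: str, instances: List[Any]) -> request.Request:
    """Wrap the inputs in the row-oriented {"instances": [...]} JSON body."""
    body = json.dumps({"instances": instances}).encode("utf-8")
    return request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # "my_model" and version 2 are hypothetical placeholders.
    req = build_predict_request(
        predict_url("localhost", "my_model", version=2),
        [[1.0, 2.0, 3.0]],
    )
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    # Sending needs a running server with that model loaded:
    #   with request.urlopen(req) as resp:
    #       print(json.load(resp))
```

Omitting `version` leaves the choice of version to the server's configured policy. The request is only constructed here, so the sketch runs without a server.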