Deploying gen AI application serving pipelines#

MLRun serving can produce managed ML application pipelines using real-time auto-scaling Nuclio serverless functions. The application pipeline includes all the steps including: accepting events or data, preparing the required model features, inferring results using one or more models, and driving actions.

In this section

See also