Deploying gen AI application serving pipelines

MLRun serving can produce managed ML application pipelines using real-time, auto-scaling Nuclio serverless functions. The application pipeline covers every step: accepting events or data, preparing the required model features, inferring results using one or more models, and driving actions.
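To make the four stages concrete, here is a minimal sketch in plain Python. It does not use the MLRun or Nuclio APIs; every name in it (`accept_event`, `prepare_features`, `infer`, `drive_action`, `run_pipeline`) is hypothetical, and the "model" is a trivial rule standing in for a real one. It only illustrates how an event could flow through the steps in order.

```python
from typing import Any, Callable, Dict, List

# Illustrative types only; these are assumptions, not MLRun classes.
Event = Dict[str, Any]
Step = Callable[[Event], Event]


def accept_event(event: Event) -> Event:
    """Step 1: accept the incoming event and validate its shape."""
    if "text" not in event:
        raise ValueError("event must contain a 'text' field")
    return dict(event)  # copy so later steps do not mutate the caller's dict


def prepare_features(event: Event) -> Event:
    """Step 2: derive the features the model needs from the raw input."""
    text = event["text"]
    event["features"] = {"length": len(text), "has_question": "?" in text}
    return event


def infer(event: Event) -> Event:
    """Step 3: run inference (a hard-coded rule stands in for a model)."""
    is_question = event["features"]["has_question"]
    event["prediction"] = "question" if is_question else "statement"
    return event


def drive_action(event: Event) -> Event:
    """Step 4: choose a downstream action based on the prediction."""
    if event["prediction"] == "question":
        event["action"] = "route_to_qa"
    else:
        event["action"] = "log_only"
    return event


def run_pipeline(event: Event, steps: List[Step]) -> Event:
    """Pass the event through each step in order and return the result."""
    for step in steps:
        event = step(event)
    return event


PIPELINE: List[Step] = [accept_event, prepare_features, infer, drive_action]

result = run_pipeline({"text": "What is MLRun?"}, PIPELINE)
print(result["prediction"], result["action"])
```

In a real deployment each step would typically be a separately configured stage of the serving function rather than a local Python call; the linear list here is just the simplest way to show the ordering.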
