Skip to content

Azure online endpoint is Scaling taking long time #3515

@newstar85

Description

@newstar85

Operating System

Linux

Version Information

Using the latest version of ML online endpoint

Steps to reproduce

I have a project to run an AI model using an online endpoint as a backend service, the endpoint is configured (manually set in the portal) to be auto-scale based on the number of requests.

Image

Expected behavior

Expect the endpoint will scale up between 1-2 minutes like other services such as virtual machine scaleset, etc...

Actual behavior

With endpoint, scaling takes a long time, about 12-18 minutes.

Addition information

Do you have suggestions for speeding up the scaling time?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions