An Azure machine learning service for building and deploying models.
Internal Server Error when starting compute session in Prompt Flow in classic Foundry UI
I’m encountering an issue in Azure AI Foundry/Prompt Flow where the starting compute session operation fails with an internal server error, with no additional information. Error message: Session operation failed: InternalServerError Trace ID:…
Azure Machine Learning
Sweep jobs using Bayesian Optimisation now fail due to missing pkg_resources
We've had a working sweep job creation method for a long time that suddenly starting breaking end of last week. We haven't updated our environment, code or job submission file. The error is as follows: Encounter error : Exception occurred while sampling:…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Import "azure.identity" could not be resolved
using the following script: https://microsoftlearning.github.io/mslearn-mlops/docs/01-experiment-evaluate-models.html line1: import azure.identity , get error Import "azure.identity" could not be resolved I'm using kernel Python 3.10…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Not able to make a gpu instance in azure ml compute instance
Hi, i wanted to train a model so i went and deplyed an azure ml and then in compute when i make a compute instance it shows cpu models available but gpu there is no quota and it shows You do not have enough quota for the following VM sizes. Click here to…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Support for mlflow 3.x.x in azure machine learning studio?
We had the same issue that is mentioned here: https://learn.microsoft.com/en-us/answers/questions/2288377/how-to-fix-model-logging-in-mlflow or here: https://learn.microsoft.com/en-us/answers/questions/5609748/support-for-mlflow-3-x-x-in-azure-ml. Is…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Azure ML: .txt files appear truncated and get _start_end.txt suffixes – what causes this and how to avoid it?
I am working with plain text (.txt) files in Azure Machine Learning and observing unexpected behavior when importing / consuming them as data. When the files are accessed through Azure ML datasets (e.g. FileDataset / MLTable / jobs), they sometimes…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Problem with creation of compute instance.
Hello, I am experiencing an issue while deploying a new compute instance. The provisioning process is stuck in the "Creating" state and does not progress. Could you please investigate if there is a platform-level issue or a resource lock…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
The compute instance in the Azure ML workspace is stuck in the “Creating” state
We deploy Azure ML workspaces, endpoints, and computes via ARM using EV2. The EV2 deployment is failing at the step where a Machine Learning compute instance is being created. We observed that the compute instance remains stuck in the “Creating” state…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Problem connecting Azure Foundry models to VS Code Codex
We are trying to set up a foundry model to use for codex in VS Code and we're experiencing some issues, that I was hoping you could help us solve. We then followed this guide to set it up in vscode:…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
[AML] `request_timeout_ms` always resets to 0 on AML Singularity online deployments when updating with "az ml online-endpoint update"
When updating request_settings.request_timeout_ms on an Azure ML online deployment backed by a Singularity instance type, the value is silently ignored and always returns 0, regardless of the update method used. Steps to Reproduce Method 1: az ml with…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Auto ML RAI job fails when 'Explain Model' is enables for classification
Environment - Azure Machine Learning, AutoML Classification task, Explain Model feature enabled Issue - I trained a classification model using Azure ML AutoML with the 'Explain Model' option turned on. The model training completed successfully and the…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Compute start and stop and new resource creation is failing in AML workspace
I am unable to create and start and stop the compute resources in the AML workspace to run the notebooks required help in fixing the issue. is it due to quota issue?
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Azure ML Model prediction fail with Batch Endpoint
I have trained a Classification model through AutoML in Azure Machine Learning. After the training I registered the best model, deployed the model using the batch endpoint wizard. However, BatchScoring job always fail with this error: Execution failed.…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Azure AutoML Classification Job Fails After Featurization - "Non terminal exception retries exhausted. TotalRetriesAttempted: 90
Environment Service: Azure Machine Learning AutoML (Classification) SDK: azure-ai-ml (Python 3.9) Job type: AutoML Classification, submitted via MLClient.jobs.create_or_update() Primary metric: AUC_weighted Max trials: 200 Feature count: ~600…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Cannot deploy Anthropic Claude models - InvalidModelProviderData via REST API, UI deployment also fails
Issue type: Technical Service: Cognitive Services Problem type: Deployment and setup issues Severity: C Support plan: Basic (free) Attempts: 1. AI Foundry UI: deployment completes submission but resource shows Failed state with only "Internal…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Using newer versions of python packages with Azure ML Batch Inference
I'm currently serving a scikit-learn style mlflow model using batch inference. I create the deployment using the python sdk azure.ai.ml.entities ModelBatchDeployment Based on the instructions linked, the environment associated with the mlflow model…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
Azure ML Batch Endpoint stuck in Updating state – HTTP 409 EndpointIsBeingUpdated blocking all operations
Azure ML Batch Endpoint stuck in Updating state – HTTP 409 EndpointIsBeingUpdated Question We are facing an issue where Azure Machine Learning Batch Endpoints get permanently stuck in provisioningState = Updating, blocking all further…
Azure Machine Learning
An Azure machine learning service for building and deploying models.
I can not access to my AML's blob storage, so I can't use compute resource.
I can not access to my AML's blob storage, so I can't use compute resource.
Azure Machine Learning
An Azure machine learning service for building and deploying models.
deployment of ML model in azure
I have created a ML model in Vscode registerd the model in MLflow, FastAPI, docker image is create how to I productionise it in azure and create a CI/CD pipeline in azure devops. need to monitor the model and retrain it. I have stored my codes in github
Azure Machine Learning
An Azure machine learning service for building and deploying models.
AML LowPriority Compute Still Running Post‑Retirement – Clarification on Spot Conversion.
Hello, Following the retirement of Azure Batch LowPriority VMs for AML on 31 March 2026, we understand AML Batch‑based clusters must be recreated using Dedicated VMs. However, we are currently migrating our workloads to AKS using Spot nodes as an…
Azure Machine Learning
An Azure machine learning service for building and deploying models.