ML.NET model accuracy drops after deployment to production

Question

ML.NET model accuracy drops after deployment to production

KIRAN P 0

I built a classification model with ML.NET in a .NET 8 application. The model performed well during local testing with about 92% accuracy, but after deploying it to production, the predictions became unreliable and the overall accuracy dropped noticeably.
Is PredictionEngine safe for concurrent ASP.NET requests? or I need to use PredictionEnginePool instead?

0 comments

2 answers

Your answer

Answer 1

Varsha Dundigalla(INFOSYS LIMITED) 4,945 Microsoft External Staff

Thank you for reaching out.

PredictionEngine is not designed to handle multiple requests at the same time. It is meant for simple or single‑prediction scenarios, and it is not thread-safe.

In a production ASP.NET app, many requests can happen in parallel, and if predictions are made at the same time using this type, it can lead to inconsistent or unreliable outputs. That can look like the model accuracy has dropped, even though the model itself hasn’t changed.

To avoid this, the recommended approach in web applications is to use PredictionEnginePool, which is built to handle concurrent requests safely by managing multiple instances internally.

At the same time, it is also important to make sure the data being passed in production is the same format and processed the same way as during training, because differences there can also affect prediction results.

Please let us know if you require any further assistance, we’re happy to help. If you found this information useful, kindly mark this as "Accept Answer". So that others in the community facing similar issues can easily find the solution. Your contribution is highly appreciated.

Varsha Dundigalla(INFOSYS LIMITED) 4,945 Reputation points Microsoft External Staff

2026-05-14T13:19:06.8933333+00:00

Hope you're doing well. Could you please let us know if the issue still persists or if it has been resolved? If everything is working fine now, we’d appreciate it if you could mark the response as "Accepted Answer". We will proceed with the thread closure by 15th May, so we are hoping for your reply before that.
Varsha Dundigalla(INFOSYS LIMITED) 4,945 Reputation points Microsoft External Staff

2026-05-15T12:43:17.47+00:00

We are closing this thread at this moment since we are unable to investigate this issue further without the requested information. If the issue still occurs, please collect the requested information and share it with us. Upon receipt of the additional information, we will reopen this thread and continue to investigate this issue. Thank you.

Answer 2

PredictionEngine is not safe for concurrent ASP.NET requests.

PredictionEngine is explicitly documented as:

A convenience API for single predictions.
Not thread-safe.
Acceptable only in single-threaded or prototype environments.

In a production ASP.NET Core application handling concurrent requests, using a shared PredictionEngine instance can cause race conditions and corrupt internal state, which can manifest as unstable or degraded prediction accuracy after deployment.

For production and concurrent scenarios, use PredictionEnginePool instead. PredictionEnginePool:

Creates an ObjectPool of PredictionEngine instances.
Provides improved performance and thread safety.
Can automatically reload models when the underlying file changes (when configured with watchForChanges: true).

In an ASP.NET Core app, register it in Program.cs like this:

builder.Services.AddPredictionEnginePool<ModelInput, ModelOutput>()
    .FromFile(modelName: "SentimentAnalysisModel", filePath: "sentiment_model.zip", watchForChanges: true);

Then inject and use the pool in controllers/services instead of a single PredictionEngine instance.

Using PredictionEnginePool is the recommended approach for production, concurrent ASP.NET workloads and can help prevent the kind of reliability issues observed after deployment.

References:

Share via

ML.NET model accuracy drops after deployment to production

2 answers

Your answer