fix: runtime capability detection for backends #6149

sozercan · 2025-08-26T18:02:36Z

Description

This fixes a regression since llama-cpp is a modular backend now. Previously, we detected capabilities at runtime, and fallback to cpu if it's not possible to run with the highest priority meta backend.

Issue happens if you have a container with cuda (or other) and cpu llama-cpp backends. Since llama-cpp alias has both meta backends, it might use the cuda runtime, which we may not be able to run depending on the container and host capabilities. We'll need to detect the platform at runtime so we can fallback gracefully instead of expecting user to set the appropriate value.

Notes for Reviewers

Signed commits

Yes, I signed my commits.

Signed-off-by: Sertac Ozercan <[email protected]>

netlify · 2025-08-26T18:02:55Z

✅ Deploy Preview for localai ready!

Name	Link
🔨 Latest commit	`3b673ff`
🔍 Latest deploy log	https://app.netlify.com/projects/localai/deploys/68b686fd614895000889d74f
😎 Deploy Preview	https://deploy-preview-6149--localai.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

Signed-off-by: Sertac Ozercan <[email protected]>

mudler · 2025-08-27T07:03:13Z

Thanks! this makes sense, just small nits here and there for consistency

mudler · 2025-08-27T07:05:06Z

core/gallery/backends.go

+// ListSystemBackendsSelected lists system backends and, when multiple concrete backends share the same alias
+// (e.g., cpu-llama-cpp and cuda12-llama-cpp both alias to "llama-cpp"), selects the optimal one based on the
+// detected system capability (GPU vendor/platform). Concrete backend names are always included.
+func ListSystemBackendsSelected(systemState *system.SystemState) (SystemBackends, error) {


I think at this point would make sense to actually modify directly ListSystemBackends

LocalAI/core/gallery/backends.go

Line 284 in 21faa41

func ListSystemBackends(systemState *system.SystemState) (SystemBackends, error) {

Its usage in the code is quite limited https://github.com/search?q=repo%3Amudler%2FLocalAI%20ListSystemBackends&type=code

otherwise would make sense to re-use it as much as possible, to avoid code dups

mudler · 2025-08-27T07:06:27Z

core/gallery/backends.go

+	return backends, nil
+}
+
+func selectBestCandidate(systemState *system.SystemState, cands []backendCandidate) backendCandidate {


Probably this is better placed in the capabilities code, to keep the capability logic well isolated.

Could maybe be just a method of system State?

https://github.com/mudler/LocalAI/blob/21faa4114bf6c8980fc612e7db5a2a13b62e8d23/pkg/system/capabilities.go

Signed-off-by: Sertac Ozercan <[email protected]>

runtime capability detection for backends

dbb7430

Signed-off-by: Sertac Ozercan <[email protected]>

sozercan requested a review from mudler August 26, 2025 18:02

sozercan added 2 commits August 26, 2025 18:46

test

05ebc74

Signed-off-by: Sertac Ozercan <[email protected]>

skip nvidia on darwin

f76a7b9

Signed-off-by: Sertac Ozercan <[email protected]>

mudler reviewed Aug 27, 2025

View reviewed changes

sozercan added 2 commits September 2, 2025 05:45

Merge branch 'master' into runtime-caps-selection

b2ca93e

address review comments

3b673ff

Signed-off-by: Sertac Ozercan <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix: runtime capability detection for backends #6149

fix: runtime capability detection for backends #6149

Uh oh!

sozercan commented Aug 26, 2025 •

edited

Loading

Uh oh!

netlify bot commented Aug 26, 2025 •

edited

Loading

Uh oh!

mudler commented Aug 27, 2025

Uh oh!

mudler Aug 27, 2025 •

edited

Loading

Uh oh!

mudler Aug 27, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

fix: runtime capability detection for backends #6149

Are you sure you want to change the base?

fix: runtime capability detection for backends #6149

Uh oh!

Conversation

sozercan commented Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

netlify bot commented Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for localai ready!

Uh oh!

mudler commented Aug 27, 2025

Uh oh!

mudler Aug 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mudler Aug 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sozercan commented Aug 26, 2025 •

edited

Loading

netlify bot commented Aug 26, 2025 •

edited

Loading

mudler Aug 27, 2025 •

edited

Loading

mudler Aug 27, 2025 •

edited

Loading