server: Allow for longer prompts in q URL parameter #16862

chansikpark · 2025-10-30T15:09:17Z

Resolves the larger part of #16830.

The current limit is 8192 characters. This is not enough for summarizing most articles worth summarizing. Doubling handles many. Quadrupling would arguably handle most.

Supporting web browser integration introduces security risks. If there were an exploit opportunity that opened up in llama-server, this could give the exploiter quadruple the space for an exploit. Furthermore, an exploit might not always require the user to have inadvisably exposed the server to a public network. If a prompt injection attack on a website isn't handled properly anywhere along the way to/from the LLM, and there's some exploitable code where it's mishandled, this change could presumably make this easier/possible to exploit.

It could be sufficient to include a warning somewhere along with advice for good practices which might include using a combination of a strict content blocker and only allowing prompts from trusted websites.

Quadruple request URI max length to 32768

23ce9f4

chansikpark requested review from ggerganov and ngxson as code owners October 30, 2025 15:09

chansikpark changed the title ~~Allow for longer prompts in q URL parameter~~ server: Allow for longer prompts in q URL parameter Oct 30, 2025

ggerganov approved these changes Oct 30, 2025

View reviewed changes

ngxson approved these changes Oct 30, 2025

View reviewed changes

github-actions bot added examples server labels Oct 30, 2025

ggerganov merged commit 16724b5 into ggml-org:master Oct 30, 2025
64 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

server: Allow for longer prompts in q URL parameter #16862

server: Allow for longer prompts in q URL parameter #16862

Uh oh!

chansikpark commented Oct 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

server: Allow for longer prompts in q URL parameter #16862

server: Allow for longer prompts in q URL parameter #16862

Uh oh!

Conversation

chansikpark commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

chansikpark commented Oct 30, 2025 •

edited

Loading