Special token ids aren't validated when loading a model

When loading the GGUF model, special token ids aren't validated to be in range, this can lead to index errors later on when they're used to looked up tokens, etc.

Here's an example: https://huggingface.co/Undi95/Mistral-11B-OmniMix/blob/main/config.json

We have 

```json
  "eos_token_id": 32000,
```

However the model's vocab size is 32,000 so that is out of bounds. Currently trying to load that model just crashes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Special token ids aren't validated when loading a model #3634

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Special token ids aren't validated when loading a model #3634

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions