This repository was archived by the owner on May 1, 2025. It is now read-only.

Unable to load models plus-16B and plus-6B #39

@daubaris

Description

Hi, thank you for your work. I'm trying to use the CodeT5+ model types plus-16B and plus-6B. However, when running, I get the following error:

ValueError: CodeT5pEncoderDecoderModel does not support "device_map='auto'". To implement support, the model class needs to implement the "_no_split_modules" attribute.

The code I'm using is the same as provided in the examples:

from codetf.models import load_model_pipeline

code_generation_model = load_model_pipeline(
    model_name="codet5", task="pretrained",
    model_type="plus-6B", is_eval=True,
    load_in_8bit=True, load_in_4bit=False, weight_sharding=False)

result = code_generation_model.predict(["def print_hello_world():"])
print(result)

Any ideas on how the issue could be resolved?
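For what it's worth, the upstream CodeT5+ model card loads these checkpoints directly through transformers, without going through CodeTF and without device_map='auto'. Below is a minimal sketch along those lines, assuming a single GPU with enough memory for the fp16 weights, and assuming Salesforce/codet5p-6b is the Hugging Face checkpoint that model_type="plus-6B" maps to:

from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
import torch

checkpoint = "Salesforce/codet5p-6b"  # assumed upstream checkpoint for model_type="plus-6B"
device = "cuda"

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
# trust_remote_code is needed: the 2B/6B/16B checkpoints ship custom modeling
# code, which is where CodeT5pEncoderDecoderModel comes from
model = AutoModelForSeq2SeqLM.from_pretrained(
    checkpoint,
    torch_dtype=torch.float16,
    low_cpu_mem_usage=True,
    trust_remote_code=True,
).to(device)

encoding = tokenizer("def print_hello_world():", return_tensors="pt").to(device)
# the encoder-decoder checkpoints expect decoder_input_ids for generation
encoding["decoder_input_ids"] = encoding["input_ids"].clone()
outputs = model.generate(**encoding, max_length=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

This sidesteps the _no_split_modules check entirely, since nothing asks accelerate to shard the model across devices. If 8-bit loading is the goal, it may be that load_in_8bit=True is what makes CodeTF request device_map='auto' internally, but that is a guess about CodeTF's internals rather than something the traceback confirms.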
