Commit b505131
llama : add phi-2 + fix NeoX rope + ggml_mul_mat_set_prec (ggml-org#4490)
* phi2 implementation
* fix breaking change
* phi-2 : various fixes
* phi-2 : use layer norm eps
* py : whitespaces
* llama : fix meta KV override bug
* convert : phi don't add BOS token
* convert : revert "added_tokens_decoder" change
* phi-2 : scale Q instead of KQ for better precision
* ggml : fix NeoX rope to rotate just first n_dims
* cuda : less diff in the rope_neox kernel
* ggml : add ggml_mul_mat_set_prec
ggml-ci
* Update ggml-cuda.cu
Co-authored-by: slaren <[email protected]>
* Update ggml-cuda.cu
Co-authored-by: slaren <[email protected]>
* cuda : ggml_cuda_op_mul_mat_cublas support F32 precision
* cuda : remove oboslete comment
---------
Co-authored-by: Ebey Abraham <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>
Co-authored-by: slaren <[email protected]>1 parent a2616a1 commit b505131
File tree
9 files changed
+463
-76
lines changed- gguf-py/gguf
- tests
9 files changed
+463
-76
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
182 | 182 | | |
183 | 183 | | |
184 | 184 | | |
| 185 | + | |
| 186 | + | |
185 | 187 | | |
186 | 188 | | |
187 | 189 | | |
| |||
221 | 223 | | |
222 | 224 | | |
223 | 225 | | |
| 226 | + | |
| 227 | + | |
224 | 228 | | |
225 | 229 | | |
226 | 230 | | |
| |||
980 | 984 | | |
981 | 985 | | |
982 | 986 | | |
| 987 | + | |
| 988 | + | |
| 989 | + | |
| 990 | + | |
| 991 | + | |
| 992 | + | |
| 993 | + | |
| 994 | + | |
| 995 | + | |
| 996 | + | |
| 997 | + | |
| 998 | + | |
| 999 | + | |
| 1000 | + | |
| 1001 | + | |
| 1002 | + | |
| 1003 | + | |
| 1004 | + | |
983 | 1005 | | |
984 | 1006 | | |
985 | 1007 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4998 | 4998 | | |
4999 | 4999 | | |
5000 | 5000 | | |
5001 | | - | |
| 5001 | + | |
| 5002 | + | |
| 5003 | + | |
| 5004 | + | |
| 5005 | + | |
| 5006 | + | |
| 5007 | + | |
| 5008 | + | |
| 5009 | + | |
| 5010 | + | |
5002 | 5011 | | |
5003 | 5012 | | |
5004 | 5013 | | |
| |||
7057 | 7066 | | |
7058 | 7067 | | |
7059 | 7068 | | |
| 7069 | + | |
7060 | 7070 | | |
7061 | 7071 | | |
7062 | 7072 | | |
| |||
7073 | 7083 | | |
7074 | 7084 | | |
7075 | 7085 | | |
| 7086 | + | |
7076 | 7087 | | |
7077 | 7088 | | |
7078 | 7089 | | |
| |||
7376 | 7387 | | |
7377 | 7388 | | |
7378 | 7389 | | |
7379 | | - | |
| 7390 | + | |
7380 | 7391 | | |
7381 | 7392 | | |
7382 | 7393 | | |
| |||
8300 | 8311 | | |
8301 | 8312 | | |
8302 | 8313 | | |
8303 | | - | |
| 8314 | + | |
8304 | 8315 | | |
8305 | | - | |
8306 | | - | |
8307 | | - | |
8308 | | - | |
8309 | | - | |
8310 | | - | |
8311 | | - | |
8312 | | - | |
| 8316 | + | |
| 8317 | + | |
| 8318 | + | |
| 8319 | + | |
| 8320 | + | |
| 8321 | + | |
| 8322 | + | |
| 8323 | + | |
8313 | 8324 | | |
8314 | 8325 | | |
8315 | 8326 | | |
8316 | 8327 | | |
8317 | 8328 | | |
8318 | | - | |
8319 | | - | |
| 8329 | + | |
| 8330 | + | |
8320 | 8331 | | |
8321 | 8332 | | |
8322 | 8333 | | |
8323 | | - | |
| 8334 | + | |
8324 | 8335 | | |
8325 | 8336 | | |
8326 | 8337 | | |
| |||
8376 | 8387 | | |
8377 | 8388 | | |
8378 | 8389 | | |
8379 | | - | |
| 8390 | + | |
| 8391 | + | |
| 8392 | + | |
| 8393 | + | |
| 8394 | + | |
| 8395 | + | |
| 8396 | + | |
| 8397 | + | |
| 8398 | + | |
| 8399 | + | |
| 8400 | + | |
| 8401 | + | |
| 8402 | + | |
| 8403 | + | |
| 8404 | + | |
| 8405 | + | |
| 8406 | + | |
| 8407 | + | |
| 8408 | + | |
| 8409 | + | |
| 8410 | + | |
| 8411 | + | |
| 8412 | + | |
| 8413 | + | |
| 8414 | + | |
| 8415 | + | |
| 8416 | + | |
| 8417 | + | |
| 8418 | + | |
| 8419 | + | |
| 8420 | + | |
| 8421 | + | |
| 8422 | + | |
| 8423 | + | |
| 8424 | + | |
8380 | 8425 | | |
8381 | 8426 | | |
8382 | 8427 | | |
| |||
8385 | 8430 | | |
8386 | 8431 | | |
8387 | 8432 | | |
8388 | | - | |
8389 | | - | |
8390 | | - | |
8391 | 8433 | | |
8392 | 8434 | | |
8393 | 8435 | | |
| |||
8397 | 8439 | | |
8398 | 8440 | | |
8399 | 8441 | | |
8400 | | - | |
| 8442 | + | |
8401 | 8443 | | |
8402 | | - | |
8403 | | - | |
8404 | | - | |
8405 | | - | |
| 8444 | + | |
| 8445 | + | |
| 8446 | + | |
| 8447 | + | |
8406 | 8448 | | |
8407 | 8449 | | |
8408 | 8450 | | |
| |||
8414 | 8456 | | |
8415 | 8457 | | |
8416 | 8458 | | |
8417 | | - | |
8418 | | - | |
8419 | | - | |
| 8459 | + | |
| 8460 | + | |
| 8461 | + | |
8420 | 8462 | | |
8421 | | - | |
| 8463 | + | |
8422 | 8464 | | |
8423 | 8465 | | |
8424 | 8466 | | |
| |||
8435 | 8477 | | |
8436 | 8478 | | |
8437 | 8479 | | |
8438 | | - | |
| 8480 | + | |
8439 | 8481 | | |
8440 | 8482 | | |
8441 | 8483 | | |
8442 | 8484 | | |
8443 | 8485 | | |
8444 | | - | |
| 8486 | + | |
8445 | 8487 | | |
8446 | 8488 | | |
8447 | 8489 | | |
8448 | 8490 | | |
8449 | 8491 | | |
8450 | 8492 | | |
8451 | | - | |
8452 | | - | |
8453 | | - | |
| 8493 | + | |
| 8494 | + | |
| 8495 | + | |
8454 | 8496 | | |
8455 | | - | |
| 8497 | + | |
8456 | 8498 | | |
8457 | 8499 | | |
8458 | 8500 | | |
| |||
8464 | 8506 | | |
8465 | 8507 | | |
8466 | 8508 | | |
8467 | | - | |
8468 | | - | |
| 8509 | + | |
| 8510 | + | |
| 8511 | + | |
| 8512 | + | |
| 8513 | + | |
| 8514 | + | |
8469 | 8515 | | |
8470 | 8516 | | |
8471 | | - | |
8472 | 8517 | | |
8473 | 8518 | | |
8474 | 8519 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1702 | 1702 | | |
1703 | 1703 | | |
1704 | 1704 | | |
1705 | | - | |
1706 | | - | |
| 1705 | + | |
| 1706 | + | |
| 1707 | + | |
1707 | 1708 | | |
1708 | 1709 | | |
1709 | 1710 | | |
| |||
1722 | 1723 | | |
1723 | 1724 | | |
1724 | 1725 | | |
| 1726 | + | |
| 1727 | + | |
| 1728 | + | |
| 1729 | + | |
| 1730 | + | |
| 1731 | + | |
| 1732 | + | |
| 1733 | + | |
1725 | 1734 | | |
1726 | 1735 | | |
1727 | 1736 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4098 | 4098 | | |
4099 | 4099 | | |
4100 | 4100 | | |
| 4101 | + | |
| 4102 | + | |
| 4103 | + | |
| 4104 | + | |
| 4105 | + | |
| 4106 | + | |
| 4107 | + | |
| 4108 | + | |
4101 | 4109 | | |
4102 | 4110 | | |
4103 | 4111 | | |
| |||
9168 | 9176 | | |
9169 | 9177 | | |
9170 | 9178 | | |
| 9179 | + | |
| 9180 | + | |
9171 | 9181 | | |
9172 | 9182 | | |
9173 | 9183 | | |
| |||
9237 | 9247 | | |
9238 | 9248 | | |
9239 | 9249 | | |
| 9250 | + | |
| 9251 | + | |
9240 | 9252 | | |
9241 | 9253 | | |
9242 | 9254 | | |
| |||
11562 | 11574 | | |
11563 | 11575 | | |
11564 | 11576 | | |
11565 | | - | |
| 11577 | + | |
| 11578 | + | |
11566 | 11579 | | |
11567 | | - | |
11568 | | - | |
| 11580 | + | |
| 11581 | + | |
| 11582 | + | |
| 11583 | + | |
11569 | 11584 | | |
11570 | 11585 | | |
11571 | 11586 | | |
| |||
11588 | 11603 | | |
11589 | 11604 | | |
11590 | 11605 | | |
| 11606 | + | |
| 11607 | + | |
| 11608 | + | |
| 11609 | + | |
| 11610 | + | |
| 11611 | + | |
| 11612 | + | |
| 11613 | + | |
11591 | 11614 | | |
11592 | 11615 | | |
11593 | 11616 | | |
| |||
11715 | 11738 | | |
11716 | 11739 | | |
11717 | 11740 | | |
11718 | | - | |
| 11741 | + | |
| 11742 | + | |
11719 | 11743 | | |
11720 | | - | |
11721 | | - | |
| 11744 | + | |
| 11745 | + | |
| 11746 | + | |
| 11747 | + | |
11722 | 11748 | | |
11723 | 11749 | | |
11724 | 11750 | | |
| |||
11741 | 11767 | | |
11742 | 11768 | | |
11743 | 11769 | | |
| 11770 | + | |
| 11771 | + | |
| 11772 | + | |
| 11773 | + | |
| 11774 | + | |
| 11775 | + | |
| 11776 | + | |
| 11777 | + | |
11744 | 11778 | | |
11745 | 11779 | | |
11746 | 11780 | | |
| |||
0 commit comments