oobabooga | c5f048e912 | Bump ExLlamaV2 to 0.2.2 | 2024-09-27 15:04:08 -07:00 |
oobabooga | 7424f789bf | Fix the sampling monkey patch (and add more options to sampler_priority) (#6411) | 2024-09-27 19:03:25 -03:00 |
oobabooga | c497a32372 | Bump transformers to 4.45 | 2024-09-26 11:55:51 -07:00 |
oobabooga | f98431c744 | Apply the change to all requirements (oops) | 2024-09-06 18:48:13 -07:00 |
oobabooga | a50477ec85 | Apply the change to all requirements (oops) | 2024-09-06 18:47:25 -07:00 |
oobabooga | ac30b004ef | Pin fastapi/pydantic requirement versions | 2024-09-06 18:45:15 -07:00 |
oobabooga | e86ab37aaf | Merge remote-tracking branch 'refs/remotes/origin/dev' into dev | 2024-09-06 18:44:43 -07:00 |
oobabooga | 27797a92d0 | Pin fastapi/pydantic requirement versions | 2024-09-06 18:38:57 -07:00 |
Jean-Sylvain Boige | 4924ee2901 | typo in OpenAI response format (#6365) | 2024-09-05 21:42:23 -03:00 |
oobabooga | bba5b36d33 | Don't import PEFT unless necessary | 2024-09-03 19:40:53 -07:00 |
oobabooga | c5b40eb555 | llama.cpp: prevent prompt evaluation progress bar with just 1 step | 2024-09-03 17:37:06 -07:00 |
oobabooga | 2cb8d4c96e | Bump llama-cpp-python to 0.2.90 | 2024-09-03 05:53:18 -07:00 |
oobabooga | 64919e0d69 | Bump flash-attention to 2.6.3 | 2024-09-03 05:51:46 -07:00 |
oobabooga | 68d52c60f3 | Merge remote-tracking branch 'refs/remotes/origin/dev' into dev | 2024-09-02 21:16:39 -07:00 |
oobabooga | d1168afa76 | Bump ExLlamaV2 to 0.2.0 | 2024-09-02 21:15:51 -07:00 |
Stefan Merettig | 9a150c3368 | API: Relax multimodal format, fixes HuggingFace Chat UI (#6353) | 2024-09-02 23:03:15 -03:00 |
GralchemOz | 4c74c7a116 | Fix UnicodeDecodeError for BPE-based Models (especially GLM-4) (#6357) | 2024-09-02 23:00:59 -03:00 |
FartyPants (FP HAM) | 41a8eb4eeb | Training pro update script.py (#6359) | 2024-09-02 23:00:15 -03:00 |
oobabooga | 1f288b4072 | Bump ExLlamaV2 to 0.1.9 | 2024-08-22 12:40:15 -07:00 |
joachimchauvet | c24966c591 | update API documentation with examples to list/load models (#5902) | 2024-08-21 15:33:45 -03:00 |
oobabooga | 5522584992 | Merge pull request #6339 from oobabooga/dev: Merge dev branch | 2024-08-20 11:20:52 -03:00 |
oobabooga | 1124f71cf3 | Update README.md | 2024-08-20 11:19:46 -03:00 |
oobabooga | 1b62cd8508 | Merge pull request #6337 from oobabooga/dev: Merge dev branch | 2024-08-20 01:54:47 -03:00 |
oobabooga | d9a031fcad | Update README.md | 2024-08-20 01:52:30 -03:00 |
oobabooga | 073694bf15 | Merge pull request #6336 from oobabooga/dev: Merge dev branch | 2024-08-20 01:27:58 -03:00 |
oobabooga | 9d99156ca3 | Update README.md | 2024-08-20 01:27:02 -03:00 |
oobabooga | 406995f722 | Update README | 2024-08-19 21:24:01 -07:00 |
oobabooga | 1b1518aa6a | Update README.md | 2024-08-20 00:36:18 -03:00 |
oobabooga | 5058269143 | Merge remote-tracking branch 'refs/remotes/origin/dev' into dev | 2024-08-19 19:55:45 -07:00 |
oobabooga | fd9cb26619 | UI: update the DRY parameters descriptions/order | 2024-08-19 19:40:17 -07:00 |
dependabot[bot] | 64e16e9a46 | Update accelerate requirement from ==0.32.* to ==0.33.* (#6291) | 2024-08-19 23:34:10 -03:00 |
dependabot[bot] | 68f928b5e0 | Update peft requirement from ==0.8.* to ==0.12.* (#6292) | 2024-08-19 23:33:56 -03:00 |
oobabooga | 8bac1a9382 | Update README.md | 2024-08-19 23:10:04 -03:00 |
oobabooga | bb987ffe66 | Update README.md | 2024-08-19 23:06:52 -03:00 |
oobabooga | 4d8c1801c2 | Bump llama-cpp-python to 0.2.89 | 2024-08-19 17:45:01 -07:00 |
oobabooga | bf8187124d | Bump llama-cpp-python to 0.2.88 | 2024-08-13 12:40:18 -07:00 |
oobabooga | 089d5a9415 | Bump llama-cpp-python to 0.2.87 | 2024-08-07 20:36:28 -07:00 |
oobabooga | 81773f7f36 | Bump transformers to 4.44 | 2024-08-06 20:07:05 -07:00 |
oobabooga | e926c03b3d | Add a --tokenizer-dir command-line flag for llamacpp_HF | 2024-08-06 19:41:18 -07:00 |
oobabooga | f106e780ba | downloader: use 1 session for all files for better speed | 2024-08-06 19:41:12 -07:00 |
oobabooga | d011040f43 | Merge pull request #6300 from oobabooga/dev: Merge dev branch | 2024-08-01 02:26:12 -03:00 |
oobabooga | 608545d282 | Bump llama-cpp-python to 0.2.85 | 2024-07-31 18:44:46 -07:00 |
oobabooga | 30b4d8c8b2 | Fix Llama 3.1 template including lengthy "tools" headers | 2024-07-29 11:52:17 -07:00 |
oobabooga | f4d95f33b8 | downloader: better progress bar | 2024-07-28 22:21:56 -07:00 |
oobabooga | 9dcff21da9 | Remove unnecessary shared.previous_model_name variable | 2024-07-28 18:35:11 -07:00 |
oobabooga | addcb52c56 | Make --idle-timeout work for API requests | 2024-07-28 18:31:40 -07:00 |
oobabooga | 514fb2e451 | Fix UI error caused by --idle-timeout | 2024-07-28 18:30:06 -07:00 |
oobabooga | 3aa646c1d0 | UI: improve the style of headers in chat messages | 2024-07-28 15:26:15 -07:00 |
oobabooga | 92ab3a9a6a | Bump llama-cpp-python to 0.2.84 | 2024-07-28 15:13:06 -07:00 |
oobabooga | 5223c009fe | Minor change after previous commit | 2024-07-27 23:13:34 -07:00 |