Commit Graph

3943 Commits

Author SHA1 Message Date
pandora
b98635d823
Fixing Mistral Templates
Hi there, fixing the templates to match the ground truth as closely as possible. You can find more information about the templates here: https://github.com/mistralai/cookbook/blob/main/concept-deep-dive/tokenization/chat_templates.md

Still needs to be verified, so please don't merge yet!
2024-09-21 18:48:28 +02:00
oobabooga
a50477ec85 Apply the change to all requirements (oops) 2024-09-06 18:47:25 -07:00
oobabooga
e86ab37aaf Merge remote-tracking branch 'refs/remotes/origin/dev' into dev 2024-09-06 18:44:43 -07:00
oobabooga
27797a92d0 Pin fastapi/pydantic requirement versions 2024-09-06 18:38:57 -07:00
Jean-Sylvain Boige
4924ee2901 typo in OpenAI response format (#6365) 2024-09-05 21:42:23 -03:00
oobabooga
bba5b36d33 Don't import PEFT unless necessary 2024-09-03 19:40:53 -07:00
oobabooga
c5b40eb555 llama.cpp: prevent prompt evaluation progress bar with just 1 step 2024-09-03 17:37:06 -07:00
oobabooga
2cb8d4c96e Bump llama-cpp-python to 0.2.90 2024-09-03 05:53:18 -07:00
oobabooga
64919e0d69 Bump flash-attention to 2.6.3 2024-09-03 05:51:46 -07:00
oobabooga
68d52c60f3 Merge remote-tracking branch 'refs/remotes/origin/dev' into dev 2024-09-02 21:16:39 -07:00
oobabooga
d1168afa76 Bump ExLlamaV2 to 0.2.0 2024-09-02 21:15:51 -07:00
Stefan Merettig
9a150c3368 API: Relax multimodal format, fixes HuggingFace Chat UI (#6353) 2024-09-02 23:03:15 -03:00
GralchemOz
4c74c7a116 Fix UnicodeDecodeError for BPE-based Models (especially GLM-4) (#6357) 2024-09-02 23:00:59 -03:00
FartyPants (FP HAM)
41a8eb4eeb Training pro update script.py (#6359) 2024-09-02 23:00:15 -03:00
oobabooga
1f288b4072 Bump ExLlamaV2 to 0.1.9 2024-08-22 12:40:15 -07:00
joachimchauvet
c24966c591 update API documentation with examples to list/load models (#5902) 2024-08-21 15:33:45 -03:00
oobabooga
1124f71cf3 Update README.md 2024-08-20 11:19:46 -03:00
oobabooga
d9a031fcad Update README.md 2024-08-20 01:52:30 -03:00
oobabooga
9d99156ca3 Update README.md 2024-08-20 01:27:02 -03:00
oobabooga
406995f722 Update README 2024-08-19 21:24:01 -07:00
oobabooga
1b1518aa6a Update README.md 2024-08-20 00:36:18 -03:00
oobabooga
5058269143 Merge remote-tracking branch 'refs/remotes/origin/dev' into dev 2024-08-19 19:55:45 -07:00
oobabooga
fd9cb26619 UI: update the DRY parameters descriptions/order 2024-08-19 19:40:17 -07:00
dependabot[bot]
64e16e9a46 Update accelerate requirement from ==0.32.* to ==0.33.* (#6291) 2024-08-19 23:34:10 -03:00
dependabot[bot]
68f928b5e0 Update peft requirement from ==0.8.* to ==0.12.* (#6292) 2024-08-19 23:33:56 -03:00
oobabooga
8bac1a9382 Update README.md 2024-08-19 23:10:04 -03:00
oobabooga
bb987ffe66 Update README.md 2024-08-19 23:06:52 -03:00
oobabooga
4d8c1801c2 Bump llama-cpp-python to 0.2.89 2024-08-19 17:45:01 -07:00
oobabooga
bf8187124d Bump llama-cpp-python to 0.2.88 2024-08-13 12:40:18 -07:00
oobabooga
089d5a9415 Bump llama-cpp-python to 0.2.87 2024-08-07 20:36:28 -07:00
oobabooga
81773f7f36 Bump transformers to 4.44 2024-08-06 20:07:05 -07:00
oobabooga
e926c03b3d Add a --tokenizer-dir command-line flag for llamacpp_HF 2024-08-06 19:41:18 -07:00
oobabooga
f106e780ba downloader: use 1 session for all files for better speed 2024-08-06 19:41:12 -07:00
oobabooga
608545d282 Bump llama-cpp-python to 0.2.85 2024-07-31 18:44:46 -07:00
oobabooga
30b4d8c8b2 Fix Llama 3.1 template including lengthy "tools" headers 2024-07-29 11:52:17 -07:00
oobabooga
f4d95f33b8 downloader: better progress bar 2024-07-28 22:21:56 -07:00
oobabooga
9dcff21da9 Remove unnecessary shared.previous_model_name variable 2024-07-28 18:35:11 -07:00
oobabooga
addcb52c56 Make --idle-timeout work for API requests 2024-07-28 18:31:40 -07:00
oobabooga
514fb2e451 Fix UI error caused by --idle-timeout 2024-07-28 18:30:06 -07:00
oobabooga
3aa646c1d0 UI: improve the style of headers in chat messages 2024-07-28 15:26:15 -07:00
oobabooga
92ab3a9a6a Bump llama-cpp-python to 0.2.84 2024-07-28 15:13:06 -07:00
oobabooga
5223c009fe Minor change after previous commit 2024-07-27 23:13:34 -07:00
oobabooga
7050bb880e UI: make n_ctx/max_seq_len/truncation_length numbers rather than sliders 2024-07-27 23:11:53 -07:00
Harry
078e8c8969 Make compress_pos_emb float (#6276) 2024-07-28 03:03:19 -03:00
oobabooga
ffc713f72b UI: fix multiline LaTeX equations 2024-07-27 15:36:10 -07:00
oobabooga
493f8c3242 UI: remove animation after clicking on "Stop" in the Chat tab 2024-07-27 15:22:34 -07:00
oobabooga
e4d411b841 UI: fix rendering LaTeX enclosed between \[ and \] 2024-07-27 15:21:44 -07:00
oobabooga
6bab4c2faa UI: add back single $ for equations 2024-07-26 23:03:53 -07:00
oobabooga
f32d26240d UI: Fix the chat "stop" event 2024-07-26 23:03:05 -07:00
oobabooga
9e82f8c394 UI: Fix chat sometimes not scrolling down after sending a message 2024-07-26 22:35:30 -07:00