Commit Graph

3814 Commits

Author SHA1 Message Date
oobabooga
0589ff5b12
Bump llama-cpp-python to 0.2.19 & add min_p and typical_p parameters to llama.cpp loader (#4701) 2023-11-21 20:59:39 -03:00
oobabooga
2769a1fa25 Hide deprecated args from Session tab 2023-11-21 15:15:16 -08:00
oobabooga
0047d9f5e0 Do not install coqui_tts requirements by default
It breaks the one-click installer on Windows.
2023-11-21 15:13:42 -08:00
oobabooga
fb124ab6e2 Bump to flash-attention 2.3.4 + switch to Github Actions wheels on Windows (#4700) 2023-11-21 15:07:17 -08:00
oobabooga
e9cdaa2ada
Bump to flash-attention 2.3.4 + switch to Github Actions wheels on Windows (#4700) 2023-11-21 20:06:56 -03:00
oobabooga
b81d6ad8a4
Detect Orca 2 template (#4697) 2023-11-21 15:26:42 -03:00
oobabooga
360eeb9ff1
Merge pull request #4686 from oobabooga/dev
Merge dev branch
2023-11-21 08:38:50 -03:00
oobabooga
54a4eb60a3
Remove --no-dependencies from TTS installation command 2023-11-21 08:30:50 -03:00
oobabooga
efdd99623c
Merge pull request #4683 from oobabooga/dev
Merge dev branch
2023-11-21 00:36:58 -03:00
oobabooga
b02dc4dc0d Add --no-dependencies to TTS installation command 2023-11-20 19:02:12 -08:00
oobabooga
55f2a3643b Update multimodal API example 2023-11-20 18:41:09 -08:00
oobabooga
829c6d4f78 Add "remove_trailing_dots" option to XTTSv2 2023-11-20 18:33:29 -08:00
kanttouchthis
8dc9ec3491
add XTTSv2 (coqui_tts extension) (#4673)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-11-20 22:37:52 -03:00
oobabooga
ff24648510 Credit llama-cpp-python in the README 2023-11-20 12:13:15 -08:00
oobabooga
be78d79811 Revert accidental noavx2 changes 2023-11-20 11:48:04 -08:00
oobabooga
4b84e45116 Use +cpuavx2 instead of +cpuavx 2023-11-20 11:46:38 -08:00
oobabooga
d7f1bc102b
Fix "Illegal instruction" bug in llama.cpp CPU only version (#4677) 2023-11-20 16:36:38 -03:00
drew9781
5e70263e25
docker: install xformers with specific cuda version, matching the docker image. (#4670) 2023-11-19 21:43:15 -03:00
oobabooga
f11092ac2a
Merge pull request #4664 from oobabooga/dev
Merge dev branch
2023-11-19 15:12:55 -03:00
oobabooga
f0d66cf817 Add missing file 2023-11-19 10:12:13 -08:00
oobabooga
22e7a22d1e
Merge pull request #4662 from oobabooga/dev
Merge dev branch
2023-11-19 14:23:19 -03:00
oobabooga
a2e6d00128 Use convert_ids_to_tokens instead of decode in logits endpoint
This preserves the llama tokenizer spaces.
2023-11-19 09:22:08 -08:00
oobabooga
d1bba48a83
Merge pull request #4660 from oobabooga/dev
Merge dev branch
2023-11-19 13:32:08 -03:00
oobabooga
8cf05c1b31 Fix disappearing character gallery 2023-11-19 08:31:01 -08:00
oobabooga
9da7bb203d Minor LoRA bug fix 2023-11-19 07:59:29 -08:00
oobabooga
78af3b0a00 Update docs/What Works.md 2023-11-19 07:57:16 -08:00
oobabooga
a6f1e1bcc5 Fix PEFT LoRA unloading 2023-11-19 07:55:25 -08:00
oobabooga
a290d17386 Add hover cursor to bot pfp 2023-11-19 06:56:42 -08:00
oobabooga
ab94f0d9bf Minor style change 2023-11-18 21:11:04 -08:00
oobabooga
5fcee696ea
New feature: enlarge character pictures on click (#4654) 2023-11-19 02:05:17 -03:00
Jordan Tucker
cb836dd49c
fix: use shared chat-instruct_command with api (#4653) 2023-11-19 01:19:10 -03:00
oobabooga
771e62e476
Add /v1/internal/lora endpoints (#4652) 2023-11-19 00:35:22 -03:00
oobabooga
ef6feedeb2
Add --nowebui flag for pure API mode (#4651) 2023-11-18 23:38:39 -03:00
oobabooga
0fa1af296c
Add /v1/internal/logits endpoint (#4650) 2023-11-18 23:19:31 -03:00
oobabooga
8f4f4daf8b
Add --admin-key flag for API (#4649) 2023-11-18 22:33:27 -03:00
wizd
af76fbedb8
Openai embedding fix to support jina-embeddings-v2 (#4642) 2023-11-18 20:24:29 -03:00
Jordan Tucker
baab894759
fix: use system message in chat-instruct mode (#4648) 2023-11-18 20:20:13 -03:00
oobabooga
47d9e2618b Refresh the Preset menu after saving a preset 2023-11-18 14:03:42 -08:00
oobabooga
83b64e7fc1
New feature: "random preset" button (#4647) 2023-11-18 18:31:41 -03:00
oobabooga
d1a58da52f Update ancient Docker instructions 2023-11-17 19:52:53 -08:00
oobabooga
e0ca49ed9c
Bump llama-cpp-python to 0.2.18 (2nd attempt) (#4637)
* Update requirements*.txt

* Add back seed
2023-11-18 00:31:27 -03:00
oobabooga
3146124ec0
Merge pull request #4632 from oobabooga/dev
Merge dev branch
2023-11-17 10:18:31 -03:00
oobabooga
9d6f79db74 Revert "Bump llama-cpp-python to 0.2.18 (#4611)"
This reverts commit 923c8e25fb.
2023-11-17 05:14:25 -08:00
oobabooga
e0a7cc5e0f Simplify CORS code 2023-11-16 20:11:55 -08:00
oobabooga
13dc3b61da Update README 2023-11-16 19:57:55 -08:00
oobabooga
8b66d83aa9 Set use_fast=True by default, create --no_use_fast flag
This increases tokens/second for HF loaders.
2023-11-16 19:55:28 -08:00
oobabooga
f889302d24
Merge pull request #4628 from oobabooga/dev
Merge dev branch
2023-11-16 23:47:07 -03:00
oobabooga
b2ce8dc7ee Update a message 2023-11-16 18:46:26 -08:00
oobabooga
0ee8d2b66b
Merge pull request #4627 from oobabooga/dev
Merge dev branch
2023-11-16 23:41:18 -03:00
oobabooga
780b00e1cf Minor bug fix 2023-11-16 18:39:39 -08:00