Commit Graph

4006 Commits

Author SHA1 Message Date
Blazzycrafter
6f0f214408 - added available models into dummy models
- changed args and settings to the get method to make it more Robust and easier to use
2024-11-18 17:15:29 +01:00
Blazzycrafter
768124c4b0 Added model load logic for chat Completions 2024-11-18 17:12:41 +01:00
oobabooga
cc8c7ed209 Merge pull request #6491 from oobabooga/dev
Merge dev branch
2024-10-25 01:10:23 -03:00
oobabooga
3a92fa517b Merge remote-tracking branch 'refs/remotes/origin/dev' into dev 2024-10-24 11:26:21 -07:00
oobabooga
8deea2936d Remove lm_eval from requirements 2024-10-24 11:25:42 -07:00
PIRI
e1061ba7e3 Make token bans work again on HF loaders (#6488) 2024-10-24 15:24:02 -03:00
oobabooga
b50dc3bf57 Merge remote-tracking branch 'refs/remotes/origin/dev' into dev 2024-10-24 11:22:54 -07:00
oobabooga
386c0d8289 Bump transformers to 4.46 2024-10-24 11:09:09 -07:00
Paul Richardson
6a0837451e Minor Documentation update - query cuda compute for docker .env (#6469) 2024-10-15 10:39:00 -03:00
Molly Sophia
18f836b280 Add RWKV-World instruction template (#6456) 2024-10-14 17:51:20 -03:00
dependabot[bot]
e784938654 Update accelerate requirement from ==0.33.* to ==1.0.* (#6441) 2024-10-14 17:32:53 -03:00
oobabooga
f1a8eae04d Remove optimum from requirements 2024-10-14 13:30:45 -07:00
oobabooga
2468cfd8bb Merge remote-tracking branch 'refs/remotes/origin/dev' into dev 2024-10-14 13:25:27 -07:00
oobabooga
bb62e796eb Fix locally compiled llama-cpp-python failing to import 2024-10-14 13:24:13 -07:00
oobabooga
c9a9f63d1b Fix llama.cpp loader not being random (thanks @reydeljuego12345) 2024-10-14 13:07:07 -07:00
PIRI
03a2e70054 Fix temperature_last when temperature not in sampler priority (#6439) 2024-10-09 11:25:14 -03:00
Grzegorz Lippe
9d8b1c5fd9 Fix intel bug described in #6253 (#6433) 2024-10-05 11:58:17 -03:00
Luana
22baa5378f Fix for systems that have bash in a non-standard directory (#6428) 2024-10-03 00:35:13 -03:00
SeanScripts
e1338a1804 Add whisper turbo (#6423) 2024-10-01 17:49:35 -03:00
oobabooga
d1af7a41ad Merge pull request #6422 from oobabooga/dev
Merge dev branch
2024-10-01 15:21:53 -03:00
oobabooga
49dfa0adaf Fix the "save preset" event 2024-10-01 11:20:48 -07:00
oobabooga
93c250b9b6 Add a UI element for enable_tp 2024-10-01 11:16:15 -07:00
oobabooga
3b06cb4523 Merge pull request #6421 from oobabooga/dev
Merge dev branch
2024-10-01 14:48:41 -03:00
oobabooga
d364aa0a3c Lint 2024-10-01 10:22:57 -07:00
oobabooga
cca9d6e22d Lint 2024-10-01 10:21:06 -07:00
oobabooga
c6b50f88da Lint 2024-10-01 10:19:28 -07:00
oobabooga
7cb98351da Merge branch 'main' into dev 2024-10-01 14:18:32 -03:00
oobabooga
617cd7b705 Revert "Update accelerate requirement from ==0.33.* to ==0.34.* (#6416)"
This reverts commit 6063a66414.
2024-10-01 09:06:25 -07:00
dependabot[bot]
6063a66414 Update accelerate requirement from ==0.33.* to ==0.34.* (#6416) 2024-09-30 18:50:38 -03:00
oobabooga
4d9ce586d3 Update llama_cpp_python_hijack.py, fix llamacpp_hf 2024-09-30 14:49:21 -07:00
oobabooga
9ca0cd7749 Bump llama-cpp-python to 0.3.1 2024-09-29 20:47:04 -07:00
oobabooga
bbdeed3cf4 Make sampler priority high if unspecified 2024-09-29 20:45:27 -07:00
oobabooga
01362681f2 Bump exllamav2 to 0.2.4 2024-09-29 07:42:44 -07:00
Hanusz Leszek
e4b0467f9f Add beforeunload event to add confirmation dialog when leaving page (#6279) 2024-09-29 01:14:19 -03:00
Manuel Schmid
0f90a1b50f Do not set value for histories in chat when --multi-user is used (#6317) 2024-09-29 01:08:55 -03:00
oobabooga
055f3f5632 Fix after #6386 (thanks @Touch-Night) 2024-09-28 20:55:26 -07:00
oobabooga
57160cd6fa Update README 2024-09-28 20:50:41 -07:00
oobabooga
3f0571b62b Update README 2024-09-28 20:48:30 -07:00
oobabooga
3fb02f43f6 Update README 2024-09-28 20:38:43 -07:00
oobabooga
3b99532e02 Remove HQQ and AQLM from requirements 2024-09-28 20:34:59 -07:00
oobabooga
c61b29b9ce Simplify the warning when flash-attn fails to import 2024-09-28 20:33:17 -07:00
oobabooga
b92d7fd43e Add warnings for when AutoGPTQ, TensorRT-LLM, or HQQ are missing 2024-09-28 20:30:24 -07:00
oobabooga
65e5864084 Update README 2024-09-28 20:25:26 -07:00
oobabooga
1a870b3ea7 Remove AutoAWQ and AutoGPTQ from requirements (no wheels available) 2024-09-28 19:38:56 -07:00
oobabooga
85994e3ef0 Bump pytorch to 2.4.1 2024-09-28 09:44:08 -07:00
oobabooga
ca5a2dba72 Bump rocm to 6.1.2 2024-09-28 09:39:53 -07:00
oobabooga
7276dca933 Fix a typo 2024-09-27 20:28:17 -07:00
RandoInternetPreson
46996f6519 ExllamaV2 tensor parallelism to increase multi gpu inference speeds (#6356) 2024-09-28 00:26:03 -03:00
Philipp Emanuel Weidmann
301375834e Exclude Top Choices (XTC): A sampler that boosts creativity, breaks writing clichés, and inhibits non-verbatim repetition (#6335) 2024-09-27 22:50:12 -03:00
oobabooga
3492e33fd5 Bump bitsandbytes to 0.44 2024-09-27 16:59:30 -07:00