hronoas
|
9b3a3d8f12
|
openai extension fix: Handle Multiple Content Items in Messages (#6528)
|
2024-11-18 11:59:52 -03:00 |
|
oobabooga
|
5fa9336dab
|
Bump flash-attention to 2.7.0.post2
|
2024-11-18 06:55:29 -08:00 |
|
oobabooga
|
0c48ecf359
|
Bump exllamav2 to 0.2.4
|
2024-11-18 06:51:56 -08:00 |
|
oobabooga
|
8d5cf7b134
|
Bump llama-cpp-python to 0.3.2
|
2024-11-18 06:51:06 -08:00 |
|
oobabooga
|
3a92fa517b
|
Merge remote-tracking branch 'refs/remotes/origin/dev' into dev
|
2024-10-24 11:26:21 -07:00 |
|
oobabooga
|
8deea2936d
|
Remove lm_eval from requirements
|
2024-10-24 11:25:42 -07:00 |
|
PIRI
|
e1061ba7e3
|
Make token bans work again on HF loaders (#6488)
|
2024-10-24 15:24:02 -03:00 |
|
oobabooga
|
b50dc3bf57
|
Merge remote-tracking branch 'refs/remotes/origin/dev' into dev
|
2024-10-24 11:22:54 -07:00 |
|
oobabooga
|
386c0d8289
|
Bump transformers to 4.46
|
2024-10-24 11:09:09 -07:00 |
|
Paul Richardson
|
6a0837451e
|
Minor Documentation update - query cuda compute for docker .env (#6469)
|
2024-10-15 10:39:00 -03:00 |
|
Molly Sophia
|
18f836b280
|
Add RWKV-World instruction template (#6456)
|
2024-10-14 17:51:20 -03:00 |
|
dependabot[bot]
|
e784938654
|
Update accelerate requirement from ==0.33.* to ==1.0.* (#6441)
|
2024-10-14 17:32:53 -03:00 |
|
oobabooga
|
f1a8eae04d
|
Remove optimum from requirements
|
2024-10-14 13:30:45 -07:00 |
|
oobabooga
|
2468cfd8bb
|
Merge remote-tracking branch 'refs/remotes/origin/dev' into dev
|
2024-10-14 13:25:27 -07:00 |
|
oobabooga
|
bb62e796eb
|
Fix locally compiled llama-cpp-python failing to import
|
2024-10-14 13:24:13 -07:00 |
|
oobabooga
|
c9a9f63d1b
|
Fix llama.cpp loader not being random (thanks @reydeljuego12345)
|
2024-10-14 13:07:07 -07:00 |
|
PIRI
|
03a2e70054
|
Fix temperature_last when temperature not in sampler priority (#6439)
|
2024-10-09 11:25:14 -03:00 |
|
Grzegorz Lippe
|
9d8b1c5fd9
|
Fix intel bug described in #6253 (#6433)
|
2024-10-05 11:58:17 -03:00 |
|
Luana
|
22baa5378f
|
Fix for systems that have bash in a non-standard directory (#6428)
|
2024-10-03 00:35:13 -03:00 |
|
SeanScripts
|
e1338a1804
|
Add whisper turbo (#6423)
|
2024-10-01 17:49:35 -03:00 |
|
oobabooga
|
49dfa0adaf
|
Fix the "save preset" event
|
2024-10-01 11:20:48 -07:00 |
|
oobabooga
|
93c250b9b6
|
Add a UI element for enable_tp
|
2024-10-01 11:16:15 -07:00 |
|
oobabooga
|
d364aa0a3c
|
Lint
|
2024-10-01 10:22:57 -07:00 |
|
oobabooga
|
cca9d6e22d
|
Lint
|
2024-10-01 10:21:06 -07:00 |
|
oobabooga
|
c6b50f88da
|
Lint
|
2024-10-01 10:19:28 -07:00 |
|
oobabooga
|
7cb98351da
|
Merge branch 'main' into dev
|
2024-10-01 14:18:32 -03:00 |
|
oobabooga
|
617cd7b705
|
Revert "Update accelerate requirement from ==0.33.* to ==0.34.* (#6416)"
This reverts commit 6063a66414 .
|
2024-10-01 09:06:25 -07:00 |
|
dependabot[bot]
|
6063a66414
|
Update accelerate requirement from ==0.33.* to ==0.34.* (#6416)
|
2024-09-30 18:50:38 -03:00 |
|
oobabooga
|
4d9ce586d3
|
Update llama_cpp_python_hijack.py, fix llamacpp_hf
|
2024-09-30 14:49:21 -07:00 |
|
oobabooga
|
9ca0cd7749
|
Bump llama-cpp-python to 0.3.1
|
2024-09-29 20:47:04 -07:00 |
|
oobabooga
|
bbdeed3cf4
|
Make sampler priority high if unspecified
|
2024-09-29 20:45:27 -07:00 |
|
oobabooga
|
01362681f2
|
Bump exllamav2 to 0.2.4
|
2024-09-29 07:42:44 -07:00 |
|
Hanusz Leszek
|
e4b0467f9f
|
Add beforeunload event to add confirmation dialog when leaving page (#6279)
|
2024-09-29 01:14:19 -03:00 |
|
Manuel Schmid
|
0f90a1b50f
|
Do not set value for histories in chat when --multi-user is used (#6317)
|
2024-09-29 01:08:55 -03:00 |
|
oobabooga
|
055f3f5632
|
Fix after #6386 (thanks @Touch-Night)
|
2024-09-28 20:55:26 -07:00 |
|
oobabooga
|
57160cd6fa
|
Update README
|
2024-09-28 20:50:41 -07:00 |
|
oobabooga
|
3f0571b62b
|
Update README
|
2024-09-28 20:48:30 -07:00 |
|
oobabooga
|
3fb02f43f6
|
Update README
|
2024-09-28 20:38:43 -07:00 |
|
oobabooga
|
3b99532e02
|
Remove HQQ and AQLM from requirements
|
2024-09-28 20:34:59 -07:00 |
|
oobabooga
|
c61b29b9ce
|
Simplify the warning when flash-attn fails to import
|
2024-09-28 20:33:17 -07:00 |
|
oobabooga
|
b92d7fd43e
|
Add warnings for when AutoGPTQ, TensorRT-LLM, or HQQ are missing
|
2024-09-28 20:30:24 -07:00 |
|
oobabooga
|
65e5864084
|
Update README
|
2024-09-28 20:25:26 -07:00 |
|
oobabooga
|
1a870b3ea7
|
Remove AutoAWQ and AutoGPTQ from requirements (no wheels available)
|
2024-09-28 19:38:56 -07:00 |
|
oobabooga
|
85994e3ef0
|
Bump pytorch to 2.4.1
|
2024-09-28 09:44:08 -07:00 |
|
oobabooga
|
ca5a2dba72
|
Bump rocm to 6.1.2
|
2024-09-28 09:39:53 -07:00 |
|
oobabooga
|
7276dca933
|
Fix a typo
|
2024-09-27 20:28:17 -07:00 |
|
RandoInternetPreson
|
46996f6519
|
ExllamaV2 tensor parallelism to increase multi gpu inference speeds (#6356)
|
2024-09-28 00:26:03 -03:00 |
|
Philipp Emanuel Weidmann
|
301375834e
|
Exclude Top Choices (XTC): A sampler that boosts creativity, breaks writing clichés, and inhibits non-verbatim repetition (#6335)
|
2024-09-27 22:50:12 -03:00 |
|
oobabooga
|
3492e33fd5
|
Bump bitsandbytes to 0.44
|
2024-09-27 16:59:30 -07:00 |
|
Thireus ☠
|
626b0a0437
|
Force /bin/bash shell for conda (#6386)
|
2024-09-27 19:47:04 -03:00 |
|