text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2024-12-26 22:30:44 +01:00

Author	SHA1	Message	Date
oobabooga	aa809e420e	Bump llama-cpp-python to 0.2.83, add back tensorcore wheels Also add back the progress bar patch	2024-07-22 18:05:11 -07:00
oobabooga	11bbf71aa5	Bump back llama-cpp-python (#6257 )	2024-07-22 16:19:41 -03:00
oobabooga	0f53a736c1	Revert the llama-cpp-python update	2024-07-22 12:02:25 -07:00
oobabooga	a687f950ba	Remove the tensorcores llama.cpp wheels They are not faster than the default wheels anymore and they use a lot of space.	2024-07-22 11:54:35 -07:00
oobabooga	7d2449f8b0	Bump llama-cpp-python to 0.2.82.3 (unofficial build)	2024-07-22 11:49:20 -07:00
dependabot[bot]	1c3671699c	Bump hqq from 0.1.7.post3 to 0.1.8 (#6238 )	2024-07-20 18:20:26 -03:00
oobabooga	b19d239a60	Bump flash-attention to 2.6.1	2024-07-12 20:16:11 -07:00
dependabot[bot]	063d2047dd	Update accelerate requirement from ==0.31.* to ==0.32.* (#6217 )	2024-07-11 19:56:42 -03:00
oobabooga	01e4721da7	Bump ExLlamaV2 to 0.1.7	2024-07-11 12:33:46 -07:00
oobabooga	fa075e41f4	Bump llama-cpp-python to 0.2.82	2024-07-10 06:03:24 -07:00
oobabooga	7e22eaa36c	Bump llama-cpp-python to 0.2.81	2024-07-02 20:29:35 -07:00
dependabot[bot]	9660f6f10e	Bump aqlm[cpu,gpu] from 1.1.5 to 1.1.6 (#6157 )	2024-06-27 21:13:02 -03:00
dependabot[bot]	a5df8f4e3c	Bump jinja2 from 3.1.2 to 3.1.4 (#6172 )	2024-06-27 21:12:39 -03:00
dependabot[bot]	c6cec0588c	Update accelerate requirement from ==0.30.* to ==0.31.* (#6156 )	2024-06-27 21:12:02 -03:00
oobabooga	66090758df	Bump transformers to 4.42 (for gemma support)	2024-06-27 11:26:02 -07:00
oobabooga	602b455507	Bump llama-cpp-python to 0.2.79	2024-06-24 20:26:38 -07:00
oobabooga	7db8b3b532	Bump ExLlamaV2 to 0.1.6	2024-06-24 05:38:11 -07:00
oobabooga	125bb7b03b	Revert "Bump llama-cpp-python to 0.2.78" This reverts commit `b6eaf7923e`.	2024-06-23 19:54:28 -07:00
oobabooga	b6eaf7923e	Bump llama-cpp-python to 0.2.78	2024-06-14 21:22:09 -07:00
oobabooga	9420973b62	Downgrade PyTorch to 2.2.2 (#6124 )	2024-06-14 16:42:03 -03:00
dependabot[bot]	fdd8fab9cf	Bump hqq from 0.1.7.post2 to 0.1.7.post3 (#6090 )	2024-06-14 13:46:35 -03:00
oobabooga	8930bfc5f4	Bump PyTorch, ExLlamaV2, flash-attention (#6122 )	2024-06-13 20:38:31 -03:00
oobabooga	bd7cc4234d	Backend cleanup (#6025 )	2024-05-21 13:32:02 -03:00
dependabot[bot]	2de586f586	Update accelerate requirement from ==0.27.* to ==0.30.* (#5989 )	2024-05-19 20:03:18 -03:00
oobabooga	0d90b3a25c	Bump llama-cpp-python to 0.2.75	2024-05-18 05:26:26 -07:00
oobabooga	9557f49f2f	Bump llama-cpp-python to 0.2.73	2024-05-11 10:53:19 -07:00
oobabooga	e61055253c	Bump llama-cpp-python to 0.2.69, add --flash-attn option	2024-05-03 04:31:22 -07:00
oobabooga	0476f9fe70	Bump ExLlamaV2 to 0.0.20	2024-05-01 16:20:50 -07:00
oobabooga	ae0f28530c	Bump llama-cpp-python to 0.2.68	2024-05-01 08:40:50 -07:00
oobabooga	51fb766bea	Add back my llama-cpp-python wheels, bump to 0.2.65 (#5964 )	2024-04-30 09:11:31 -03:00
oobabooga	9b623b8a78	Bump llama-cpp-python to 0.2.64, use official wheels (#5921 )	2024-04-23 23:17:05 -03:00
Ashley Kleynhans	0877741b03	Bumped ExLlamaV2 to version 0.0.19 to resolve #5851 (#5880 )	2024-04-19 19:04:40 -03:00
oobabooga	b30bce3b2f	Bump transformers to 4.40	2024-04-18 16:19:31 -07:00
Philipp Emanuel Weidmann	a0c69749e6	Revert sse-starlette version bump because it breaks API request cancellation (#5873 )	2024-04-18 15:05:00 -03:00
dependabot[bot]	597556cb77	Bump sse-starlette from 1.6.5 to 2.1.0 (#5831 )	2024-04-11 18:54:05 -03:00
oobabooga	3e3a7c4250	Bump llama-cpp-python to 0.2.61 & fix the crash	2024-04-11 14:15:34 -07:00
oobabooga	5f5ceaf025	Revert "Bump llama-cpp-python to 0.2.61" This reverts commit `3ae61c0338`.	2024-04-11 13:24:57 -07:00
dependabot[bot]	bd71a504b8	Update gradio requirement from ==4.25.* to ==4.26.* (#5832 )	2024-04-11 02:24:53 -03:00
oobabooga	3ae61c0338	Bump llama-cpp-python to 0.2.61	2024-04-10 21:39:46 -07:00
oobabooga	ed4001e324	Bump ExLlamaV2 to 0.0.18	2024-04-08 18:05:16 -07:00
oobabooga	f6828de3f2	Downgrade llama-cpp-python to 0.2.56	2024-04-07 07:00:12 -07:00
Jared Van Bortel	39ff9c9dcf	requirements: add psutil (#5819 )	2024-04-06 23:02:20 -03:00
oobabooga	dfb01f9a63	Bump llama-cpp-python to 0.2.60	2024-04-06 18:32:36 -07:00
dependabot[bot]	a4c67e1974	Bump aqlm[cpu,gpu] from 1.1.2 to 1.1.3 (#5790 )	2024-04-05 13:26:49 -03:00
oobabooga	14f6194211	Bump Gradio to 4.25	2024-04-05 09:22:44 -07:00
oobabooga	d423021a48	Remove CTransformers support (#5807 )	2024-04-04 20:23:58 -03:00
oobabooga	3952560da8	Bump llama-cpp-python to 0.2.59	2024-04-04 11:20:48 -07:00
oobabooga	70c58b5fc2	Bump ExLlamaV2 to 0.0.17	2024-03-30 21:08:26 -07:00
oobabooga	3ce0d9221b	Bump transformers to 4.39	2024-03-28 19:40:31 -07:00
dependabot[bot]	3609ea69e4	Bump aqlm[cpu,gpu] from 1.1.0 to 1.1.2 (#5728 )	2024-03-26 16:36:16 -03:00
oobabooga	2a92a842ce	Bump gradio to 4.23 (#5758 )	2024-03-26 16:32:20 -03:00
oobabooga	a102c704f5	Add numba to requirements.txt	2024-03-10 16:13:29 -07:00
oobabooga	b3ade5832b	Keep AQLM only for Linux (fails to install on Windows)	2024-03-10 09:41:17 -07:00
oobabooga	67b24b0b88	Bump llama-cpp-python to 0.2.56	2024-03-10 09:07:27 -07:00
oobabooga	763f9beb7e	Bump bitsandbytes to 0.43, add official Windows wheel	2024-03-10 08:30:53 -07:00
oobabooga	9271e80914	Add back AutoAWQ for Windows https://github.com/casper-hansen/AutoAWQ/issues/377#issuecomment-1986440695	2024-03-08 14:54:56 -08:00
oobabooga	d0663bae31	Bump AutoAWQ to 0.2.3 (Linux only) (#5658 )	2024-03-08 17:36:28 -03:00
oobabooga	0e6eb7c27a	Add AQLM support (transformers loader) (#5466 )	2024-03-08 17:30:36 -03:00
oobabooga	bde7f00cae	Change the exllamav2 version number	2024-03-06 21:08:29 -08:00
oobabooga	2ec1d96c91	Add cache_4bit option for ExLlamaV2 (#5645 )	2024-03-06 23:02:25 -03:00
oobabooga	2174958362	Revert gradio to 3.50.2 (#5640 )	2024-03-06 11:52:46 -03:00
oobabooga	03f03af535	Revert "Update peft requirement from ==0.8.* to ==0.9.* (#5626 )" This reverts commit `72a498ddd4`.	2024-03-05 02:56:37 -08:00
oobabooga	ae12d045ea	Merge remote-tracking branch 'refs/remotes/origin/dev' into dev	2024-03-05 02:35:04 -08:00
dependabot[bot]	72a498ddd4	Update peft requirement from ==0.8.* to ==0.9.* (#5626 )	2024-03-05 07:34:32 -03:00
oobabooga	1437f757a1	Bump HQQ to 0.1.5	2024-03-05 02:33:51 -08:00
oobabooga	63a1d4afc8	Bump gradio to 4.19 (#5522 )	2024-03-05 07:32:28 -03:00
oobabooga	527ba98105	Do not install extensions requirements by default (#5621 )	2024-03-04 04:46:39 -03:00
oobabooga	8bd4960d05	Update PyTorch to 2.2 (also update flash-attn to 2.5.6) (#5618 )	2024-03-03 19:40:32 -03:00
oobabooga	70047a5c57	Bump bitsandytes to 0.42.0 on Windows	2024-03-03 13:19:27 -08:00
oobabooga	24e86bb21b	Bump llama-cpp-python to 0.2.55	2024-03-03 12:14:48 -08:00
oobabooga	314e42fd98	Fix transformers requirement	2024-03-03 10:49:28 -08:00
dependabot[bot]	dfdf6eb5b4	Bump hqq from 0.1.3 to 0.1.3.post1 (#5582 )	2024-02-26 20:51:39 -03:00
oobabooga	332957ffec	Bump llama-cpp-python to 0.2.52	2024-02-26 15:05:53 -08:00
Bartowski	21acf504ce	Bump transformers to 4.38 for gemma compatibility (#5575 )	2024-02-25 20:15:13 -03:00
oobabooga	c07dc56736	Bump llama-cpp-python to 0.2.50	2024-02-24 21:34:11 -08:00
oobabooga	98580cad8e	Bump exllamav2 to 0.0.14	2024-02-24 18:35:42 -08:00
oobabooga	527f2652af	Bump llama-cpp-python to 0.2.47	2024-02-22 19:48:49 -08:00
oobabooga	3f42e3292a	Revert "Bump autoawq from 0.1.8 to 0.2.2 (#5547 )" This reverts commit `d04fef6a07`.	2024-02-22 19:48:04 -08:00
dependabot[bot]	5f7dbf454a	Update optimum requirement from ==1.16.* to ==1.17.* (#5548 )	2024-02-19 19:15:21 -03:00
dependabot[bot]	d04fef6a07	Bump autoawq from 0.1.8 to 0.2.2 (#5547 )	2024-02-19 19:14:55 -03:00
dependabot[bot]	ed6ff49431	Update accelerate requirement from ==0.25.* to ==0.27.* (#5546 )	2024-02-19 19:14:04 -03:00
oobabooga	0b2279d031	Bump llama-cpp-python to 0.2.44	2024-02-19 13:42:31 -08:00
oobabooga	c375c753d6	Bump bitsandbytes to 0.42 (Linux only)	2024-02-16 10:47:57 -08:00
oobabooga	080f7132c0	Revert gradio to 3.50.2 (#5513 )	2024-02-15 20:40:23 -03:00
oobabooga	ea0e1feee7	Bump llama-cpp-python to 0.2.43	2024-02-14 21:58:24 -08:00
oobabooga	549f106879	Bump ExLlamaV2 to v0.0.13.2	2024-02-14 21:57:48 -08:00
DominikKowalczyk	33c4ce0720	Bump gradio to 4.19 (#5419 ) --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>	2024-02-14 23:28:26 -03:00
oobabooga	04d8bdf929	Fix ExLlamaV2 requirement on Windows	2024-02-14 06:31:20 -08:00
oobabooga	193548edce	Minor fix to ExLlamaV2 requirements	2024-02-13 16:00:06 -08:00
oobabooga	25b655faeb	Merge remote-tracking branch 'refs/remotes/origin/dev' into dev	2024-02-13 15:49:53 -08:00
oobabooga	f99f1fc68e	Bump llama-cpp-python to 0.2.42	2024-02-13 15:49:20 -08:00
dependabot[bot]	d8081e85ec	Update peft requirement from ==0.7.* to ==0.8.* (#5446 )	2024-02-13 16:27:18 -03:00
dependabot[bot]	653b195b1e	Update numpy requirement from ==1.24.* to ==1.26.* (#5490 )	2024-02-13 16:26:35 -03:00
dependabot[bot]	147b4cf3e0	Bump hqq from 0.1.2.post1 to 0.1.3 (#5489 )	2024-02-13 16:25:02 -03:00
oobabooga	e9fea353c5	Bump llama-cpp-python to 0.2.40	2024-02-13 11:22:34 -08:00
oobabooga	acea6a6669	Add more exllamav2 wheels	2024-02-07 08:24:29 -08:00
oobabooga	35537ad3d1	Bump exllamav2 to 0.0.13.1 (#5463 )	2024-02-07 13:17:04 -03:00
oobabooga	b8e25e8678	Bump llama-cpp-python to 0.2.39	2024-02-07 06:50:47 -08:00
oobabooga	a210999255	Bump safetensors version	2024-02-04 18:40:25 -08:00
oobabooga	e98d1086f5	Bump llama-cpp-python to 0.2.38 (#5420 )	2024-02-01 20:09:30 -03:00

1 2 3 4 5

227 Commits