text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2024-12-04 21:10:30 +01:00

Author	SHA1	Message	Date
oobabooga	70f9565f37	Update README.md	2023-03-25 02:35:30 -03:00
oobabooga	25be9698c7	Fix LoRA on mps	2023-03-25 01:18:32 -03:00
oobabooga	3da633a497	Merge pull request #529 from EyeDeck/main Allow loading of .safetensors through GPTQ-for-LLaMa	2023-03-24 23:51:01 -03:00
jllllll	1e260544cd	Update install.bat Added C:\Windows\System32 to PATH to avoid issues with broken? Windows installs.	2023-03-24 21:25:14 -05:00
catalpaaa	d51cb8292b	Update server.py yea i should go to bed	2023-03-24 17:36:31 -07:00
catalpaaa	9e2963e0c8	Update server.py	2023-03-24 17:35:45 -07:00
catalpaaa	ec2a1facee	Update server.py	2023-03-24 17:34:33 -07:00
catalpaaa	b37c54edcf	lora-dir, model-dir and login auth Added lora-dir, model-dir, and a login auth arguments that points to a file contains usernames and passwords in the format of "u:pw,u:pw,..."	2023-03-24 17:30:18 -07:00
jllllll	fa916aa1de	Update INSTRUCTIONS.txt Added clarification on new variable added to download-model.bat.	2023-03-24 18:28:46 -05:00
jllllll	586775ad47	Update download-model.bat Removed redundant %ModelName% variable.	2023-03-24 18:25:49 -05:00
jllllll	bddbc2f898	Update start-webui.bat Updated virtual environment handling to use Micromamba.	2023-03-24 18:19:23 -05:00
jllllll	2604e3f7ac	Update download-model.bat Added variables for model selection and text only mode. Updated virtual environment handling to use Micromamba.	2023-03-24 18:15:24 -05:00
jllllll	24870e51ed	Update micromamba-cmd.bat Add cd command for admin.	2023-03-24 18:12:02 -05:00
jllllll	f0c82f06c3	Add files via upload Add script to open cmd within installation environment for easier modification.	2023-03-24 18:09:44 -05:00
oobabooga	9fa47c0eed	Revert GPTQ_loader.py (accident)	2023-03-24 19:57:12 -03:00
oobabooga	a6bf54739c	Revert models.py (accident)	2023-03-24 19:56:45 -03:00
jllllll	eec773b1f4	Update install.bat Corrected libbitsandbytes_cudaall.dll install.	2023-03-24 17:54:47 -05:00
oobabooga	0a16224451	Update GPTQ_loader.py	2023-03-24 19:54:36 -03:00
oobabooga	a80aa65986	Update models.py	2023-03-24 19:53:20 -03:00
jllllll	817e6c681e	Update install.bat Added `cd /D "%~dp0"` in case the script is ran as admin.	2023-03-24 17:51:13 -05:00
jllllll	a80a5465f2	Update install.bat Updated Conda packages and channels to install cuda-toolkit and override 12.0 cuda packages requested by pytorch with their 11.7 equivalent. Removed Conda installation since we can use the downloaded Micromamba.exe for the same purpose with a smaller footprint. Removed redundant PATH changes. Changed %gpuchoice% comparisons to be case-insensitive. Added additional error handling and removed the use of .tmp files. Added missing extension requirements. Added GPTQ installation. Will attempt to compile locally and, if failed, will download and install a precompiled wheel. Incorporated fixes from one-click-bandaid. Fixed and expanded first sed command from one-click-bandaid. libbitsandbytes_cudaall.dll is used here as the cuda116.dll used by one-click-bandaid does not work on my 1080ti. This can be changed if needed.	2023-03-24 17:27:29 -05:00
oobabooga	507db0929d	Do not use empty user messages in chat mode This allows the bot to send messages by clicking on Generate with empty inputs.	2023-03-24 17:22:22 -03:00
oobabooga	6e1b16c2aa	Update html_generator.py	2023-03-24 17:18:27 -03:00
oobabooga	ffb0187e83	Update chat.py	2023-03-24 17:17:29 -03:00
oobabooga	c14e598f14	Merge pull request #433 from mayaeary/fix/api-reload Fix api extension duplicating	2023-03-24 16:56:10 -03:00
oobabooga	bfe960731f	Merge branch 'main' into fix/api-reload	2023-03-24 16:54:41 -03:00
oobabooga	4a724ed22f	Reorder imports	2023-03-24 16:53:56 -03:00
oobabooga	8fad84abc2	Update extensions.py	2023-03-24 16:51:27 -03:00
oobabooga	d8e950d6bd	Don't load the model twice when using --lora	2023-03-24 16:30:32 -03:00
oobabooga	fd99995b01	Make the Stop button more consistent in chat mode	2023-03-24 15:59:27 -03:00
Forkoz	b740c5b284	Add display of context when input was generated Not sure if I did this right but it does move with the conversation and seems to match value.	2023-03-24 08:56:07 -05:00
oobabooga	4f5c2ce785	Fix chat_generation_attempts	2023-03-24 02:03:30 -03:00
oobabooga	04417b658b	Update README.md	2023-03-24 01:40:43 -03:00
oobabooga	bb4cb22453	Download .pt files using download-model.py (for 4-bit models)	2023-03-24 00:49:04 -03:00
oobabooga	143b5b5edf	Mention one-click-bandaid in the README	2023-03-23 23:28:50 -03:00
EyeDeck	dcfd866402	Allow loading of .safetensors through GPTQ-for-LLaMa	2023-03-23 21:31:34 -04:00
oobabooga	8747c74339	Another missing import	2023-03-23 22:19:01 -03:00
oobabooga	7078d168c3	Missing import	2023-03-23 22:16:08 -03:00
oobabooga	d1327f99f9	Fix broken callbacks.py	2023-03-23 22:12:24 -03:00
oobabooga	9bdb3c784d	Minor fix	2023-03-23 22:02:40 -03:00
oobabooga	b0abb327d8	Update LoRA.py	2023-03-23 22:02:09 -03:00
oobabooga	bf22d16ebc	Clear cache while switching LoRAs	2023-03-23 21:56:26 -03:00
oobabooga	4578e88ffd	Stop the bot from talking for you in chat mode	2023-03-23 21:38:20 -03:00
oobabooga	9bf6ecf9e2	Fix LoRA device map (attempt)	2023-03-23 16:49:41 -03:00
oobabooga	c5ebcc5f7e	Change the default names (#518 ) * Update shared.py * Update settings-template.json	2023-03-23 13:36:00 -03:00
Φφ	483d173d23	Code reuse + indication Now shows the message in the console when unloading weights. Also reload_model() calls unload_model() first to free the memory so that multiple reloads won't overfill it.	2023-03-23 07:06:26 +03:00
Φφ	1917b15275	Unload and reload models on request	2023-03-23 07:06:26 +03:00
oobabooga	29bd41d453	Fix LoRA in CPU mode	2023-03-23 01:05:13 -03:00
oobabooga	eac27f4f55	Make LoRAs work in 16-bit mode	2023-03-23 00:55:33 -03:00
oobabooga	bfa81e105e	Fix FlexGen streaming	2023-03-23 00:22:14 -03:00

... 41 42 43 44 45 ...

3153 Commits