mirror of
https://github.com/oobabooga/text-generation-webui.git
synced 2024-11-22 16:17:57 +01:00
a3eec62b50
* add support for other model types, dependent on future PEFT changes, with a fallback to the current function for now
* use encoding=utf8 for the training format
* make shuffling optional and describe dropout a bit more
* add eval_steps to control evaluation
* make callbacks not depend on globals
* make save steps controllable
* placeholder for initial loading-existing-model support, and variable name cleanup
* save/load parameters
* last bit of cleanup
* remove `gptq_bits` ref, as the main branch removed that setting
* add higher_rank_limit option: 2048 is basically unreachable due to VRAM, but I trained at 1536 with batch size = 1 on a 7B model. Note that it's in the do_train input just to save it as a parameter
* fix math on save_steps
api.py
callbacks.py
chat.py
deepspeed_parameters.py
extensions.py
GPTQ_loader.py
html_generator.py
llama_attn_hijack.py
llamacpp_model_alternative.py
llamacpp_model.py
LoRA.py
models.py
RWKV.py
shared.py
text_generation.py
training.py
ui.py