Forkoz
|
74ea7522a0
|
Lora fixes for AutoGPTQ (#2818)
|
2023-07-09 01:03:43 -03:00 |
|
oobabooga
|
5ac4e4da8b
|
Make --model work with argument like models/folder_name
|
2023-07-08 10:22:54 -07:00 |
|
oobabooga
|
b6643e5039
|
Add decode functions to llama.cpp/exllama
|
2023-07-07 09:11:30 -07:00 |
|
oobabooga
|
1ba2e88551
|
Add truncation to exllama
|
2023-07-07 09:09:23 -07:00 |
|
oobabooga
|
c21b73ff37
|
Minor change to ui.py
|
2023-07-07 09:09:14 -07:00 |
|
oobabooga
|
de994331a4
|
Merge remote-tracking branch 'refs/remotes/origin/main'
|
2023-07-06 22:25:43 -07:00 |
|
oobabooga
|
9aee1064a3
|
Block a cloudfare request
|
2023-07-06 22:24:52 -07:00 |
|
Fernando Tarin Morales
|
d7e14e1f78
|
Fixed the param name when loading a LoRA using a model loaded in 4 or 8 bits (#3036)
|
2023-07-07 02:24:07 -03:00 |
|
Xiaojian "JJ" Deng
|
ff45317032
|
Update models.py (#3020)
Hopefully fixed error with "ValueError: Tokenizer class GPTNeoXTokenizer does not exist or is not currently
imported."
|
2023-07-05 21:40:43 -03:00 |
|
oobabooga
|
8705eba830
|
Remove universal llama tokenizer support
Instead replace it with a warning if the tokenizer files look off
|
2023-07-04 19:43:19 -07:00 |
|
oobabooga
|
333075e726
|
Fix #3003
|
2023-07-04 11:38:35 -03:00 |
|
oobabooga
|
463ddfffd0
|
Fix start_with
|
2023-07-03 23:32:02 -07:00 |
|
oobabooga
|
373555c4fb
|
Fix loading some histories (thanks kaiokendev)
|
2023-07-03 22:19:28 -07:00 |
|
Panchovix
|
10c8c197bf
|
Add Support for Static NTK RoPE scaling for exllama/exllama_hf (#2955)
|
2023-07-04 01:13:16 -03:00 |
|
oobabooga
|
7e8340b14d
|
Make greetings appear in --multi-user mode
|
2023-07-03 20:08:14 -07:00 |
|
oobabooga
|
4b1804a438
|
Implement sessions + add basic multi-user support (#2991)
|
2023-07-04 00:03:30 -03:00 |
|
FartyPants
|
1f8cae14f9
|
Update training.py - correct use of lora_names (#2988)
|
2023-07-03 17:41:18 -03:00 |
|
FartyPants
|
c23c88ee4c
|
Update LoRA.py - avoid potential error (#2953)
|
2023-07-03 17:40:22 -03:00 |
|
FartyPants
|
33f56fd41d
|
Update models.py to clear LORA names after unload (#2951)
|
2023-07-03 17:39:06 -03:00 |
|
FartyPants
|
48b11f9c5b
|
Training: added trainable parameters info (#2944)
|
2023-07-03 17:38:36 -03:00 |
|
Turamarth14
|
847f70b694
|
Update html_generator.py (#2954)
With version 10.0.0 of Pillow the constant Image.ANTIALIAS has been removed. Instead Image.LANCZOS should be used.
|
2023-07-02 01:43:58 -03:00 |
|
ardfork
|
3c076c3c80
|
Disable half2 for ExLlama when using HIP (#2912)
|
2023-06-29 15:03:16 -03:00 |
|
missionfloyd
|
ac0f96e785
|
Some more character import tweaks. (#2921)
|
2023-06-29 14:56:25 -03:00 |
|
oobabooga
|
79db629665
|
Minor bug fix
|
2023-06-29 13:53:06 -03:00 |
|
oobabooga
|
3443219cbc
|
Add repetition penalty range parameter to transformers (#2916)
|
2023-06-29 13:40:13 -03:00 |
|
oobabooga
|
20740ab16e
|
Revert "Fix exllama_hf gibbersh above 2048 context, and works >5000 context. (#2913)"
This reverts commit 37a16d23a7 .
|
2023-06-28 18:10:34 -03:00 |
|
Panchovix
|
37a16d23a7
|
Fix exllama_hf gibbersh above 2048 context, and works >5000 context. (#2913)
|
2023-06-28 12:36:07 -03:00 |
|
FartyPants
|
ab1998146b
|
Training update - backup the existing adapter before training on top of it (#2902)
|
2023-06-27 18:24:04 -03:00 |
|
oobabooga
|
22d455b072
|
Add LoRA support to ExLlama_HF
|
2023-06-26 00:10:33 -03:00 |
|
oobabooga
|
c52290de50
|
ExLlama with long context (#2875)
|
2023-06-25 22:49:26 -03:00 |
|
oobabooga
|
9290c6236f
|
Keep ExLlama_HF if already selected
|
2023-06-25 19:06:28 -03:00 |
|
oobabooga
|
75fd763f99
|
Fix chat saving issue (closes #2863)
|
2023-06-25 18:14:57 -03:00 |
|
FartyPants
|
21c189112c
|
Several Training Enhancements (#2868)
|
2023-06-25 15:34:46 -03:00 |
|
oobabooga
|
95212edf1f
|
Update training.py
|
2023-06-25 12:13:15 -03:00 |
|
oobabooga
|
f31281a8de
|
Fix loading instruction templates containing literal '\n'
|
2023-06-25 02:13:26 -03:00 |
|
oobabooga
|
f0fcd1f697
|
Sort some imports
|
2023-06-25 01:44:36 -03:00 |
|
oobabooga
|
365b672531
|
Minor change to prevent future bugs
|
2023-06-25 01:38:54 -03:00 |
|
jllllll
|
bef67af23c
|
Use pre-compiled python module for ExLlama (#2770)
|
2023-06-24 20:24:17 -03:00 |
|
oobabooga
|
cec5fb0ef6
|
Failed attempt at evaluating exllama_hf perplexity
|
2023-06-24 12:02:25 -03:00 |
|
快乐的我531
|
e356f69b36
|
Make stop_everything work with non-streamed generation (#2848)
|
2023-06-24 11:19:16 -03:00 |
|
oobabooga
|
ec482f3dae
|
Apply input extensions after yielding *Is typing...*
|
2023-06-24 11:07:11 -03:00 |
|
oobabooga
|
3e80f2aceb
|
Apply the output extensions only once
Relevant for google translate, silero
|
2023-06-24 10:59:07 -03:00 |
|
missionfloyd
|
51a388fa34
|
Organize chat history/character import menu (#2845)
* Organize character import menu
* Move Chat history upload/download labels
|
2023-06-24 09:55:02 -03:00 |
|
oobabooga
|
8bb3bb39b3
|
Implement stopping string search in string space (#2847)
|
2023-06-24 09:43:00 -03:00 |
|
oobabooga
|
3ae9af01aa
|
Add --no_use_cuda_fp16 param for AutoGPTQ
|
2023-06-23 12:22:56 -03:00 |
|
Panchovix
|
5646690769
|
Fix some models not loading on exllama_hf (#2835)
|
2023-06-23 11:31:02 -03:00 |
|
oobabooga
|
383c50f05b
|
Replace old presets with the results of Preset Arena (#2830)
|
2023-06-23 01:48:29 -03:00 |
|
Panchovix
|
b4a38c24b7
|
Fix Multi-GPU not working on exllama_hf (#2803)
|
2023-06-22 16:05:25 -03:00 |
|
LarryVRH
|
580c1ee748
|
Implement a demo HF wrapper for exllama to utilize existing HF transformers decoding. (#2777)
|
2023-06-21 15:31:42 -03:00 |
|
EugeoSynthesisThirtyTwo
|
7625c6de89
|
fix usage of self in classmethod (#2781)
|
2023-06-20 16:18:42 -03:00 |
|