oobabooga | ebb4f22028 | Change a comment | 2023-07-31 20:06:10 -07:00
oobabooga | 8e2217a029 | Minor changes to the Parameters tab | 2023-07-31 19:55:11 -07:00
oobabooga | b2207f123b | Update docs | 2023-07-31 19:20:48 -07:00
oobabooga | 84297d05c4 | Add a "Filter by loader" menu to the Parameters tab | 2023-07-31 19:09:02 -07:00
oobabooga | e6be25ea11 | Fix a regression | 2023-07-30 18:12:30 -07:00
oobabooga | 5ca37765d3 | Only replace {{user}} and {{char}} at generation time | 2023-07-30 11:42:30 -07:00
oobabooga | 6e16af34fd | Save uploaded characters as yaml. Also allow yaml characters to be uploaded directly | 2023-07-30 11:25:38 -07:00
oobabooga | ed80a2e7db | Reorder llama.cpp params | 2023-07-25 20:45:20 -07:00
oobabooga | 0e8782df03 | Set instruction template when switching from default/notebook to chat | 2023-07-25 20:37:01 -07:00
oobabooga | 1b89c304ad | Update README | 2023-07-25 15:46:12 -07:00
oobabooga | 75c2dd38cf | Remove flexgen support | 2023-07-25 15:15:29 -07:00
Shouyi | 031fe7225e | Add tensor split support for llama.cpp (#3171) | 2023-07-25 18:59:26 -03:00
oobabooga | 7bc408b472 | Change rms_norm_eps to 5e-6 for llama-2-70b ggml. Based on https://github.com/ggerganov/llama.cpp/pull/2384 | 2023-07-25 14:54:57 -07:00
oobabooga | 08c622df2e | Autodetect rms_norm_eps and n_gqa for llama-2-70b | 2023-07-24 15:27:34 -07:00
oobabooga | a07d070b6c | Add llama-2-70b GGML support (#3285) | 2023-07-24 16:37:03 -03:00
jllllll | d7a14174a2 | Remove auto-loading when only one model is available (#3187) | 2023-07-18 11:39:08 -03:00
oobabooga | f83fdb9270 | Don't reset LoRA menu when loading a model | 2023-07-17 12:50:25 -07:00
oobabooga | 2de0cedce3 | Fix reload screen color | 2023-07-15 22:39:39 -07:00
oobabooga | 27a84b4e04 | Make AutoGPTQ the default again. Purely for compatibility with more models. You should still use ExLlama_HF for LLaMA models. | 2023-07-15 22:29:23 -07:00
oobabooga | 5e3f7e00a9 | Create llamacpp_HF loader (#3062) | 2023-07-16 02:21:13 -03:00
Panchovix | 7c4d4fc7d3 | Increase alpha value limit for NTK RoPE scaling for exllama/exllama_HF (#3149) | 2023-07-16 01:56:04 -03:00
oobabooga | b284f2407d | Make ExLlama_HF the new default for GPTQ | 2023-07-14 14:03:56 -07:00
oobabooga | 22341e948d | Merge branch 'main' into dev | 2023-07-12 14:19:49 -07:00
oobabooga | 0e6295886d | Fix lora download folder | 2023-07-12 14:19:33 -07:00
oobabooga | eb823fce96 | Fix typo | 2023-07-12 13:55:19 -07:00
oobabooga | d0a626f32f | Change reload screen color | 2023-07-12 13:54:43 -07:00
oobabooga | c592a9b740 | Fix #3117 | 2023-07-12 13:33:44 -07:00
Gabriel Pena | eedb3bf023 | Add low vram mode on llama cpp (#3076) | 2023-07-12 11:05:13 -03:00
Axiom Wolf | d986c17c52 | Chat history download creates more detailed file names (#3051) | 2023-07-12 00:10:36 -03:00
Salvador E. Tropea | 324e45b848 | [Fixed] wbits and groupsize values from model not shown (#2977) | 2023-07-11 23:27:38 -03:00
oobabooga | bfafd07f44 | Change a message | 2023-07-11 18:29:20 -07:00
micsthepick | 3708de2b1f | respect model dir for downloads (#3077) (#3079) | 2023-07-11 18:55:46 -03:00
oobabooga | 9aee1064a3 | Block a cloudfare request | 2023-07-06 22:24:52 -07:00
oobabooga | 40c5722499 | Fix #2998 | 2023-07-04 11:35:25 -03:00
oobabooga | 55457549cd | Add information about presets to the UI | 2023-07-03 22:39:01 -07:00
Panchovix | 10c8c197bf | Add Support for Static NTK RoPE scaling for exllama/exllama_hf (#2955) | 2023-07-04 01:13:16 -03:00
FartyPants | eb6112d5a2 | Update server.py - clear LORA after reload (#2952) | 2023-07-04 00:13:38 -03:00
oobabooga | 4b1804a438 | Implement sessions + add basic multi-user support (#2991) | 2023-07-04 00:03:30 -03:00
missionfloyd | ac0f96e785 | Some more character import tweaks. (#2921) | 2023-06-29 14:56:25 -03:00
oobabooga | 5d2a8b31be | Improve Parameters tab UI | 2023-06-29 14:33:47 -03:00
oobabooga | 3443219cbc | Add repetition penalty range parameter to transformers (#2916) | 2023-06-29 13:40:13 -03:00
oobabooga | 22d455b072 | Add LoRA support to ExLlama_HF | 2023-06-26 00:10:33 -03:00
oobabooga | b7c627f9a0 | Set UI defaults | 2023-06-25 22:55:43 -03:00
oobabooga | c52290de50 | ExLlama with long context (#2875) | 2023-06-25 22:49:26 -03:00
oobabooga | f0fcd1f697 | Sort some imports | 2023-06-25 01:44:36 -03:00
oobabooga | e6e5f546b8 | Reorganize Chat settings tab | 2023-06-25 01:10:20 -03:00
jllllll | bef67af23c | Use pre-compiled python module for ExLlama (#2770) | 2023-06-24 20:24:17 -03:00
missionfloyd | 51a388fa34 | Organize chat history/character import menu (#2845): Organize character import menu; Move Chat history upload/download labels | 2023-06-24 09:55:02 -03:00
oobabooga | 3ae9af01aa | Add --no_use_cuda_fp16 param for AutoGPTQ | 2023-06-23 12:22:56 -03:00
LarryVRH | 580c1ee748 | Implement a demo HF wrapper for exllama to utilize existing HF transformers decoding. (#2777) | 2023-06-21 15:31:42 -03:00