138 Commits

Author SHA1 Message Date
oobabooga
510a01ef46 Lint 2023-11-16 18:03:06 -08:00
oobabooga
a475aa7816 Improve API documentation 2023-11-15 18:39:08 -08:00
oobabooga
a85ce5f055 Add more info messages for truncation / instruction template 2023-11-15 16:20:31 -08:00
oobabooga
e6f44d6d19 Print context length / instruction template to terminal when loading models 2023-11-15 16:00:51 -08:00
oobabooga
be125e2708 Add /v1/internal/model/unload endpoint 2023-11-15 15:48:33 -08:00
oobabooga
52758f15da Remove sentence-transformers requirement (for #1575) 2023-11-10 07:35:29 -08:00
oobabooga
c5be3f7acb Make /v1/embeddings functional, add request/response types 2023-11-10 07:34:27 -08:00
oobabooga
0777b0d3c7 Add system_message parameter, document model (unused) parameter 2023-11-10 06:47:18 -08:00
oobabooga
4aabff3728 Remove old API, launch OpenAI API with --api 2023-11-10 06:39:08 -08:00
GuizzyQC
6a7cd01ebf Fix bug with /internal/model/load (#4549)
Update shared.model_name after loading model through API call
2023-11-10 00:16:38 -03:00
oobabooga
d86f1fd2c3 OpenAI API: stop streaming on client disconnect (closes #4521) 2023-11-09 06:37:32 -08:00
oobabooga
effb3aef42 Prevent deadlocks in OpenAI API with simultaneous requests 2023-11-08 20:55:39 -08:00
oobabooga
678fd73aef Document /v1/internal/model/load and fix a bug 2023-11-08 17:41:12 -08:00
MrMojoR
1754a3761b Include trust remote code usage in openai api's embedder (#4513) 2023-11-08 11:25:43 -03:00
hronoas
6c7aad11f3 openai extension: wrong frequency_penalty type (#4512) 2023-11-08 11:23:51 -03:00
oobabooga
881e8a6e70 Small bug fix in /v1/internal/model/load 2023-11-08 02:34:13 -03:00
oobabooga
050ff36bd6 Revert "Add a comment to /v1/models"
This reverts commit 38b07493a0.
2023-11-07 21:09:47 -08:00
oobabooga
38b07493a0 Add a comment to /v1/models 2023-11-07 21:07:12 -08:00
oobabooga
2358706453 Add /v1/internal/model/load endpoint (tentative) 2023-11-07 20:58:06 -08:00
oobabooga
43c53a7820 Refactor the /v1/models endpoint 2023-11-07 19:59:27 -08:00
oobabooga
1b69694fe9 Add types to the encode/decode/token-count endpoints 2023-11-07 19:32:14 -08:00
oobabooga
f6ca9cfcdc Add /v1/internal/model-info endpoint 2023-11-07 18:59:02 -08:00
oobabooga
6e2e0317af Separate context and system message in instruction formats (#4499) 2023-11-07 20:02:58 -03:00
oobabooga
3fc505dc0f Document unused parameters 2023-11-07 08:56:09 -08:00
oobabooga
3d59346871 Implement echo/suffix parameters 2023-11-07 08:43:45 -08:00
oobabooga
48c9c31440 Document the "preset" option in the API 2023-11-07 08:23:17 -08:00
oobabooga
79b3f5a546 Add /v1/internal/stop-generation to OpenAI API (#4498) 2023-11-07 00:10:42 -03:00
oobabooga
97c21e5667 Don't strip leading spaces in OpenAI API 2023-11-06 19:09:41 -08:00
oobabooga
4a45dc4041 Reorder the parameters in the FastAPI documentation 2023-11-06 09:55:36 -08:00
oobabooga
0db81355bc Reorder a parameter 2023-11-06 07:11:49 -08:00
oobabooga
b87c6213ae Remove obsolete endpoint 2023-11-06 05:45:45 -08:00
oobabooga
28fd535f9c Make chat API more robust 2023-11-06 05:22:01 -08:00
oobabooga
ec17a5d2b7 Make OpenAI API the default API (#4430) 2023-11-06 02:38:29 -03:00
俞航
84d957ba62 [Fix] fix openai embedding_model loading as str (#4147) 2023-11-05 20:42:45 -03:00
kabachuha
e18a0460d4 fix openai extension not working because of absent new defaults (#4477) 2023-11-04 16:12:51 -03:00
tdrussell
72f6fc6923 Rename additive_repetition_penalty to presence_penalty, add frequency_penalty (#4376) 2023-10-25 12:10:28 -03:00
tdrussell
4440f87722 Add additive_repetition_penalty sampler setting. (#3627) 2023-10-23 02:28:07 -03:00
hronoas
db7ecdd274 openai: fix empty models list on query present in url (#4139) 2023-10-16 17:02:47 -03:00
Jesus Alvarez
ed66ca3cdf Add HTTPS support to APIs (openai and default) (#4270)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-10-13 01:31:13 -03:00
oobabooga
08cf150c0c Add a grammar editor to the UI (#4061) 2023-09-24 18:05:24 -03:00
oobabooga
b227e65d86 Add grammar to llama.cpp loader (closes #4019) 2023-09-24 07:10:45 -07:00
Chenxiao Wang
9de2dfa887 extensions/openai: Fix error when preparing cache for embedding models (#3995) 2023-09-24 00:58:28 -03:00
Chenxiao Wang
347aed4254 extensions/openai: load extension settings via settings.yaml (#3953) 2023-09-17 22:39:29 -03:00
Shulzhenko Anatolii
0668f4e67f Add speechrecognition dependency for OpenAI extension (#3959) 2023-09-16 13:49:48 -03:00
wizd
cc7f345c29 add whisper api to openai plugin (#3958) 2023-09-16 12:04:04 -03:00
oobabooga
f5fb1ee666 Change a comment 2023-09-15 20:16:30 -07:00
oobabooga
2c1b548cea Minor fix 2023-09-15 20:14:32 -07:00
oobabooga
8f97e87cac Lint the openai extension 2023-09-15 20:11:16 -07:00
saltacc
f01b9aa71f Add customizable ban tokens (#3899) 2023-09-15 18:27:27 -03:00
Chang Chi, Meng
b61d9aef19 openai API: add support for chunked transfer encoding in POST requests (#3870) 2023-09-12 15:54:42 -03:00
oobabooga
df123a20fc Prevent extra keys from being saved to settings.yaml 2023-09-11 20:13:10 -07:00
oobabooga
9331ab4798 Read GGUF metadata (#3873) 2023-09-11 18:49:30 -03:00
oobabooga
cec8db52e5 Add max_tokens_second param (#3533) 2023-08-29 17:44:31 -03:00
oobabooga
c237ce607e Move characters/instruction-following to instruction-templates 2023-08-06 17:50:32 -07:00
oobabooga
0af10ab49b Add Classifier Free Guidance (CFG) for Transformers/ExLlama (#3325) 2023-08-06 17:22:48 -03:00
matatonic
8f98268252 extensions/openai: include content-length for json replies (#3416) 2023-08-03 16:10:49 -03:00
oobabooga
e931844fe2 Add auto_max_new_tokens parameter (#3419) 2023-08-02 14:52:20 -03:00
matatonic
9ae0eab989 extensions/openai: +Array input (batched) , +Fixes (#3309) 2023-08-01 22:26:00 -03:00
matatonic
90a4ab631c extensions/openai: Fixes for: embeddings, tokens, better errors. +Docs update, +Images, +logit_bias/logprobs, +more. (#3122) 2023-07-24 11:28:12 -03:00
oobabooga
e202190c4f lint 2023-07-12 11:33:25 -07:00
matatonic
3e7feb699c extensions/openai: Major openai extension updates & fixes (#3049)
* many openai updates
* total reorg & cleanup.
* fixups
* missing import os for images
* +moderations, custom_stopping_strings, more fixes
* fix bugs in completion streaming
* moderation fix (flagged)
* updated moderation categories

---------

Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org>
2023-07-11 18:50:08 -03:00
oobabooga
3443219cbc Add repetition penalty range parameter to transformers (#2916) 2023-06-29 13:40:13 -03:00
oobabooga
c52290de50 ExLlama with long context (#2875) 2023-06-25 22:49:26 -03:00
matatonic
b45baeea41 extensions/openai: Major docs update, fix #2852 (critical bug), minor improvements (#2849) 2023-06-24 22:50:04 -03:00
matatonic
1e97aaac95 extensions/openai: docs update, model loader, minor fixes (#2557) 2023-06-17 19:15:24 -03:00
matatonic
4a17a5db67 [extensions/openai] various fixes (#2533) 2023-06-06 01:43:04 -03:00
matatonic
a6d3f010a5 extensions/openai: include all available models in Model.list (#2368)
Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org>
2023-05-30 22:13:37 -03:00
matatonic
e5b756ecfe Fixes #2331, IndexError: string index out of range (#2383) 2023-05-30 22:07:40 -03:00
matatonic
ebcadc0042 extensions/openai: cross_origin + chunked_response (updated fix) (#2423) 2023-05-30 21:54:24 -03:00
matatonic
df50f077db fixup missing tfs top_a params, defaults reorg (#2443) 2023-05-30 21:52:33 -03:00
matatonic
2b7ba9586f Fixes #2326, KeyError: 'assistant' (#2382) 2023-05-29 14:19:57 -03:00
matatonic
9714072692 [extensions/openai] use instruction templates with chat_completions (#2291) 2023-05-23 19:58:41 -03:00
oobabooga
c0fd7f3257 Add mirostat parameters for llama.cpp (#2287) 2023-05-22 19:37:24 -03:00
oobabooga
8ac3636966 Add epsilon_cutoff/eta_cutoff parameters (#2258) 2023-05-21 15:11:57 -03:00
matatonic
78b2478d9c assistant: space fix, system: prompt fix (#2219) 2023-05-20 23:32:34 -03:00
matatonic
ab08cf6465 [extensions/openai] clip extra leading space (#2042) 2023-05-14 12:57:52 -03:00
oobabooga
c746a5bd00 Add .rstrip(' ') to openai api 2023-05-12 14:40:48 -03:00
matatonic
f98fd01dcd is_chat=False for /edits (#2011) 2023-05-11 19:15:11 -03:00
oobabooga
0d36c18f5d Always return only the new tokens in generation functions 2023-05-11 17:07:20 -03:00
matatonic
c4f0e6d740 is_chat changes fix for openai extension (#2008) 2023-05-11 16:32:25 -03:00
matatonic
309b72e549 [extension/openai] add edits & image endpoints & fix prompt return in non --chat modes (#1935) 2023-05-11 11:06:39 -03:00
oobabooga
3913155c1f Style improvements (#1957) 2023-05-09 22:49:39 -03:00
Jeffrey Lin
791a38bad1 [extensions/openai] Support undocumented base64 'encoding_format' param for compatibility with official OpenAI client (#1876) 2023-05-08 22:31:34 -03:00
oobabooga
8aafb1f796 Refactor text_generation.py, add support for custom generation functions (#1817) 2023-05-05 18:53:03 -03:00
Thireus ☠
4883e20fa7 Fix openai extension script.py - TypeError: '_Environ' object is not callable (#1753) 2023-05-03 09:51:49 -03:00
oobabooga
c31b0f15a7 Remove some spaces 2023-05-02 23:07:07 -03:00
oobabooga
320fcfde4e Style/pep8 improvements 2023-05-02 23:05:38 -03:00
matatonic
7ac41b87df add openai compatible api (#1475) 2023-05-02 22:49:53 -03:00