| Author | Commit | Message | Date |
|---|---|---|---|
| nai-degen | 0d5dfeccf8 | adds gpt4-turbo model family and support for gpt-4-1106-preview model | 2023-11-06 15:29:43 -06:00 |
| nai-degen | a27163a629 | adds option to not disable keys when reaching IP limit | 2023-11-06 10:15:57 -06:00 |
| nai-degen | 51dd0c71ba | removes unused import in openai proxy | 2023-10-24 13:17:46 -05:00 |
| nai-degen | 89e1ed46d5 | re-signs AWS requests on every attempt to fix fucked up queueing | 2023-10-24 13:10:50 -05:00 |
| nai-degen | 26dc79c8f1 | fixes broken AWS rate limit backoff | 2023-10-24 09:19:46 -05:00 |
| nai-degen | 89e9b67f3f | fixes AWS mid-stream rate limits not actually marking key as rate-limited | 2023-10-23 22:47:29 -05:00 |
| nai-degen | 52ec2ec265 | fixes blank AWS responses due to reqs sometimes using wrong handler | 2023-10-23 22:23:06 -05:00 |
| nai-degen | 8bd2f749c1 | reduces logging severity of prompt validation errors | 2023-10-23 20:30:27 -05:00 |
| nai-degen | 3f7e50f87e | follow-up 'fixes empty AWS streaming responses when under heavy load' | 2023-10-15 00:06:38 -05:00 |
| nai-degen | f6cfc6e882 | fixes empty AWS streaming responses when under heavy load | 2023-10-15 00:05:36 -05:00 |
| nai-degen | af4d8dae40 | changes default AMZ_HOST to bedrock-runtime.region.amazonaws.com | 2023-10-12 15:39:06 -05:00 |
| nai-degen | 725fd6e6f1 | deprioritizes queued Agnai.chat requests and limits concurrency to five across all shared IPs | 2023-10-09 12:36:54 -05:00 |
| nai-degen | 12f78fa1f2 | exempts 'special' role from rate limiting | 2023-10-06 20:29:28 -05:00 |
| nai-degen | daf6a123d5 | adjusts Agnai.chat and RisuAI rate limiting | 2023-10-04 09:39:59 -05:00 |
| nai-degen | 5033d00444 | improves clarity of errors sent back to streaming clients | 2023-10-03 19:45:15 -05:00 |
| nai-degen | ba0b20617e | ensures AWS always uses anthropic-version 2023-06-01 parser | 2023-10-03 19:43:30 -05:00 |
| khanon | ecf897e685 | Refactor handleStreamingResponse to make it less shit (khanon/oai-reverse-proxy!46) | 2023-10-03 06:14:19 +00:00 |
| nai-degen | ede274c117 | disables AWS key on AccessDeniedException | 2023-10-02 11:18:08 -05:00 |
| nai-degen | 0837c89a42 | fixes incorrect context size limit for aws claude v1 | 2023-10-02 03:53:04 -05:00 |
| nai-degen | f67560a17b | refactors proxy routing | 2023-10-01 12:12:28 -05:00 |
| nai-degen | e13361a323 | removes dead koboldai code | 2023-10-01 11:27:11 -05:00 |
| khanon | fa4bf468d2 | Implement AWS Bedrock support (khanon/oai-reverse-proxy!45) | 2023-10-01 01:40:18 +00:00 |
| nai-degen | 7e681a7bef | strips OAI request parameters when translating to Claude format | 2023-09-29 03:01:39 -05:00 |
| nai-degen | 1b0106a1ea | strips reverse proxy originating IP headers | 2023-09-29 03:00:55 -05:00 |
| nai-degen | f5521aa6c3 | prevents selecting trial keys for embeddings requests due to rate limits | 2023-09-26 01:26:07 -05:00 |
| nai-degen | f8b480f4c2 | adds support for proxying text-embedding-ada-002 requests | 2023-09-26 00:58:38 -05:00 |
| khanon | 35b44e1c6b | fixes issue with OpenAIV1ChatCompletionSchema and PaLM compat | 2023-09-24 10:48:56 +00:00 |
| nai-degen | 075e415343 | makes incoming model name validation less strict for PaLM endpoint | 2023-09-20 23:55:53 -05:00 |
| khanon | 35a6c393ed | Add support for Google PaLM and OpenAI Turbo Instruct (khanon/oai-reverse-proxy!44) | 2023-09-19 23:13:08 +00:00 |
| nai-degen | 5e57dbb8f1 | attempts to improve compatibility with BetterGPT frontend | 2023-09-16 11:04:40 -05:00 |
| nai-degen | 7b3d6efb02 | reverts anthropic-version change as it breaks some frontends | 2023-09-07 22:01:19 -05:00 |
| nai-degen | 63542bfabb | adds anthropic-version header in all cases | 2023-09-07 20:23:34 -05:00 |
| khanon | f05e196994 | Refactor project structure and add user self-serve UI (khanon/oai-reverse-proxy!41) | 2023-09-02 19:36:44 +00:00 |
| nai-degen | 980abcc01f | fixes tsc build | 2023-08-31 13:50:16 -05:00 |
| nai-degen | fe0f04ceb8 | improves display of large token numbers | 2023-08-31 13:23:36 -05:00 |
| nai-degen | 4b32130eaa | adds maintenance function to clear all users' token records | 2023-08-30 22:38:33 -05:00 |
| nai-degen | 2c0a659b2d | adds token consumption stats to infopage | 2023-08-30 20:40:40 -05:00 |
| nai-degen | 7cab0a5c52 | fixes tsc issue breaking build | 2023-08-30 14:31:47 -05:00 |
| nai-degen | 27a1181752 | adds optional token quota limits for gpt4-32k | 2023-08-30 13:57:10 -05:00 |
| khanon | 4d781e1720 | Add GPT-4-32k support (khanon/oai-reverse-proxy!39) | 2023-08-29 22:56:54 +00:00 |
| nai-degen | 3c56103de0 | adds optional user_token nicknames | 2023-08-29 14:20:28 -05:00 |
| khanon | 6833736392 | Clone keys assigned to multiple organizations (khanon/oai-reverse-proxy!38) | 2023-08-28 21:11:49 +00:00 |
| nai-degen | 7c9c3a640c | minor cleanup for user quota docs/examples | 2023-08-28 14:51:28 -05:00 |
| khanon | cb780e85da | Per-user token quotas and automatic quota refreshing (khanon/oai-reverse-proxy!37) | 2023-08-28 19:33:14 +00:00 |
| nai-degen | 785b1f69f3 | implements new local risu validation (via @kwaroran) | 2023-08-28 05:28:58 -05:00 |
| nai-degen | 6bb67281d9 | removes QUEUE_MODE config (now always enabled) | 2023-08-09 18:29:34 -05:00 |
| nai-degen | d1d83b41fa | uses accurate Claude tokenization | 2023-08-08 17:29:36 -05:00 |
| nai-degen | 125bbe6441 | fixes issue with writeErrorResponse | 2023-08-04 13:49:11 -05:00 |
| nai-degen | 00346360af | fixes turbo-16k incompatibility | 2023-07-23 20:13:38 -05:00 |
| nai-degen | e2bd8a6b86 | extracts Risu auth into new middleware so queue can use it too | 2023-07-22 13:48:02 -05:00 |