Commit Graph

141 Commits

Author SHA1 Message Date
nai-degen c7a095d345 removes debug log 2023-11-09 16:25:57 -06:00
nai-degen e9110611fa adds REJECT_PHRASES configuration setting 2023-11-09 16:24:49 -06:00
nai-degen b6f8f15a1f tries to prevent per-day rate limited keys from bricking the queue 2023-11-06 21:16:36 -06:00
nai-degen 0d5dfeccf8 adds gpt4-turbo model family and support for gpt-4-1106-preview model 2023-11-06 15:29:43 -06:00
nai-degen a27163a629 adds option to not disable keys when reaching IP limit 2023-11-06 10:15:57 -06:00
nai-degen 51dd0c71ba removes unused import in openai proxy 2023-10-24 13:17:46 -05:00
nai-degen 89e1ed46d5 re-signs AWS requests on every attempt to fix fucked up queueing 2023-10-24 13:10:50 -05:00
nai-degen 26dc79c8f1 fixes broken AWS rate limit backoff 2023-10-24 09:19:46 -05:00
nai-degen 89e9b67f3f fixes AWS mid-stream rate limits not actually marking key as rate-limited 2023-10-23 22:47:29 -05:00
nai-degen 52ec2ec265 fixes blank AWS responses due to reqs sometimes using wrong handler 2023-10-23 22:23:06 -05:00
nai-degen 8bd2f749c1 reduces logging severity of prompt validation errors 2023-10-23 20:30:27 -05:00
nai-degen 3f7e50f87e follow-up 'fixes empty AWS streaming responses when under heavy load' 2023-10-15 00:06:38 -05:00
nai-degen f6cfc6e882 fixes empty AWS streaming responses when under heavy load 2023-10-15 00:05:36 -05:00
nai-degen af4d8dae40 changes default AMZ_HOST to bedrock-runtime.region.amazonaws.com 2023-10-12 15:39:06 -05:00
nai-degen 725fd6e6f1 deprioritizes queued Agnai.chat requests and limits concurrency to five across all shared IPs 2023-10-09 12:36:54 -05:00
nai-degen 12f78fa1f2 exempts 'special' role from rate limiting 2023-10-06 20:29:28 -05:00
nai-degen daf6a123d5 adjusts Agnai.chat and RisuAI rate limiting 2023-10-04 09:39:59 -05:00
nai-degen 5033d00444 improves clarity of errors sent back to streaming clients 2023-10-03 19:45:15 -05:00
nai-degen ba0b20617e ensures AWS always uses anthropic-version 2023-06-01 parser 2023-10-03 19:43:30 -05:00
khanon ecf897e685 Refactor handleStreamingResponse to make it less shit (khanon/oai-reverse-proxy!46) 2023-10-03 06:14:19 +00:00
nai-degen ede274c117 disables AWS key on AccessDeniedException 2023-10-02 11:18:08 -05:00
nai-degen 0837c89a42 fixes incorrect context size limit for aws claude v1 2023-10-02 03:53:04 -05:00
nai-degen f67560a17b refactors proxy routing 2023-10-01 12:12:28 -05:00
nai-degen e13361a323 removes dead koboldai code 2023-10-01 11:27:11 -05:00
khanon fa4bf468d2 Implement AWS Bedrock support (khanon/oai-reverse-proxy!45) 2023-10-01 01:40:18 +00:00
nai-degen 7e681a7bef strips OAI request parameters when translating to Claude format 2023-09-29 03:01:39 -05:00
nai-degen 1b0106a1ea strips reverse proxy originating IP headers 2023-09-29 03:00:55 -05:00
nai-degen f5521aa6c3 prevents selecting trial keys for embeddings requests due to rate limits 2023-09-26 01:26:07 -05:00
nai-degen f8b480f4c2 adds support for proxying text-embedding-ada-002 requests 2023-09-26 00:58:38 -05:00
khanon 35b44e1c6b fixes issue with OpenAIV1ChatCompletionSchema and PaLM compat 2023-09-24 10:48:56 +00:00
nai-degen 075e415343 makes incoming model name validation less strict for PaLM endpoint 2023-09-20 23:55:53 -05:00
khanon 35a6c393ed Add support for Google PaLM and OpenAI Turbo Instruct (khanon/oai-reverse-proxy!44) 2023-09-19 23:13:08 +00:00
nai-degen 5e57dbb8f1 attempts to improve compatibility with BetterGPT frontend 2023-09-16 11:04:40 -05:00
nai-degen 7b3d6efb02 reverts anthropic-version change as it breaks some frontends 2023-09-07 22:01:19 -05:00
nai-degen 63542bfabb adds anthropic-version header in all cases 2023-09-07 20:23:34 -05:00
khanon f05e196994 Refactor project structure and add user self-serve UI (khanon/oai-reverse-proxy!41) 2023-09-02 19:36:44 +00:00
nai-degen 980abcc01f fixes tsc build 2023-08-31 13:50:16 -05:00
nai-degen fe0f04ceb8 improves display of large token numbers 2023-08-31 13:23:36 -05:00
nai-degen 4b32130eaa adds maintenance function to clear all users' token records 2023-08-30 22:38:33 -05:00
nai-degen 2c0a659b2d adds token consumption stats to infopage 2023-08-30 20:40:40 -05:00
nai-degen 7cab0a5c52 fixes tsc issue breaking build 2023-08-30 14:31:47 -05:00
nai-degen 27a1181752 adds optional token quota limits for gpt4-32k 2023-08-30 13:57:10 -05:00
khanon 4d781e1720 Add GPT-4-32k support (khanon/oai-reverse-proxy!39) 2023-08-29 22:56:54 +00:00
nai-degen 3c56103de0 adds optional user_token nicknames 2023-08-29 14:20:28 -05:00
khanon 6833736392 Clone keys assigned to multiple organizations (khanon/oai-reverse-proxy!38) 2023-08-28 21:11:49 +00:00
nai-degen 7c9c3a640c minor cleanup for user quota docs/examples 2023-08-28 14:51:28 -05:00
khanon cb780e85da Per-user token quotas and automatic quota refreshing (khanon/oai-reverse-proxy!37) 2023-08-28 19:33:14 +00:00
nai-degen 785b1f69f3 implements new local risu validation (via @kwaroran) 2023-08-28 05:28:58 -05:00
nai-degen 6bb67281d9 removes QUEUE_MODE config (now always enabled) 2023-08-09 18:29:34 -05:00
nai-degen d1d83b41fa uses accurate Claude tokenization 2023-08-08 17:29:36 -05:00