nai-degen
|
e813cd9d22
|
default claude 2.1 instead of 1.3 in openai compat endpoint since 1.3 is not accessible on all keys
|
2024-01-18 04:14:15 -06:00 |
|
nai-degen
|
4c2a2c1e6c
|
improves handle-streamed-response comments/docs [skip-ci]
|
2024-01-18 04:14:15 -06:00 |
|
twinkletoes
|
96a0f94041
|
Fix Mistral safe_prompt schema property (khanon/oai-reverse-proxy!60)
|
2024-01-14 00:11:39 +00:00 |
|
nai-degen
|
7f92565739
|
SSE queueing adjustments, untested
|
2024-01-07 16:19:22 -06:00 |
|
nai-degen
|
8dc7464381
|
strips extraneous properties on zod schemas
|
2024-01-07 13:00:48 -06:00 |
|
twinkletoes
|
4a823b216f
|
Mistral AI support (khanon/oai-reverse-proxy!58)
|
2023-12-25 18:33:16 +00:00 |
|
nai-degen
|
655703e680
|
refactors infopage
|
2023-12-16 20:30:20 -06:00 |
|
nai-degen
|
5599a83ae4
|
improves streaming error handling
|
2023-12-14 05:01:10 -06:00 |
|
nai-degen
|
de34d41918
|
fixes gemini name prefixing when 'Add character names' is disabled in ST
|
2023-12-13 23:21:30 -06:00 |
|
nai-degen
|
c5cd90dcef
|
adjusts prompt transform to discourage Gemini from speaking for user
|
2023-12-13 23:03:57 -06:00 |
|
nai-degen
|
8a135a960d
|
fixes gemini prompt reformatting for jbs; adds stop sequences
|
2023-12-13 21:45:53 -06:00 |
|
nai-degen
|
707cbbce16
|
fixes gemini throwing an error on JB prompts
|
2023-12-13 19:14:31 -06:00 |
|
khanon
|
fad16cc268
|
Add Google AI API (khanon/oai-reverse-proxy!57)
|
2023-12-13 21:56:07 +00:00 |
|
nai-degen
|
0d3682197c
|
treats 403 from anthropic as key dead
|
2023-12-11 09:13:53 -06:00 |
|
valadaptive
|
e0624e30fd
|
Fix some corner cases in SSE parsing (khanon/oai-reverse-proxy!56)
|
2023-12-09 06:18:01 +00:00 |
|
nai-degen
|
94d4efe9bb
|
properly enforce allowedModelFamilies; refactor HPM proxyReq handlers
|
2023-12-05 22:07:56 -06:00 |
|
nai-degen
|
fdd824f0e4
|
adds azure rate limit auto-retry
|
2023-12-04 01:24:33 -06:00 |
|
khanon
|
fbdea30264
|
Azure OpenAI suport (khanon/oai-reverse-proxy!48)
|
2023-12-04 04:21:18 +00:00 |
|
nai-degen
|
9e61d9029f
|
adds claude-2.1 (untested)
|
2023-11-21 11:32:43 -06:00 |
|
nai-degen
|
f95e24afbb
|
fixes incorrect max model size for gpt4-v
|
2023-11-19 02:23:41 -06:00 |
|
khanon
|
f29049f993
|
Support for GPT-4-Vision (khanon/oai-reverse-proxy!54)
|
2023-11-19 05:06:21 +00:00 |
|
khanon
|
20c064394a
|
OpenAI DALL-E Image Generation (khanon/oai-reverse-proxy!52)
|
2023-11-14 05:41:19 +00:00 |
|
nai-degen
|
c7a095d345
|
removes debug log
|
2023-11-09 16:25:57 -06:00 |
|
nai-degen
|
e9110611fa
|
adds REJECT_PHRASES configuration setting
|
2023-11-09 16:24:49 -06:00 |
|
nai-degen
|
b6f8f15a1f
|
tries to prevent per-day rate limited keys from bricking the queue
|
2023-11-06 21:16:36 -06:00 |
|
nai-degen
|
0d5dfeccf8
|
adds gpt4-turbo model family and support for gpt-4-1106-preview model
|
2023-11-06 15:29:43 -06:00 |
|
nai-degen
|
89e1ed46d5
|
re-signs AWS requests on every attempt to fix fucked up queueing
|
2023-10-24 13:10:50 -05:00 |
|
nai-degen
|
26dc79c8f1
|
fixes broken AWS rate limit backoff
|
2023-10-24 09:19:46 -05:00 |
|
nai-degen
|
89e9b67f3f
|
fixes AWS mid-stream rate limits not actually marking key as rate-limited
|
2023-10-23 22:47:29 -05:00 |
|
nai-degen
|
52ec2ec265
|
fixes blank AWS responses due to reqs sometimes using wrong handler
|
2023-10-23 22:23:06 -05:00 |
|
nai-degen
|
8bd2f749c1
|
reduces logging severity of prompt validation errors
|
2023-10-23 20:30:27 -05:00 |
|
nai-degen
|
3f7e50f87e
|
follow-up 'fixes empty AWS streaming responses when under heavy load'
|
2023-10-15 00:06:38 -05:00 |
|
nai-degen
|
f6cfc6e882
|
fixes empty AWS streaming responses when under heavy load
|
2023-10-15 00:05:36 -05:00 |
|
nai-degen
|
af4d8dae40
|
changes default AMZ_HOST to bedrock-runtime.region.amazonaws.com
|
2023-10-12 15:39:06 -05:00 |
|
nai-degen
|
12f78fa1f2
|
exempts 'special' role from rate limiting
|
2023-10-06 20:29:28 -05:00 |
|
nai-degen
|
5033d00444
|
improves clarity of errors sent back to streaming clients
|
2023-10-03 19:45:15 -05:00 |
|
nai-degen
|
ba0b20617e
|
ensures AWS always uses anthropic-version 2023-06-01 parser
|
2023-10-03 19:43:30 -05:00 |
|
khanon
|
ecf897e685
|
Refactor handleStreamingResponse to make it less shit (khanon/oai-reverse-proxy!46)
|
2023-10-03 06:14:19 +00:00 |
|
nai-degen
|
ede274c117
|
disables AWS key on AccessDeniedException
|
2023-10-02 11:18:08 -05:00 |
|
nai-degen
|
0837c89a42
|
fixes incorrect context size limit for aws claude v1
|
2023-10-02 03:53:04 -05:00 |
|
nai-degen
|
f67560a17b
|
refactors proxy routing
|
2023-10-01 12:12:28 -05:00 |
|
nai-degen
|
e13361a323
|
removes dead koboldai code
|
2023-10-01 11:27:11 -05:00 |
|
khanon
|
fa4bf468d2
|
Implement AWS Bedrock support (khanon/oai-reverse-proxy!45)
|
2023-10-01 01:40:18 +00:00 |
|
nai-degen
|
7e681a7bef
|
strips OAI request parameters when translating to Claude format
|
2023-09-29 03:01:39 -05:00 |
|
nai-degen
|
1b0106a1ea
|
strips reverse proxy originating IP headers
|
2023-09-29 03:00:55 -05:00 |
|
nai-degen
|
f5521aa6c3
|
prevents selecting trial keys for embeddings requests due to rate limits
|
2023-09-26 01:26:07 -05:00 |
|
nai-degen
|
f8b480f4c2
|
adds support for proxying text-embedding-ada-002 requests
|
2023-09-26 00:58:38 -05:00 |
|
khanon
|
35b44e1c6b
|
fixes issue with OpenAIV1ChatCompletionSchema and PaLM compat
|
2023-09-24 10:48:56 +00:00 |
|
nai-degen
|
075e415343
|
makes incoming model name validation less strict for PaLM endpoint
|
2023-09-20 23:55:53 -05:00 |
|
khanon
|
35a6c393ed
|
Add support for Google PaLM and OpenAI Turbo Instruct (khanon/oai-reverse-proxy!44)
|
2023-09-19 23:13:08 +00:00 |
|