Commit Graph

559 Commits

Author SHA1 Message Date
SternAnon 7cfaf5777e Update file checker.ts 2025-03-06 16:53:32 +00:00
SternAnon 4f6ef38222 fix xai 2 2025-03-06 16:36:36 +00:00
SternAnon d21b232a8e fixes xai 2025-03-06 16:33:12 +00:00
SternAnon 72c9516679 Update 13 files
- /src/config.ts
- /src/info-page.ts
- /src/proxy/xai.ts
- /src/proxy/middleware/request/mutators/add-key.ts
- /src/proxy/middleware/request/preprocessors/validate-context-size.ts
- /src/proxy/middleware/response/index.ts
- /src/proxy/routes.ts
- /src/service-info.ts
- /src/shared/key-management/xai/checker.ts
- /src/shared/key-management/xai/provider.ts
- /src/shared/key-management/index.ts
- /src/shared/key-management/key-pool.ts
- /src/shared/models.ts
2025-03-06 16:25:48 +00:00
user fcaad65ccb Slop AI code to unify Anthropic model list and give Anthropic-style IDs for AWS /v1/models requests, needed for e.g. big-AGI 2025-02-27 20:25:30 +00:00
user b3d4650275 Initial GPT 4.5 bringup, separate model family due to extreme price 2025-02-27 20:25:30 +00:00
user 70c7f2aae9 aws sign fix for 3.7 2025-02-25 20:59:49 +00:00
user aecc934fad untested 3.7 sonnet, treating it like another 3.5 sonnet model 2025-02-24 18:38:56 +00:00
user a8d36f832e Check tool_result images for vision 2025-02-12 14:09:22 +00:00
user c1db122016 Simplify model reassignment in GCP 2025-02-12 13:27:15 +00:00
user e9bd6127a4 merge 2025-02-12 13:27:15 +00:00
user e230e9acec Remove 3.5 Sonnet v1 from GCP checking 2025-02-12 13:27:15 +00:00
penurin 239f95e8a1 Merge branch 'patch-1' into 'main'
Fix anthropic content schema

See merge request penurin/oai-reverse-proxy!1
2025-02-12 13:18:21 +00:00
W92k6zuinOCClyWS 17475447a0 Fix anthropic content schema (penurin/oai-reverse-proxy!1) 2025-02-12 13:18:21 +00:00
user d2b37b8455 Fix Gemini key checking: old code didn't properly check for the error message. Swapped the check to use 2.0 Flash because it catches more 429 keys 2025-02-05 17:11:27 +00:00
user cec66cdc44 Newer Gemini 2.0 models in the list 2025-02-05 17:11:27 +00:00
user a5c9e95929 Add all o models to the OpenAI model list 2025-01-31 20:22:06 +00:00
user c5d4fe44e6 Fix for the timeout workaround for o1 2025-01-31 20:22:06 +00:00
user 8ed883eaff o3 mini 2025-01-31 20:22:06 +00:00
user 6de338c6ac Properly separate deepseek keys from the generic ones in service info 2025-01-25 11:06:38 +00:00
user 45576db441 [Deepseek] Properly handle over-quota keys 2025-01-25 11:59:41 +00:00
user bcc83f30d9 Properly count DS reasoning tokens and properly save them per-user 2025-01-25 11:06:38 +00:00
user e5a26215e1 Add native Gemini model list endpoint 2025-01-24 08:34:12 +00:00
user cd6cc76a46 Attempt to improve the o1 timeout hack 2025-01-22 15:23:17 +00:00
faggot 613bb789fb fix error checking o1 deployments 2025-01-22 09:59:11 +00:00
user f1c698388e [Gemini] Support the new thinking config for 2.0 Flash Thinking 2025-01-22 09:59:11 +00:00
user 75605a2bfb Add preliminary deepseek-reasoner support 2025-01-20 07:28:30 +00:00
user 58e67d40e2 Check logging for AWS keys (untested) 2025-01-20 11:28:53 +00:00
user 796b4eee47 Make the OpenAI checker properly clone orgs again, and fix the error with the liveness check 2025-01-06 06:55:41 +00:00
user 0f482e67d2 Fix OpenAI -> Google AI conversion 2025-01-05 14:02:17 +00:00
user 496ec09905 Add v1alpha support (needed for 2.0 flash thinking with the new 'thought' parameter), already used by ST 2025-01-04 19:11:02 +00:00
user f522dba6a3 Fix errors with o1-preview and o1-mini 2025-01-03 04:41:29 +00:00
user 25ba8447d9 And add display names as well 2025-01-03 04:41:29 +00:00
user 91b8c01a9d Do the same for the AWS endpoint 2025-01-03 04:41:29 +00:00
user 82b88764ba Fix Anthropic model list to be actually compatible with Anthropic API (required by some frontends), remove old models 2025-01-03 04:50:55 +00:00
user 6ea9235ff8 Actually camelCase is canon for Gemini, oops 2024-12-31 08:23:45 +00:00
user 372ad85283 Support camelCase Gemini params and validate vision 2024-12-31 10:16:04 +00:00
user c2f5d2fbf3 Add /v1/models to deepseek 2024-12-31 08:23:45 +00:00
user c264413495 Leave a comment about concurrency for "special" user tokens 2024-12-31 08:23:45 +00:00
user 8d27082ad0 Fix formatting changes with upstream 2024-12-31 08:23:45 +00:00
user e2b602fd52 Adjust chunked transfer to send 4KB (CF's buffer size) of data every 49 seconds 2024-12-31 08:23:45 +00:00
user b00fb88cab Don't overwrite the reasoning effort by default 2024-12-31 07:45:11 +00:00
user 1cc281f6fe Add automatic prefill for Deepseek - works the same way as with Claude 2024-12-31 07:45:11 +00:00
user 8f4d00ed26 Init commit, some things:
- 'Transfer-Encoding: chunked' for o1 requests to prevent CF's 100 second limit
- Better tool/function call support
- Deepseek support
- Handling system as an array for AWS Claude
- Image support for Gemini
- Better o1 support (reasoning effort, developer role, context size)
2024-12-31 00:00:00 +00:00
nai-degen 36e2430a8f adjusts gemini keychecker to trigger real generation for better rate limit detection 2024-12-07 01:31:10 -06:00
nai-degen 28447d0811 resolves server-side error when a Gemini prompt is blocked due to safety 2024-12-07 00:54:13 -06:00
nai-degen 6d54cbc785 maybe handles gemini daily key block idk 2024-12-05 15:06:29 -06:00
nai-degen 9d7a4f4b51 maybe fixes gemini's fucked error messages idk 2024-12-05 14:54:35 -06:00
nai-degen 3496a2a9bd fixes incorrect 3.5 sonnet v2 model id in gcp model reassignment 2024-12-03 19:53:56 -06:00
nai-degen 5072638ec2 attempt at fixing persistent 'invalid csrf token' error on some browsers 2024-12-03 19:49:47 -06:00