Commit Graph

  • bcc83f30d9 Properly count DS reasoning tokens and properly save them per-user user 2025-01-25 11:06:38 +00:00
  • e5a26215e1 Add native Gemini model list endpoint user 2025-01-24 08:34:12 +00:00
  • cd6cc76a46 Attempt to improve the o1 timeout hack user 2025-01-22 15:23:17 +00:00
  • 613bb789fb fix error checking o1 deployments faggot 2024-12-18 00:54:44 +00:00
  • f1c698388e [Gemini] Support the new thinking config for 2.0 Flash Thinking user 2025-01-22 09:59:11 +00:00
  • 75605a2bfb Add preliminary deepseek-reasoner support user 2025-01-20 07:28:30 +00:00
  • 58e67d40e2 Check logging for AWS keys (untested) user 2025-01-20 11:28:53 +00:00
  • 796b4eee47 Make the OpenAI checker properly clone orgs again, and fix the error with the liveness check user 2025-01-06 06:55:41 +00:00
  • 0f482e67d2 Fix OpenAI -> Google AI conversion user 2025-01-05 14:02:17 +00:00
  • 496ec09905 Add v1alpha support (needed for 2.0 flash thinking with the new 'thought' parameter), already used by ST user 2025-01-04 19:11:02 +00:00
  • f522dba6a3 Fix errors with o1-preview and o1-mini user 2025-01-03 04:41:29 +00:00
  • 25ba8447d9 And add display names as well user 2025-01-03 04:41:29 +00:00
  • 91b8c01a9d Do the same for the AWS endpoint user 2025-01-03 04:41:29 +00:00
  • 82b88764ba Fix Anthropic model list to be actually compatible with Anthropic API (required by some frontends), remove old models user 2025-01-03 04:50:55 +00:00
  • 6ea9235ff8 Actually camelCase is canon for Gemini, oops user 2024-12-31 08:23:45 +00:00
  • 372ad85283 Support camelCase Gemini params and validate vision user 2024-12-31 10:16:04 +00:00
  • c2f5d2fbf3 Add /v1/models to deepseek user 2024-12-31 08:23:45 +00:00
  • c264413495 Leave a comment about concurrency for "special" user tokens user 2024-12-31 08:23:45 +00:00
  • 8d27082ad0 Fix formatting changes with upstream user 2024-12-31 08:23:45 +00:00
  • e2b602fd52 Adjust chunked transfer to send 4KB (CF's buffer size) of data every 49 seconds user 2024-12-31 08:23:45 +00:00
  • b00fb88cab Don't overwrite the reasoning effort by default user 2024-12-31 07:45:11 +00:00
  • 1cc281f6fe Add automatic prefill for Deepseek - works the same way as with Claude user 2024-12-31 07:45:11 +00:00
  • 8f4d00ed26 Init commit, some things: - 'Transfer-Encoding: chunked' for o1 requests to prevent CF's 100 second limit - Better tool/function call support - Deepseek support - Handling system as an array for AWS Claude - Image support for Gemini - Better o1 support (reasoning effort, developer role, context size) user 2024-12-31 00:00:00 +00:00
  • 75cb98c727 o1 "developer" role support and reasoning effort o1-role-reasoning penurin 2024-12-27 17:13:58 +00:00
  • 14d11d4368 o1 200k context o1-200k penurin 2024-12-27 17:21:05 +00:00
  • 2fbe0bff0d Check for o1 instead of o1-preview for the o1 family o1-check-family penurin 2024-12-27 17:17:39 +00:00
  • d5fd93f64a Adjust release o1 context to 200K penurin-main-patch-71429 penurin 2024-12-27 17:05:48 +00:00
  • 36e2430a8f adjusts gemini keychecker to trigger real generation for better rate limit detection nai-degen 2024-12-07 01:31:10 -06:00
  • 28447d0811 resolves server-side error when a Gemini prompt is blocked due to safety nai-degen 2024-12-07 00:54:13 -06:00
  • 6d54cbc785 maybe handles gemini daily key block idk nai-degen 2024-12-05 15:06:29 -06:00
  • 9d7a4f4b51 maybe fixes gemini's fucked error messages idk nai-degen 2024-12-05 14:54:35 -06:00
  • 3496a2a9bd fixes incorrect 3.5 sonnet v2 model id in gcp model reassignment nai-degen 2024-12-03 19:53:56 -06:00
  • 5072638ec2 attempt at fixing persistent 'invalid csrf token' error on some browsers nai-degen 2024-12-03 19:49:16 -06:00
  • 8a325a1e0b Add 2 million Google AI context (khanon/oai-reverse-proxy!82) Nopm 2024-12-04 01:27:21 +00:00
  • 5eeb2875b4 adds haiku 3.5 khanon 2024-11-04 18:20:19 +00:00
  • c67dad1617 fixes AWS claude v2 khanon 2024-10-25 18:10:26 +00:00
  • fe61745e24 fixes issue with AWS model assignment when requesting legacy claude (claude-2.1, etc) khanon 2024-10-25 17:49:24 +00:00
  • 251ea6d412 fixes typo in AWS ThrottlingException nai-degen 2024-10-23 15:36:21 -05:00
  • 55f7337ea4 adjusts AWS keychecker to treat rate limited models as available models nai-degen 2024-10-23 15:35:32 -05:00
  • f3b876887e fixes issue with AWS model name reassignment nai-degen 2024-10-23 13:59:10 -05:00
  • 49c578f4dc adds Sonnet 3.5v2 AWS model ID and adjusts AWS model assignment to raise error on no match nai-degen 2024-10-23 13:39:34 -05:00
  • 4190d5fef6 fixes missing comma....... khanon 2024-10-22 16:06:39 +00:00
  • 1644e82f25 adds Sonnet 20241022 snapshot IDs khanon 2024-10-22 15:52:52 +00:00
  • 0bbdc0b841 fixes google ai language filter and updates readme nai-degen 2024-10-18 23:15:02 -05:00
  • c4a633a5d6 fixes gcp oauth2 token refresh not updating cloned key instance nai-degen 2024-10-18 22:41:24 -05:00
  • 0c6ec3254f finally DOES something about broken GCP streaming, boebeitfully nai-degen 2024-10-12 20:10:59 -05:00
  • 13aa55cd3d handles gemini ai test message from sillytavern nai-degen 2024-10-12 09:01:08 -05:00
  • ba4532b38d more fixes for annoying gemini API design that allows arrays or single objects for contents parts nai-degen 2024-10-09 17:11:53 -05:00
  • b57627e69b adds stripHeaders to global mutators in createQueuedProxyMiddleware nai-degen 2024-10-09 16:59:12 -05:00
  • 536803853a uses removeHeader instead of setHeader to empty string nai-degen 2024-10-09 16:44:53 -05:00
  • ad0a3c0936 removes cors/sec-fetch headers to fix venus chub with anthropic api nai-degen 2024-10-09 16:32:48 -05:00
  • 161f5aba3e handles sillytavern using both camel and snake-cased parameters for gemini api nai-degen 2024-10-06 11:03:48 -05:00
  • 514d1b7e31 fixes azure nai-degen 2024-10-01 16:15:04 -05:00
  • 22d7f966c6 fixes for gemini api streaming nai-degen 2024-09-29 12:44:18 -05:00
  • cfb6353c65 updates google ai safety settings schema nai-degen 2024-09-25 21:19:17 -05:00
  • a7fed3136e fixes google ai gemini nai-degen 2024-09-25 15:58:52 -05:00
  • 29638cf26e minor cleanup to pow challenge nai-degen 2024-09-22 11:28:33 -05:00
  • ee26e7be65 various improvements and fixes to PoW challenge UI and token refresh nai-degen 2024-09-22 11:11:30 -05:00
  • ff0d3dfdcd prevents overwriting anthropic-version header if it's already provided nai-degen 2024-09-19 00:55:17 -05:00
  • 81a3ae1746 maybe fixes missing anthropic version header in some cases nai-degen 2024-09-19 00:50:17 -05:00
  • 4dfd57fcb4 updates render dockerfile to correctly copy patches dir into build context nai-degen 2024-09-16 23:39:43 -05:00
  • d21e274358 Add configurable network interface or SOCKS/HTTP proxy for outgoing requests (khanon/oai-reverse-proxy!80) khanon 2024-09-16 15:17:57 +00:00
  • 6e97e036b2 fixes refreshed PoW tokens not actually being reactivated nai-degen 2024-09-15 18:01:23 -05:00
  • 7a4a16dd2f fixes chatgpt-latest missing from models endpoint nai-degen 2024-09-15 06:02:35 -05:00
  • f1cfa644c5 maybe fixes openai sk-svcacct keys nai-degen 2024-09-12 23:05:43 -05:00
  • 6a908b09cb adds preliminary openai o1 support and some improvements to openai keychecker nai-degen 2024-09-12 22:55:45 -05:00
  • 86772ab32a adds 503 as a 'successful' AWS keychecker response to deal with temporary outages nai-degen 2024-09-10 16:07:13 -05:00
  • bd87ca60f7 Implement priority queue by tokens (khanon/oai-reverse-proxy!79) honeytree 2024-09-09 16:48:46 +00:00
  • ac1897fd17 returns more clear proxy_note hint on AWS 503 error nai-degen 2024-09-09 09:56:18 -05:00
  • 2a6f85e2e2 Revert "handles AWS HTTP 503 ServiceUnavailableException similarly to 429s" nai-degen 2024-09-09 09:43:59 -05:00
  • ffcaa23511 handles AWS HTTP 503 ServiceUnavailableException similarly to 429s nai-degen 2024-09-09 08:07:08 -05:00
  • 1d5b8efa23 reduces key lockout period to more quickly drain queue after AWS rate limit resolves nai-degen 2024-09-08 17:17:22 -05:00
  • 905273abf2 fixes aws mistral token cost estimation nai-degen 2024-09-08 17:15:59 -05:00
  • 9e6fd7c24c Implement tools (function calling) for Claude add-tool-fix-claude user 2024-08-16 15:45:49 +03:00
  • ac92a19946 improves reliability of inference profile detection for AWS keychecker nai-degen 2024-09-07 17:36:29 -05:00
  • 96fe974ad0 Use AWS Inference Profiles for higher rate limits (khanon/oai-reverse-proxy!78) khanon 2024-09-01 22:55:07 +00:00
  • 578615fbd2 fixes typo in new Claude system prompt schema nai-degen 2024-08-30 10:23:57 -05:00
  • 5dc4050e52 disable periodic GCP key rechecks to workaround keychecker bug nai-degen 2024-08-29 15:25:37 -05:00
  • cf615ee62c applies prettier to GCP checker nai-degen 2024-08-29 15:15:56 -05:00
  • ee61f9be2b removes unnecessary log from last commit nai-degen 2024-08-27 23:58:32 -05:00
  • 0c448cb59d fixes azure dalle using wrong rate limit and out-of-spec Retry-After header nai-degen 2024-08-27 23:43:11 -05:00
  • 51a9ccceb2 supports alternate claude system prompt format nai-degen 2024-08-27 23:27:20 -05:00
  • ce490efd7d minor adjustments to HMAC signing nai-degen 2024-08-22 19:53:53 -05:00
  • 5000e59a61 fix for google makersuite prompt validation/transformation nai-degen 2024-08-22 14:18:18 -05:00
  • 27cbb7aaaf Make deprecated function_call OpenAI field check less strict oai-deprecated-toolcall-fix user 2024-08-16 16:09:35 +03:00
  • d54acad6ad adds support for sonnet 8192 output tokens on anthropic api nai-degen 2024-08-15 11:55:13 -05:00
  • 5e1fffe07d adds chatgpt-4o-latest nai-degen 2024-08-15 11:54:42 -05:00
  • f7fd5f00f2 fixes nai-degen 2024-08-14 14:41:47 -05:00
  • 6d323f6ea1 do not transform mistral chat prompts to text when using la plateforme nai-degen 2024-08-14 12:26:27 -05:00
  • 2959ed3f7f fixes aws keychecker not detecting claude 2.1 nai-degen 2024-08-14 10:44:55 -05:00
  • b58e7cb830 always applies Mistral prompt fixes on messages input nai-degen 2024-08-14 10:44:22 -05:00
  • cfc1290f83 fixes aws keychecker not detecting claude 2.1 aws-mistral nai-degen 2024-08-14 10:44:55 -05:00
  • 14f228f666 always applies Mistral prompt fixes on messages input nai-degen 2024-08-14 10:44:22 -05:00
  • f531272b00 Refactor AWS service code and add AWS Mistral support (khanon/oai-reverse-proxy!75) khanon 2024-08-14 04:40:41 +00:00
  • d264fdd573 adds mistral chat-to-text transformation, for better prefix compatibility nai-degen 2024-08-13 23:16:31 -05:00
  • 9c3e345720 update deps nai-degen 2024-08-13 20:31:19 -05:00
  • 37c421bb45 fixes token counting for streaming Mistral Text prompts nai-degen 2024-08-13 20:27:31 -05:00
  • 6c5fed90e2 rename function nai-degen 2024-08-13 20:15:14 -05:00
  • 9479fa4ab0 serviceinfo tweak nai-degen 2024-08-13 20:13:46 -05:00
  • e145f5757e implements aws mistral streaming nai-degen 2024-08-13 20:04:02 -05:00