Compare commits

...

2873 Commits

Author SHA1 Message Date
claude[bot] f5f491bb86 fix: align Anthropic Files tooltip with dropdown labels and correct default
The tooltip used stale labels (Off/Inline/Inline + Free) and marked "Off"
as the default, but the dropdown uses Show/Embed/Embed + Free and the
actual default is "Embed" (inline-file). Updated tooltip to match.

Fixes #1076

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2026-04-24 04:20:19 +00:00
Enrico Ros 9bb178413b Upstream removal: Update language 2026-04-23 15:46:07 -07:00
Enrico Ros d85f0ebfc4 AIX: Upstream Deletion via both CSF or tRPC 2026-04-23 15:29:39 -07:00
Enrico Ros 8f84dc2f24 AIX: more clenaups 2026-04-23 15:09:24 -07:00
Enrico Ros c8b4301bcd AIX: Client: cleanups 2026-04-23 14:50:21 -07:00
Enrico Ros bd8eaf0b9f Bits 2026-04-23 14:03:46 -07:00
Enrico Ros a4148cf694 BlockPartModelAux: reasoning: detection of markdown ignores newlines 2026-04-23 14:02:42 -07:00
Enrico Ros 4cb0b493dc BlockPartPlaceholder: timeout of a week for very long ops 2026-04-23 13:51:26 -07:00
Enrico Ros e6354e9089 ChatMessage: link to star 2026-04-23 13:42:19 -07:00
Enrico Ros 08506abaee AIX: Gemini Interactions: rewrap to nullify retry errors, to not re-submit expensive Deep Research runs 2026-04-23 03:08:10 -07:00
Enrico Ros 078c80d572 AIX: Gemini Interactions: full SSE migration (no post + get_loop anymore) 2026-04-23 02:15:35 -07:00
Enrico Ros b1c9f6be45 AIX: ContentReassembler: improve not-wiping ops on conent (yes to wiping vp) 2026-04-23 02:15:35 -07:00
Enrico Ros fc497e9beb AIX: Gemini Interactions: improve wires 2026-04-23 02:15:35 -07:00
Enrico Ros 6ad01fd981 AIX: update _upstream 2026-04-23 02:15:35 -07:00
Enrico Ros 44ed8664c8 Placeholder: elevate to 6hrs the UI counter 2026-04-22 23:28:27 -07:00
Enrico Ros 4cb16ee715 AIX: upstream protocol docs cache; local copy of the specs, let's see if it's useful 2026-04-22 23:16:14 -07:00
Enrico Ros 2dc9b87cda AIX: Gemini Interactions: allow multiple reattaches, use the Gemini default timeout 2026-04-22 22:29:57 -07:00
Enrico Ros 0e587c4889 ContentReassembler: report image conversion fail 2026-04-22 19:15:57 -07:00
Enrico Ros 41d42d82fb AIX: Gemini interactions: improvements 2026-04-22 18:28:16 -07:00
Enrico Ros f703c8a8c9 AIX: Cleanups 2026-04-22 18:20:40 -07:00
Enrico Ros bf753eab55 AIX: XAI: support for reasoning items 2026-04-22 17:59:32 -07:00
Enrico Ros 698b67af06 AIX: Gemini Interactions: images are usually figures, skip conversion 2026-04-22 17:59:10 -07:00
Enrico Ros 377d61056a ChatMessageList: resume: remove pending and update only on done 2026-04-22 17:50:54 -07:00
Enrico Ros 94b32c8fe3 BlockOpUpstreamResume: update style 2026-04-22 17:47:25 -07:00
Enrico Ros 1e70a59ad6 ChatMessageList: do not remove the connection button on error - tradeoff: less removal on 404, but not removal on network error 2026-04-22 17:40:38 -07:00
Enrico Ros 44d05181f4 AIX: Gemini Interactions: update with support for Audio, Thougts 2026-04-22 17:37:20 -07:00
Enrico Ros 996998a5cc AIX: Gemini Interactions: In/Out of images, Out of Audio 2026-04-22 17:37:20 -07:00
Enrico Ros 98474b2721 AIX: OpenAI Responses: Encrypted reasoning 2026-04-22 17:37:20 -07:00
Enrico Ros 198dc0e23f AIX: OpenAI Responses: Wires: allow encrypted reasoning 2026-04-22 17:37:20 -07:00
Enrico Ros 079731c573 ContentReassembler: recreate Ops placeholder part with anchored CTS, to show the correct start time 2026-04-22 17:37:20 -07:00
Enrico Ros 492c89650a tRPC 4xx (!ok): return quoted reason 2026-04-22 17:35:56 -07:00
Enrico Ros 5b5bbb7649 dMessageUtils: update gemini namings 2026-04-22 16:15:54 -07:00
Enrico Ros 27d1f081ab AI Inspector: memstored collapse state, and disable auto-snap 2026-04-22 16:15:54 -07:00
Enrico Ros 76183fd840 LLMs: flush openai 2026-04-22 13:55:56 -07:00
Enrico Ros 345165eabf AIX: fix OpenAI Responses - Tool re-parsing 2026-04-22 13:52:14 -07:00
Enrico Ros c186732b3b AI Inspector: collapsable header/body/particlews 2026-04-22 13:39:14 -07:00
Enrico Ros 04916b700e AIX: Reattach: preserve the creation time and don't overwrite handle on reattach 2026-04-22 13:15:25 -07:00
Enrico Ros 013dab185c ChatMessageList: remove the button if Gemini Reattach is 404 2026-04-22 13:15:25 -07:00
Enrico Ros 5ab93faccf AIX: ContentReassembler: document filtering causes 2026-04-22 13:15:25 -07:00
Enrico Ros fa301e3675 Chat: move message timestamps to the tooltip 2026-04-22 13:01:05 -07:00
claude[bot] fa6e7dd9c5 Chat: show inline message timestamps - TimeAgo for today, locale date for older
Shows a lightweight timestamp at the bottom of each message in the chat view.
Messages from today or currently generating display relative time via TimeAgo.
Older messages show the locale-formatted date. Hidden in zen mode and during edits.

Closes #1065
2026-04-22 12:45:19 -07:00
Enrico Ros 01736ad5da AIX: ContentReassembler: consider refusals as failure modes, and add the error messages too 2026-04-22 02:49:20 -07:00
Enrico Ros ce682b1f85 AIX: Client: CSF for resuming chat generate content 2026-04-22 02:34:22 -07:00
Enrico Ros 96d801f40a AIX: Gemini Interactions: elevate polling to 10 2026-04-22 02:15:20 -07:00
Enrico Ros 8985868f63 Roll AIX 2026-04-22 01:35:55 -07:00
Enrico Ros 8febdcd0c0 ChatMessageList: pass generator to the resume request 2026-04-22 01:35:18 -07:00
Enrico Ros 4d21d5134a AIX: Gemini Interactions: Alpha support - fresh with 2-stages poller, plus reconnect - both feed into the same stream and parser 2026-04-22 01:23:36 -07:00
Enrico Ros 09d44a4314 AIX: Dispatch: custom-connect suppport, for non-single-request solutions; add connection ownership 2026-04-22 01:22:44 -07:00
Enrico Ros 40066e975a AIX: Dispatch: cleanup router 2026-04-22 01:14:17 -07:00
Enrico Ros 202382c80a BlockPartPlaceholder: format as min/sec 2026-04-22 00:57:26 -07:00
Enrico Ros 6ffbb32c57 BlockPartPlaceholder: bump seconds timer to 1h 2026-04-22 00:54:11 -07:00
Enrico Ros 9b8a3ca503 ChatMessage: wire resume 2026-04-22 00:49:33 -07:00
Enrico Ros cdd7892077 AIX: Client: resume support with entry point for future resumes 2026-04-22 00:46:54 -07:00
Enrico Ros 974aa12137 DMessages/AIX: broaden upstreamHandle 2026-04-21 16:15:37 -07:00
Enrico Ros d8f8999333 LLMs: extract isometric/free host matching 2026-04-21 15:26:34 -07:00
Enrico Ros 0efd87b522 T2I: GPT-Image-2 support 2026-04-21 14:16:25 -07:00
Enrico Ros ec76e1c5cf AIX: OpenAI Responses: image generation: add details 2026-04-21 14:16:25 -07:00
Enrico Ros 1e04efe748 LLMs: OpenAI: gpt-image-2 non chat 2026-04-21 14:16:25 -07:00
Enrico Ros 69c135ae78 LLMs: Moonshot: add Kinmi K2.6 2026-04-20 23:22:47 -07:00
Enrico Ros 205fb1bb5b LLMs: sync scores 2026-04-20 23:13:56 -07:00
Enrico Ros c8e7315de3 Roll AIX 2026-04-20 22:16:58 -07:00
Enrico Ros 725f3b0fd7 AutoBlocksRenderer: optimize inline md flicker (zenMode only)
Uses a 512 chars bypass too.
2026-04-20 17:46:18 -07:00
Enrico Ros 7ee3701607 AIX Inspector: bits 2026-04-20 16:05:28 -07:00
Enrico Ros 9537ce59e8 LLM: cap initial max response to 128k 2026-04-20 16:05:28 -07:00
Enrico Ros 6c0a60e0d1 CSF/DC: update docs 2026-04-20 16:05:27 -07:00
Enrico Ros 436a858cb0 AIX Inspector: render as code 2026-04-20 10:46:40 -07:00
Enrico Ros 6ea6c55f65 LLMs: xAI: align-params 2026-04-20 10:46:40 -07:00
Enrico Ros c477fa86ce LLMs: Gemini: align-params 2026-04-20 10:46:40 -07:00
Enrico Ros 08cd5ed5b6 LLMs: OpenAI/Gemini: sync Fn, remove Json 2026-04-20 10:46:40 -07:00
Enrico Ros b5f2cd35f2 LLMs: deprecate Json 2026-04-20 10:46:40 -07:00
Enrico Ros 4cb0f6d67e Parameters sweep: xAI 2026-04-20 10:45:58 -07:00
Enrico Ros 5260ec68cc Remove Code Editors - unused 2026-04-19 22:47:22 -07:00
Enrico Ros 72ce4d2884 CC: Sweep: update verify-params 2026-04-19 22:26:06 -07:00
Enrico Ros ed65f989d9 Parameters sweep: Ant,Gem,Oai 2026-04-19 22:26:06 -07:00
Enrico Ros 588ebf4993 Sweep: add fn(auto,required,roundtrip) measure 2026-04-19 22:26:06 -07:00
Enrico Ros 22969033a7 LLMs: Gemini: restore 3 Pro Preview (still served) 2026-04-17 13:59:32 -07:00
Enrico Ros 8b5e00480b Tests: +listModels: list models for all vendors and show Model IDs per test (or DEV warns/failures)
Each test either asserts live behavior or skips with "needs <ENV_VAR>" -no silent passes.

Requires:
- NODE_ENV=development (DEV-gated validators; ESM hoisting blocks setting
  it in-file)
- Per-vendor API keys; otherwise that dialect is skipped

Returns:
- Model IDs per test in IntelliJ's output panel / spec reporter
- Failure on any [DEV] validator output, pointing at llms:update-models-*
- 4 no-creds live tests (perplexity, minimax-host, zai, openrouter)

Run: NODE_ENV=development npx tsx --test src/modules/llms/server/listModels.test.ts
2026-04-17 13:52:53 -07:00
Enrico Ros aaf752fa9c CC: code:thread-progress 2026-04-17 12:48:05 -07:00
Enrico Ros 82d3b36048 Roll AIX 2026-04-17 08:31:50 -07:00
Enrico Ros 588c81f9ad AnthropicSkillsConfig: improve select (now default) 2026-04-17 00:11:33 -07:00
Enrico Ros 4013a3f997 ConversationsManager: early null warning 2026-04-16 16:37:33 -07:00
Enrico Ros 5823e18904 ExpanderSection: allow persistent divider 2026-04-16 16:36:46 -07:00
Enrico Ros 31ea6863aa LLMs: OpenRouter: extend 2026-04-16 15:57:03 -07:00
Enrico Ros f3f58f26ae LLMs: Chutes: parse model attributes 2026-04-16 15:57:03 -07:00
Enrico Ros 67132f285e LLMs: Misc: sync models 2026-04-16 15:57:03 -07:00
Enrico Ros 20a638a8c9 LLMs: Ollama: sync models 2026-04-16 15:57:00 -07:00
Enrico Ros c9174e995f LLMs: Anthropic: notes on API reported tokens 2026-04-16 15:56:55 -07:00
Enrico Ros 656c507c94 LLMs: Gemini: add 3.1 Flash TTS and Robotics-ER 1.6 2026-04-16 15:56:55 -07:00
Enrico Ros a1fb744eb1 Share Service Add Button 2026-04-16 15:56:42 -07:00
Enrico Ros 28367547fd LLMs: Ant Skills: show by default nwo 2026-04-16 14:14:35 -07:00
Enrico Ros 6610211eac AIX: Ant: change tools label 2026-04-16 14:14:23 -07:00
Enrico Ros b66e3e2afa LLMs: Anthropic 1M tokens review 2026-04-16 13:39:39 -07:00
Enrico Ros 4bf965953a LLMs/AIX: Anthropic Claude Opus 4.7 full support 2026-04-16 13:30:40 -07:00
Enrico Ros 1bd6513d59 AIX: ReassemberTransform: Embed/Delete Anthropic Files 2026-04-16 04:55:04 -07:00
Enrico Ros 6ce457913e AIX: ReassemblerParticleTransform: framework 2026-04-16 04:34:20 -07:00
Enrico Ros ef84ca5a04 AIX: CSF Dispatch: remove unsafe transforms that the client does not support 2026-04-16 04:10:18 -07:00
Enrico Ros f76524c650 BlockPartHostedResource: show meta info message on error 2026-04-16 04:03:33 -07:00
Enrico Ros 0be676229f AIX: Dispatch: tag the Anthropic server-side tranform-fileInline as csfUnsafe because incurs into CORS block by the Anthropic File API servers 2026-04-16 03:44:44 -07:00
Enrico Ros 40a0ca7235 Bits 2026-04-16 03:41:01 -07:00
Enrico Ros 1563c3a9dc Improve more image icons usage 2026-04-16 01:22:50 -07:00
Enrico Ros 80f32be80d Improve image icons usage 2026-04-16 01:15:09 -07:00
Enrico Ros eea53714cc Ph: add ImaegeSquare, Wrench 2026-04-16 01:12:25 -07:00
Enrico Ros 148f1ec22c ExpanderControlledBox: fix a regression with ERC's OverlayButtons' viewport-fixed positioning 2026-04-14 16:12:18 -07:00
Enrico Ros b5a2a70e73 RenderCode: correctly un-tooltip code being written 2026-04-14 15:30:14 -07:00
Enrico Ros e7667e4b7d AIX: Resiliency: detect hard TLS disconnections by the AI service 2026-04-14 15:02:43 -07:00
Enrico Ros 9250eb9aff RenderCode: change fullscreen wrapper 2026-04-14 14:41:09 -07:00
Enrico Ros 92883caaab RenderCode: extract syntax style 2026-04-14 14:38:53 -07:00
Enrico Ros 6d57450efc RenderCode: extract styles 2026-04-14 14:33:38 -07:00
Enrico Ros 5dd4c600ea Roll packages misc 2026-04-14 14:17:34 -07:00
Enrico Ros 392a3b7949 Optimization: Code: memo style & stable copy handler 2026-04-14 14:11:40 -07:00
Enrico Ros e22c40c7e4 Optimization: Memo reasoning Chips (UI component) 2026-04-14 13:56:55 -07:00
Enrico Ros c7abee6969 Roll packages 2026-04-14 13:46:20 -07:00
Enrico Ros 4772e63fdb SyncExternalStorable: helper for useSyncExternalStore, to declare state/snapshot + a lifecycle hook helper 2026-04-14 13:40:20 -07:00
Enrico Ros f3d7abefec Optimization: Code Render Highlight: throttle highlighting of large/partial streams 2026-04-14 13:03:17 -07:00
Enrico Ros ac76b156cf Optimization: Code Render Highlighted: defer pre-coloring, saving even more CPU, avoiding system instability 2026-04-14 11:34:39 -07:00
Enrico Ros 97e65efc31 Optimization: Code Render Highlighted: defer code syntax coloring 2026-04-14 09:46:54 -07:00
Enrico Ros 13dcaa0a57 Optimization: Code language infer: max input size 2026-04-14 09:46:54 -07:00
Enrico Ros 1f42b0ae66 Optimization: AutoBlocks: don't bust fianlized sub-fragment Code/Markdown blocks 2026-04-14 09:46:54 -07:00
Enrico Ros 003a50f181 Optimization: Draft attachments: cache tokens compute 2026-04-14 09:45:16 -07:00
Enrico Ros 32c5849a50 useShallowObject: add lint rule for deps 2026-04-14 09:44:53 -07:00
Enrico Ros 44a8ee0593 useShallowObject: add useMemoShallowStable 2026-04-14 09:44:53 -07:00
Enrico Ros 1ad70c7b1b AGENTS.md as symlink 2026-04-12 23:12:05 -07:00
Enrico Ros 7413983159 Per-Chat stores: debug break on lazy creation 2026-04-12 16:54:32 -07:00
Enrico Ros 6c3e8c6a8f Per-Chat stores: lazily create (with warning)
This also breaks the eager module-scope chain that caused
UKV:exec-defaults getState() to fire 14+ times during build.
2026-04-12 16:54:28 -07:00
Enrico Ros 7e3e9854ac FormInputKey: update icons 2026-04-12 16:51:41 -07:00
Enrico Ros 41fc93345c FormInputKey: allow size 2026-04-12 16:51:40 -07:00
Enrico Ros b9275177e3 Merge pull request #1066 from enricoros/dependabot/github_actions/docker/build-push-action-7.1.0
chore(deps): bump docker/build-push-action from 7.0.0 to 7.1.0
2026-04-12 12:22:06 -07:00
Enrico Ros 5ea95e4095 Merge pull request #1067 from enricoros/dependabot/github_actions/actions/upload-artifact-7.0.1
chore(deps): bump actions/upload-artifact from 7.0.0 to 7.0.1
2026-04-12 12:21:56 -07:00
dependabot[bot] 0ea041ed5b chore(deps): bump actions/upload-artifact from 7.0.0 to 7.0.1
Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 7.0.0 to 7.0.1.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/bbbca2ddaa5d8feaa63e36b76fdaad77386f024f...043fb46d1a93c77aae656e7c1c64a875d1fc6a0a)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-version: 7.0.1
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-04-11 11:32:23 +00:00
dependabot[bot] 037e3b62d8 chore(deps): bump docker/build-push-action from 7.0.0 to 7.1.0
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 7.0.0 to 7.1.0.
- [Release notes](https://github.com/docker/build-push-action/releases)
- [Commits](https://github.com/docker/build-push-action/compare/d08e5c354a6adb9ed34480a06d141179aa583294...bcafcacb16a39f128d818304e6c9c0c18556b85f)

---
updated-dependencies:
- dependency-name: docker/build-push-action
  dependency-version: 7.1.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-04-11 11:32:19 +00:00
Enrico Ros 517c18c902 BlockPartHostedResource: enable always-embed 2026-04-10 03:17:23 -07:00
Enrico Ros 685b5c5130 AIX: Apply global AI settings 2026-04-10 03:17:07 -07:00
Enrico Ros cfdab2f900 Anthropic File Embed: change global setting 2026-04-10 03:17:07 -07:00
Enrico Ros 1a743ff264 Store-AI: global AI settings 2026-04-10 03:17:07 -07:00
Enrico Ros 85463fafb1 AIX: Anthropic: Container -> File API Images inlining 2026-04-10 03:17:07 -07:00
Enrico Ros 0641b0df97 AIX: Anthropic: Container -> File API Text inlining 2026-04-10 03:17:07 -07:00
Enrico Ros 98825081a9 AIX: Dispatch: ChatGenerateParticleTransformFunction 2026-04-10 03:17:07 -07:00
Enrico Ros f549c13465 LLMs: Anthropic: file get metadata schema 2026-04-10 00:40:48 -07:00
Enrico Ros 8bf7fd7106 BlockPartHostedResource: support copy of images 2026-04-09 17:07:30 -07:00
Enrico Ros d8d889c706 BlockPartHostedResource: remove disabled button 2026-04-09 16:24:46 -07:00
Enrico Ros 90665ed84a BlockPartPlaceholder.tsx: space inputs 2026-04-09 16:24:15 -07:00
Enrico Ros dd3d10a391 MIME: support for Anthropic File API mimes 2026-04-09 16:24:04 -07:00
Enrico Ros 19ebd399a8 AIX: Anthropic: parser: improve server_tool_use ends 2026-04-09 16:22:25 -07:00
Enrico Ros f21a2973e9 BlockPartHostedResource: further simplify Anthropic ops 2026-04-09 15:15:27 -07:00
Enrico Ros 04bb8f9c12 BlockPartHostedResource: improve Anthropic documents fetching (with cache), more reliable 2026-04-09 14:54:55 -07:00
Enrico Ros 5ea63c8734 LLMs: store: FP NOTICE - hook to resolve access for a llm id
Note: to make this portable across machines with different configurations, we also allow to fall back
to the first access of a certain type for the
llm.
2026-04-09 14:36:26 -07:00
Enrico Ros f4f4ad9373 LLMs: improve DModelVendor typings, and reverse Id->type mappings 2026-04-09 14:25:30 -07:00
Enrico Ros ba06d70c05 BlockPartHostedResource: Anthropic text documents inlining 2026-04-09 05:43:31 -07:00
Enrico Ros 62ddd17715 Fix Chat Store improper zero-state.
No point in always inserting a chat beyond the zero-state (i.e. even if many are loaded).
Now it's only created if it matters.
2026-04-09 04:20:53 -07:00
Enrico Ros f76db1d19e Merge branch 'main' of https://github.com/alexdenton123/big-AGI into alexdenton123-main 2026-04-08 19:02:09 -07:00
Enrico Ros f0901dbc03 Merge pull request #1063 from enricoros/claude/issue-1061-20260406-2309
fix: hide voice features in browsers without Speech Recognition support
2026-04-08 19:00:22 -07:00
Enrico Ros c65a2ce387 LLMs: Anthropic: auto-detect features and 0-day models (Mythos) 2026-04-08 18:41:17 -07:00
Enrico Ros eaee372938 ModelsList: bolden customized models 2026-04-08 18:08:32 -07:00
Enrico Ros d8836534cb LLMs: Anthropic: Sonnet 4.6 can effort: max 2026-04-08 18:01:53 -07:00
Enrico Ros 7d2e64b458 LLMs: Ollama: relax enrichment 2026-04-08 17:38:40 -07:00
Enrico Ros bc942c5581 LLMs: Ollama: fix missing descriptions 2026-04-08 17:35:44 -07:00
Alex Denton 4ca24f8314 Add 'New Chat on Startup' setting to reopen last chat
Adds a toggle in Preferences > Chat that controls whether a new empty
chat is created on startup (default, current behavior) or the last
conversation is reopened instead.
2026-04-09 00:24:07 +02:00
Enrico Ros b299dec68e BlockPartPlaceholder: spacing 2026-04-08 14:14:05 -07:00
Enrico Ros b9f07d011b CC: code:thread-progress 2026-04-07 23:39:02 -07:00
Enrico Ros 9259be8dbb CC: +code:thread-progress 2026-04-07 20:39:29 -07:00
Enrico Ros 4b0b7c4493 ChatMessage: add info popup 2026-04-07 16:42:29 -07:00
Enrico Ros 73f0760809 BlockPartHostedResource: follow-ups 2026-04-07 14:25:09 -07:00
Enrico Ros db6c2b1620 BlockPartHostedResource: alllow deletion, copy 2026-04-07 13:45:58 -07:00
Enrico Ros 1233e846db Beam Gather: error improvements 2026-04-07 13:38:20 -07:00
Enrico Ros 27312537a7 AIX Client errors: reflect 413 2026-04-07 13:38:20 -07:00
Enrico Ros 1dfd4d8395 BlockPartHostedResource: fix: react to service changes 2026-04-07 05:09:15 -07:00
Enrico Ros ccd9f0980f AIX: bits 2026-04-07 04:50:00 -07:00
Enrico Ros 5cc48d24ec AIX: Anthropic: Download Files (AIX hosted resource support) 2026-04-07 04:31:34 -07:00
Enrico Ros 7929d4eb30 AIX: Anthropic: Containers support (1h) - allows to continue Skills, bash sessions, dynamic filtering, temp files, etc. 2026-04-07 03:06:29 -07:00
Enrico Ros 14c5c83f91 AIX: Anthropic: Dispatch: Headers (beta flags) update 2026-04-07 01:36:21 -07:00
Enrico Ros 263412c422 AIX: Anthropic: Files download 2026-04-07 00:21:35 -07:00
Enrico Ros d395fa817d AIX: Anthropic: Containers: parsing support via 'svs' 2026-04-07 00:21:35 -07:00
Enrico Ros 9cfc8c513b AIX: Anthropic: show return code errors in blocks 2026-04-06 16:44:04 -07:00
Enrico Ros c92a1cfcb1 Merge pull request #1059 from enricoros/dependabot/github_actions/docker/login-action-4.1.0
chore(deps): bump docker/login-action from 4.0.0 to 4.1.0
2026-04-06 16:17:39 -07:00
claude[bot] f45e45ca8f fix: hide voice features in browsers without Speech Recognition support
Add Brave browser detection to pwaUtils.ts since Brave exposes the
SpeechRecognition API but silently blocks it from returning results,
causing false positive feature detection.

- Add `Is.Browser.Brave` detection via `navigator.brave` property
- Update `browserSpeechRecognitionCapability()` to mark Brave as
  unsupported with a clear warning message
- Gate Call button (both mobile and desktop) on speech recognition
  capability since Call fundamentally requires voice input
- CallWizard already displays capability warnings, so Brave users
  navigating directly to /call will see a clear explanation

Closes #1061

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2026-04-06 23:12:45 +00:00
Enrico Ros e44d4b8b01 AIX: make stop reason setting idempotent 2026-04-06 16:05:50 -07:00
Enrico Ros c342f553db AIX: bits 2026-04-06 16:05:27 -07:00
Enrico Ros 2fab208ccf package: tsclint 2026-04-06 13:59:53 -07:00
Enrico Ros eab3eee19f roll posthog 2026-04-06 13:59:41 -07:00
Enrico Ros fcb3903b5f AIX: flush bits 2026-04-06 13:43:41 -07:00
Enrico Ros 90ccb64bd0 AIX: Core: immutable structural sharing, typed outcome, improve layering
Refactor the AIX client streaming pipeline for Zustand-style immutability
and clean outcome classification, preparing for the agentic execution layer.

ContentReassembler:
- All fragment mutations go through _pushFragment/_replaceFragmentAt/_spliceFragment
  (new array refs per update, no in-place mutation)
- Generator fields (genModelName, genProviderInfraLabel, genUpstreamHandle,
  legacyGenTokenStopReason) consolidated into a single `generator` object,
  replaced immutably when particles arrive
- _classifyTermination() replaces _deriveTokenStopReasonOrAppendError() -
  pure function returning { outcome, tsr, errorMessage }
- finalizeReassembly() returns AixChatGenerateContent_LL_Result (extends
  streaming type with outcome + cgMetricsLg) instead of void
- Initial state snapshot for full reset (replaces initialGenerator field)

Type system:
- AixChatGenerateContent_LL: streaming-only (fragments + generator)
- AixChatGenerateContent_LL_Result: extends LL with outcome + cgMetricsLg
- AixChatGenerateTerminal_LL: 'completed' | 'failed' | 'aborted'
- Outcome flows LL -> L2 -> L3 without leaking into DMessage/stores
- Unified vocabulary throughout (no more success/errored mapping)

LL streaming loop:
- Restructured with break/continue for guaranteed finalizeReassembly()
- Drain in-flight processing before retry/terminal decisions
- Abort-during-retry-backoff surfaces original error (not 'aborted')
- Retryable path first, terminal fallthrough

Callers:
- Remove structuredClone() calls (structural sharing makes them unnecessary)
- Spread fragments/generator directly into stores
2026-04-06 12:53:33 -07:00
Enrico Ros 1772db5e98 AIX: Anthropic Wires/Parser: improve 2026-04-05 07:50:39 -07:00
Enrico Ros a04ee4de95 LLMs: openai-autocomplete: improve descriptions 2026-04-04 18:21:53 -07:00
Enrico Ros 73b6a54f9e LLMs: MiniMax: update models 2026-04-04 15:48:15 -07:00
Enrico Ros 52b08b407c ChatMessage: optimize messageFragmentsReduceText 2026-04-04 15:27:22 -07:00
Enrico Ros 269a3a9991 CC: llms:update-models-minimax 2026-04-04 13:51:41 -07:00
Enrico Ros 1b2050cd96 MiniMax: add auto-configure UI option and icon 2026-04-04 13:51:25 -07:00
Enrico Ros a71dd5e3aa LLMs: add MiniMax with hardcoded models 2026-04-04 13:51:25 -07:00
Enrico Ros 8d91ea0413 AIX: ListModels: support server-side env-var match 2026-04-04 13:33:21 -07:00
dependabot[bot] 81b39c7f9c chore(deps): bump docker/login-action from 4.0.0 to 4.1.0
Bumps [docker/login-action](https://github.com/docker/login-action) from 4.0.0 to 4.1.0.
- [Release notes](https://github.com/docker/login-action/releases)
- [Commits](https://github.com/docker/login-action/compare/b45d80f862d83dbcd57f89517bcf500b2ab88fb2...4907a6ddec9925e35a0a9e82d7399ccc52663121)

---
updated-dependencies:
- dependency-name: docker/login-action
  dependency-version: 4.1.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-04-04 11:32:21 +00:00
Enrico Ros a3200e1aab AIX: remove the pause_continue token stop reason, as we handle it in the server-side loop 2026-04-03 16:21:12 -07:00
Enrico Ros 4c8fa8e477 next.config: restore 2026-04-03 12:09:31 -07:00
Enrico Ros f64aae10c5 /tools: add tsconfig for node types 2026-04-03 11:48:14 -07:00
Enrico Ros bd8f484cd2 next.config: single title on build, not dev (4) 2026-04-02 23:47:54 -07:00
Enrico Ros 4c3151e3be roll posthog 2026-04-02 23:08:25 -07:00
Enrico Ros 4e3377f1df roll eslint 2026-04-02 23:06:42 -07:00
Enrico Ros f95b643a5c roll Typescript 6.0.2 2026-04-02 23:05:21 -07:00
Enrico Ros 85083f323d AIX: OpenaI parser: cleanup 2026-04-02 22:36:33 -07:00
Enrico Ros b884386143 LLMs: link autocomplete icons 2026-04-02 22:33:08 -07:00
Enrico Ros 01a8d858cf LLMs: optimize icons 2026-04-02 22:32:54 -07:00
Enrico Ros 08fed36a61 LLMs: add new icons 2026-04-02 22:17:12 -07:00
Enrico Ros f8b110e108 LLMs: Groq: update models 2026-04-02 21:35:12 -07:00
Enrico Ros b78b0f1323 LLMs: OpenAI autocomplete: icons support 2026-04-02 21:31:29 -07:00
Enrico Ros 148c0b1d77 CC: do not git stash 2026-04-02 21:20:09 -07:00
Enrico Ros fe501831b2 Roll AIX 2026-04-02 21:15:39 -07:00
Enrico Ros 1862b72ba5 LLMs: Arcee AI: add OpenAI-compatible provider, with auto-qualified models 2026-04-02 21:12:47 -07:00
Enrico Ros a609071966 AIX: Arcee AI: fix for null fields (protocol breakage; they shall be optional, not null) 2026-04-02 20:50:10 -07:00
Enrico Ros dc2d162e6e Roll AIX 2026-04-02 18:02:43 -07:00
Enrico Ros 07f2cd291e AIX: Arcee AI: fix trinity with some null chatCompletions fields 2026-04-02 18:02:43 -07:00
Enrico Ros a6e040e3e5 AIX: Reassembly: Snapshots for continuation & retry reset
Also includes:
- move Accumulator inside the Reassembler
- drain in-flight before client-retry
- improved continue/retry visualizations
- retrier has less status inside (removed resume handle, now external)

Fixes #1055
2026-04-02 17:36:43 -07:00
Enrico Ros 3e6cfc9775 AIX: Anthropic: parser: signal text vs bash script 2026-04-02 16:13:59 -07:00
Enrico Ros 0e2abd2615 CC: slashcommands: update llms:ollama 2026-04-02 12:50:06 -07:00
Enrico Ros 394e79510e LLMs: Ollama: sync models 2026-04-02 12:47:57 -07:00
Enrico Ros 848977820e LLMs: LocalAI: search models 2026-04-02 12:17:09 -07:00
Enrico Ros c893f1969c LLMs: Gemini: add Gemma4 models 2026-04-02 11:59:49 -07:00
Enrico Ros bb9a8b81d1 BlockPartModelAux: improve Reasoning render 2026-03-31 18:04:19 -07:00
Enrico Ros 188b338bdc CC: mv -> git mv 2026-03-31 18:04:19 -07:00
Enrico Ros 463ef406a7 AIX: CG Eexecutor: Anthropic retry 2026-03-31 13:06:46 -07:00
Enrico Ros a916ff46dc BlockPartPlaceholder: shrink 2026-03-31 13:06:43 -07:00
Enrico Ros db3a5c0b1b BlockPartPlaceholder: improve quality of render with timeouts 2026-03-30 23:21:45 -07:00
Enrico Ros b760250da1 AIX: CGR: respect source ordering of messages, and remove the Tool messages in AIX up 2026-03-28 09:04:12 -07:00
Enrico Ros b5829ac541 AIX: Gemini: comment 2026-03-28 06:55:53 -07:00
Enrico Ros fa4f2b8fcd AIX: Reassembler: persistence-of-vision of 500ms for VP fragments
Allows for a more natural read.
2026-03-28 06:55:49 -07:00
Enrico Ros 333c318a62 AIX: Anthropic: parse code exec/bash failures 2026-03-28 06:55:49 -07:00
Enrico Ros 5f6f7086d0 AIX: Anthropic: trace bash_code_execution ops 2026-03-28 06:55:49 -07:00
Enrico Ros a7495bd4cf LLMs: document the llm-reset-no-duplicates behavior 2026-03-28 06:55:49 -07:00
Enrico Ros 76c4919e9c LLMs: when resetting parameters, preserve the ones on User Clones 2026-03-28 01:42:18 -07:00
Enrico Ros 5530a0253e AIX: Gemini: wires: safer 2026-03-28 01:00:44 -07:00
Enrico Ros 86aaa65d10 CC: code:grep-history 2026-03-28 00:07:58 -07:00
Enrico Ros 65bf147e04 DMessage/AIX: ToolInvocationResponse parts and AIX ToolResponsePart['FunctionCallResponse'] must carry the function name 2026-03-27 22:25:53 -07:00
Enrico Ros f76ad186f0 AIX: Wires: FIX: fix aix.wires function name (_name was silently stripped) 2026-03-27 21:51:19 -07:00
Enrico Ros e5e333db70 AIX: Gemini: improve tool hosted/user cohexistence 2026-03-27 21:51:19 -07:00
Enrico Ros ddee08c2da AIX: Gemini: improve wires 2026-03-27 21:51:19 -07:00
Enrico Ros 93b7686f18 AIX: Gemini: Hosted (Server) Tools Notifications 2026-03-27 21:49:15 -07:00
Enrico Ros e61e9626e2 AIX: Reassembler: Fix: PH termination on aix client only iof active 2026-03-27 21:49:15 -07:00
Enrico Ros 3c6bfe0152 BlockPartPlaceholder: (small) ops not on zen 2026-03-27 21:49:14 -07:00
Enrico Ros e4fc44bc9c LLMs: Gemini: sync models 2026-03-27 21:45:50 -07:00
Enrico Ros 51e23ad3a4 Bits 2026-03-27 06:48:36 -07:00
Enrico Ros 5ebbe45a63 BlockPartPlaceholder: improve incomplete 2026-03-27 06:31:06 -07:00
Enrico Ros 6df276d51d DMessage: remove pendingIncomplete on duplication 2026-03-27 06:06:23 -07:00
Enrico Ros f811500b60 AIX: Trace Hosted Operations 2026-03-27 05:42:35 -07:00
Enrico Ros 2b51605c18 ChatFragments: rendering of trees of operations in 'vp' 2026-03-27 05:35:57 -07:00
Enrico Ros 513b840b47 ChatFragments: VoidPlaceholder with alternative presentations - improve a single model op to a list of ops 2026-03-27 05:29:51 -07:00
Enrico Ros d94c8c8a3b AIX: aix-info/aix-retry-reset 2026-03-27 05:11:58 -07:00
Enrico Ros 3dd641a398 AIX: Anthropic: small protocol doc 2026-03-27 05:03:02 -07:00
Enrico Ros 8e545f1738 AIX: OpenAI: protocol bits 2026-03-27 05:00:41 -07:00
Enrico Ros 2a12597567 AIX: OpenAI: parse code execution images 2026-03-27 04:45:54 -07:00
Enrico Ros e003683040 Chat converters: only convert the last VP to E
Other placeholders can stay they're void anyway, however
converting the last allows to signal incomplete in-progress ops
2026-03-27 04:31:25 -07:00
Enrico Ros 0338b3d2e9 AIX: client error: describe Failed to fetch
This happens when for instance Chrome queues up a connection, and while still queued
the WiFi goes off.
2026-03-27 04:20:59 -07:00
Enrico Ros 5d5bc403c4 Wire: cleanup wire print 2026-03-27 04:16:53 -07:00
Enrico Ros b646149980 AIX: misc client.errors notice 2026-03-27 04:16:47 -07:00
Enrico Ros 1e7e8ac632 LLMParametersEditor: improve code-based Antropic web refiner 2026-03-27 04:16:28 -07:00
Enrico Ros 309786e01e ChatDrawerItem: round frequency to 1 decimal, if fractional 2026-03-25 13:39:43 -07:00
Enrico Ros 08e3caf8c2 Logger: ignore defaultPrevented 2026-03-25 03:52:05 -07:00
Enrico Ros 21b68d7660 Update canonic 2026-03-25 00:14:16 -07:00
Enrico Ros 4986c61b2a Composer: remove the alt+enter hint 2026-03-24 22:47:36 -07:00
Enrico Ros 801479cb5c Composer: remove the ctrl+enter hint 2026-03-24 22:45:12 -07:00
Enrico Ros 1d18e21018 CC: update release-open 2026-03-24 22:44:22 -07:00
Enrico Ros 4c329a8f51 AIX: Anthropic: Parser: ignore server-side recovered errors, #1010 2026-03-24 22:10:50 -07:00
Enrico Ros 1eb4eeea42 2.0.4: update readme 2026-03-24 19:17:52 -07:00
Enrico Ros 5ca094111c 2.0.4: update news (removing old beam callout) 2026-03-24 19:16:04 -07:00
Enrico Ros 4ce4202750 2.0.4: update package 2026-03-24 19:03:28 -07:00
Enrico Ros 4873c0c390 Json-ld: OS 2026-03-24 15:33:56 -07:00
Enrico Ros 351a28f34f Json-ld: ALTS 2026-03-24 14:50:55 -07:00
Enrico Ros a2e99ed84f Big-AGI: descs 2026-03-24 13:05:51 -07:00
Enrico Ros 7d2a26ab66 Roll AIX 2026-03-24 12:59:59 -07:00
Enrico Ros 94268187f1 Big-AGI: Capitalize 2026-03-24 12:36:08 -07:00
Enrico Ros 5aafa98f1c README: remove expired link 2026-03-24 12:33:31 -07:00
Enrico Ros c42c34acb4 KB: adding LLM vendors 2026-03-24 11:56:28 -07:00
Enrico Ros f052963da3 Md cleanup 2026-03-24 11:53:01 -07:00
Enrico Ros 07fa93609d CC: allow head|tail 2026-03-24 11:38:29 -07:00
Enrico Ros cbef9e5a57 BlockPartPlaceholder: slight render change 2026-03-23 18:59:10 -07:00
Enrico Ros 0b342339d4 AIX/Fragments: preserve placeholder location 2026-03-23 18:59:06 -07:00
Enrico Ros 9de3d5a26f AIX: Anthropic: parser: bits 2026-03-23 18:58:58 -07:00
Enrico Ros 78878076c2 errorUtils: add convenience fucntion for proper signal abort() 2026-03-23 17:55:06 -07:00
Enrico Ros 65cca958a6 AIX: Transmitter: show dialect 2026-03-23 17:51:59 -07:00
Enrico Ros 19263f8494 AIX: CG Exeuctor: Continuation ephemeral notice. #1010 2026-03-23 17:28:27 -07:00
Enrico Ros 5f71cbed47 AIX: CG Exeuctor: Continuation framework for Anthropic. #1010, #1005 2026-03-23 17:28:27 -07:00
Enrico Ros fe93a66d3b AIX: CG Exeuctor: rename to operation retry signal 2026-03-23 17:27:45 -07:00
Enrico Ros aa3b451e00 AIX: CG Exeuctor: slight rename 2026-03-23 17:27:45 -07:00
Enrico Ros ca245bf8b8 AIX: Retriers: cleanup name 2026-03-23 17:27:45 -07:00
Enrico Ros 9868068cd6 AIX: Anthropic: disable the fix for reusing blocks (seems to have been fixed upstream now) 2026-03-23 17:27:37 -07:00
Enrico Ros 5fd27629d0 idUtils: safer fallback for browser not having the crypto function (shall NEVER happen, but people may deploy on HTTP connections). Fixes #1034 2026-03-23 13:47:29 -07:00
Enrico Ros 4bfc7636c9 Beam: Merge: perform merges discarding the reasoning fragments if the policy says so. Fixes #1042 2026-03-23 13:36:58 -07:00
Enrico Ros 305a7784ee ChatThinkingPolicy: backport. #1042 2026-03-23 13:15:07 -07:00
Enrico Ros 87ecc11661 Allow for 2 Gemini vendors. Fixes #1045 2026-03-23 12:36:12 -07:00
Enrico Ros 0faf5d5957 Roll AIX 2026-03-21 19:51:58 -07:00
Enrico Ros 55d7ebd804 AIX/LLMS: Anthropic: Dynamic Web Filtering 2026-03-21 19:51:30 -07:00
Enrico Ros 842b5b96c2 AIX: Anthropic: parser: cleanup 2026-03-21 18:53:48 -07:00
Enrico Ros b07fc759c2 AIX: Anthropic: wires: update with new API features and tools
- tools allowed callers for client and server
- all tool definitions common options
- new code_execution, web_fetch, web_search tools
- top-level cache_contol
- thinking with disabled summaries for speed
- message updates with container variants
-fix tool_search_tool results
2026-03-21 18:53:48 -07:00
Enrico Ros 0afa70aaab System Theme: partially revert c8a33a06 to keep the default to the light mode 2026-03-21 16:14:32 -07:00
Enrico Ros c2cf93bf1a Events: remove dead code 2026-03-21 16:12:13 -07:00
Enrico Ros 88639b8b57 AttachmentSources: raise popups 2026-03-21 16:12:13 -07:00
Enrico Ros bfecc63d0d CC: allow select eslint tsc 2026-03-21 16:12:13 -07:00
Enrico Ros 20bea327e4 AIX: Anthropic: stremaing FC parser edge case 2026-03-21 16:12:13 -07:00
Enrico Ros 1e5c26b490 AIX: Anthropic: fix double newline elision post start 2026-03-21 16:12:13 -07:00
Enrico Ros d9183c9658 LLMs: xAI: add Grok 4.20 models, including multi-agent 2026-03-21 16:12:13 -07:00
Enrico Ros 3ecbbc3b70 LLMs: OpenAI: sweep align (add images support on select models) 2026-03-21 16:12:13 -07:00
Enrico Ros 1c1d21eed7 Sweep: update OpenAI params (more image supports) 2026-03-21 16:12:13 -07:00
Enrico Ros 6129971bb2 LLMs: OpenAI: add 5.4 mini/nano 2026-03-21 16:12:13 -07:00
Enrico Ros 8a3d75f077 Merge pull request #1033
feat(ui): add system theme mode for dark mode controls
2026-03-21 16:11:56 -07:00
Enrico Ros 9c249b513f Merge pull request #1041 from dLo999/fix/issue-1037-export-filename-local-time
fix: use local time for flash backup export filename (#1037)
2026-03-21 15:48:20 -07:00
Dustin 04d3fe6e99 fix: use local time for flash backup export filename (#1037)
Replace inline toISOString() with prettyTimestampForFilenames(false)
to match the other two export options that already use local time.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-20 08:04:50 -07:00
Enrico Ros ea7283b96e Merge pull request #1028 from enricoros/dependabot/github_actions/actions/download-artifact-8.0.1
chore(deps): bump actions/download-artifact from 8.0.0 to 8.0.1
2026-03-18 22:24:20 -07:00
Enrico Ros 295fc111c4 Expander: update 2026-03-18 02:33:36 -07:00
Enrico Ros 58d73d5d81 ModelsList: show Code designation as well. Fixes #1039 2026-03-17 22:07:43 -07:00
Enrico Ros fd8ce2e99a model.domains.registry: do not include a model name. Fixes #1038 2026-03-17 22:07:43 -07:00
blacksuan19 c8a33a06fa feat(ui): add system theme mode for dark mode controls
- default Joy color scheme to system
- cycle theme control through light, dark, and system modes
- update labels and icons to reflect the active theme preference

Signed-off-by: blacksuan19 <abubakaryagob@gmail.com>
2026-03-15 20:18:51 -05:00
Enrico Ros 874be92a56 ChatDrawer: include current chat, if missing 2026-03-14 16:00:48 -07:00
Enrico Ros 6bdb01e3c5 BlockOpOptions: allow spaces after the bold 2026-03-14 14:47:41 -07:00
dependabot[bot] ba03ab3aa8 chore(deps): bump actions/download-artifact from 8.0.0 to 8.0.1
Bumps [actions/download-artifact](https://github.com/actions/download-artifact) from 8.0.0 to 8.0.1.
- [Release notes](https://github.com/actions/download-artifact/releases)
- [Commits](https://github.com/actions/download-artifact/compare/70fc10c6e5e1ce46ad2ea6f2b72d43f7d47b13c3...3e5f45b2cfb9172054b4087a40e8e0b5a5461e7c)

---
updated-dependencies:
- dependency-name: actions/download-artifact
  dependency-version: 8.0.1
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-03-14 11:32:12 +00:00
Enrico Ros 3d554e513d PostHog: more proper way to disable /flags refresh 2026-03-14 00:14:56 -07:00
Enrico Ros e516b9dae9 PostHog: we don't use Feature Flags - stop them 2026-03-14 00:06:32 -07:00
Enrico Ros 281d5a611e BlockOpOptions: support numbered lists 2026-03-13 14:10:52 -07:00
Enrico Ros 03eec23efe BlockOpOptions: supports bold options 2026-03-13 14:02:31 -07:00
Enrico Ros e3d01f6615 Reverting 61a60c5b: "Markdown: bundle in main chunk instead of lazy-loading" because of bundle size (for now) 2026-03-13 13:49:48 -07:00
Enrico Ros 99e15333cb Roll posthog again 2026-03-13 13:47:07 -07:00
Enrico Ros 5efd16c060 LLMs: LocalAI/Ollama/LMStudio: always allow CSF 2026-03-13 12:58:30 -07:00
Enrico Ros b4a6c80d8c Composer: correct browsing flag 2026-03-13 12:37:31 -07:00
Enrico Ros 7991920f08 Attachments: show disabled 2026-03-13 12:37:17 -07:00
Enrico Ros a113b8223b Roll deps 2026-03-13 12:25:24 -07:00
Enrico Ros 7bb720a903 Beam: Fusion: fix stop/stage 2026-03-13 04:00:55 -07:00
Enrico Ros 515de2679e InlineTextarea: size support 2026-03-13 01:57:59 -07:00
Enrico Ros 38caacf816 Expander component, externally controllable 2026-03-13 00:47:30 -07:00
Enrico Ros 676b0537e6 ChatMessage: chat/words count 2026-03-12 23:15:56 -07:00
Enrico Ros a24341cda6 Sel highlighter: export type 2026-03-12 23:15:54 -07:00
Enrico Ros d937bc246a AppChat: filter by open beam (support) 2026-03-12 21:45:40 -07:00
Enrico Ros 5d2543131a selHighlighter: cut also copies 2026-03-12 21:42:54 -07:00
Enrico Ros ca5d6872b5 clipboardUtils: improve dom copy 2026-03-12 21:42:51 -07:00
Enrico Ros a97ce26072 Replace PhTreeStructure for diagrams 2026-03-12 19:55:29 -07:00
Enrico Ros c698f78f92 FormRadioControl: fix hierarchy 2026-03-12 17:50:56 -07:00
Enrico Ros 77782a63eb Radio Controls: support tooltips 2026-03-12 16:35:56 -07:00
Enrico Ros 41e1e44ef0 TooltipOutlined: support size 2026-03-12 16:35:54 -07:00
Enrico Ros 7b1fc56320 LLMs: Deepseek: misc comment 2026-03-12 15:03:06 -07:00
Enrico Ros c0ed41a529 llms.parameters: find Spec and TS fix 2026-03-12 15:03:06 -07:00
Enrico Ros ba47fe1cfe AttachmentSources: strings again 2026-03-12 04:10:05 -07:00
Enrico Ros f1356d8fdc AttachmentSources: optimize RichMenuItem 2026-03-12 04:10:05 -07:00
Enrico Ros 7a899c538f Sources: bits 2026-03-12 01:28:57 -07:00
Enrico Ros 3daac973b1 AttachmentSources: tooltips on live 2026-03-11 15:17:53 -07:00
Enrico Ros b0ec5f7459 Attachments: add live types 2026-03-10 23:12:36 -07:00
Enrico Ros 71d6868512 AttachmentSources: bits 2026-03-10 23:12:36 -07:00
Enrico Ros 605bb83eb3 Components: add MediaStreamPreview 2026-03-10 23:12:36 -07:00
Enrico Ros 3092e02ce9 DBlobs: allow attachment image on destination scope (rather than moving it later) 2026-03-10 23:12:36 -07:00
Enrico Ros 5d82374975 DBlobs: GC: debug option 2026-03-10 23:12:36 -07:00
Enrico Ros ab4d63e596 screenCaptureUtils: export stream 2026-03-10 17:16:16 -07:00
Enrico Ros f800bb8dae CameraCaptureModal: open with options 2026-03-10 17:16:16 -07:00
Enrico Ros 18862c0ff4 Fragments: set origin Id in place 2026-03-10 11:32:10 -07:00
Enrico Ros 3765e8c69e Fragments: set origin Id 2026-03-10 11:28:58 -07:00
Enrico Ros 70d54a9aa3 Labs: option to skip image compression. Fixes #1024 2026-03-10 01:24:24 -07:00
Enrico Ros 50c6ee69af FormSwitchControl: pass through tooltipWarning 2026-03-10 01:05:49 -07:00
Enrico Ros dd2532e269 AttachmentSources: allow external menu button 2026-03-10 00:42:16 -07:00
Enrico Ros 16a54b3452 Audio: catch low-level errors 2026-03-10 00:08:21 -07:00
Enrico Ros 8373c1c785 AudioPlayer: make them cancelable & renames 2026-03-09 23:37:14 -07:00
Enrico Ros 39beda5519 revert AudioPlayer reason changes 2026-03-09 22:45:10 -07:00
Enrico Ros c7d1eae327 Speex: voice url preview with cancelation 2026-03-09 22:33:57 -07:00
Enrico Ros ec81e2ff5b AudioPlayer: pre-open 2026-03-09 22:33:57 -07:00
Enrico Ros 697090b695 AIX: Reassembler: audio player 2026-03-09 22:13:36 -07:00
Enrico Ros 8680fcc3db Image rendering: view on click 2026-03-09 21:30:59 -07:00
Enrico Ros 233037edd2 RenderImageRefDBlob: only regen if prompt is present 2026-03-09 21:29:38 -07:00
Enrico Ros 81c3251c6e AIX: Gemini: small note 2026-03-09 21:29:35 -07:00
Enrico Ros dc0fe7f4ca Beam Briefinx/Speex: use speakText with the rpc audio hint 2026-03-09 17:08:47 -07:00
Enrico Ros 2c9c0f2e0b Merge pull request #1019 from enricoros/dependabot/github_actions/docker/login-action-4.0.0
chore(deps): bump docker/login-action from 3.7.0 to 4.0.0
2026-03-09 01:20:51 -07:00
Enrico Ros 9c3fb9aadb Merge pull request #1018 from enricoros/dependabot/github_actions/docker/build-push-action-7.0.0
chore(deps): bump docker/build-push-action from 6.19.2 to 7.0.0
2026-03-09 01:20:43 -07:00
Enrico Ros de37ac2c51 Merge pull request #1017 from enricoros/dependabot/github_actions/docker/metadata-action-6.0.0
chore(deps): bump docker/metadata-action from 5.10.0 to 6.0.0
2026-03-09 01:20:35 -07:00
Enrico Ros d6b57702bd Merge pull request #1016 from enricoros/dependabot/github_actions/docker/setup-buildx-action-4.0.0
chore(deps): bump docker/setup-buildx-action from 3.12.0 to 4.0.0
2026-03-09 01:20:25 -07:00
dependabot[bot] d94642c29f chore(deps): bump docker/login-action from 3.7.0 to 4.0.0
Bumps [docker/login-action](https://github.com/docker/login-action) from 3.7.0 to 4.0.0.
- [Release notes](https://github.com/docker/login-action/releases)
- [Commits](https://github.com/docker/login-action/compare/c94ce9fb468520275223c153574b00df6fe4bcc9...b45d80f862d83dbcd57f89517bcf500b2ab88fb2)

---
updated-dependencies:
- dependency-name: docker/login-action
  dependency-version: 4.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-03-07 11:32:35 +00:00
dependabot[bot] 75378ea88f chore(deps): bump docker/build-push-action from 6.19.2 to 7.0.0
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 6.19.2 to 7.0.0.
- [Release notes](https://github.com/docker/build-push-action/releases)
- [Commits](https://github.com/docker/build-push-action/compare/10e90e3645eae34f1e60eeb005ba3a3d33f178e8...d08e5c354a6adb9ed34480a06d141179aa583294)

---
updated-dependencies:
- dependency-name: docker/build-push-action
  dependency-version: 7.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-03-07 11:32:31 +00:00
dependabot[bot] d539c1369b chore(deps): bump docker/metadata-action from 5.10.0 to 6.0.0
Bumps [docker/metadata-action](https://github.com/docker/metadata-action) from 5.10.0 to 6.0.0.
- [Release notes](https://github.com/docker/metadata-action/releases)
- [Commits](https://github.com/docker/metadata-action/compare/c299e40c65443455700f0fdfc63efafe5b349051...030e881283bb7a6894de51c315a6bfe6a94e05cf)

---
updated-dependencies:
- dependency-name: docker/metadata-action
  dependency-version: 6.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-03-07 11:32:27 +00:00
dependabot[bot] 555ee6f333 chore(deps): bump docker/setup-buildx-action from 3.12.0 to 4.0.0
Bumps [docker/setup-buildx-action](https://github.com/docker/setup-buildx-action) from 3.12.0 to 4.0.0.
- [Release notes](https://github.com/docker/setup-buildx-action/releases)
- [Commits](https://github.com/docker/setup-buildx-action/compare/8d2750c68a42422c14e847fe6c8ac0403b4cbd6f...4d04d5d9486b7bd6fa91e7baf45bbb4f8b9deedd)

---
updated-dependencies:
- dependency-name: docker/setup-buildx-action
  dependency-version: 4.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-03-07 11:32:23 +00:00
Enrico Ros ad989d8a0b CameraCaptureModal: improve multi-attach 2026-03-06 19:11:50 -08:00
Enrico Ros aae7af4713 useCameraCapture: vastly improve state, flow, remove race conditions, add detach 2026-03-06 17:53:01 -08:00
Enrico Ros df0a204767 CameraCaptureModal: full promised control 2026-03-06 16:36:34 -08:00
Enrico Ros 5cdefc7b5e AttachmentSources: live streams support 2026-03-06 15:04:18 -08:00
Enrico Ros c1bdb1fc61 Merge pull request #1014 from enricoros/claude/issue-1013-20260306-1801
feat: add Ctrl+( / Ctrl+) shortcuts to toggle left drawer and right panel
2026-03-06 10:13:33 -08:00
claude[bot] dde22a080b feat: add Ctrl+( / Ctrl+) shortcuts to toggle left drawer and right panel
Add keyboard shortcuts for toggling left drawer (Ctrl+() and right panel
(Ctrl+)). Also adds a reusable `skipIfInput` flag on ShortcutObject that
skips shortcuts when a text input, textarea, or contenteditable element
(or child thereof) is focused - not applied to these layout shortcuts but
available for future use.

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2026-03-06 18:05:06 +00:00
Enrico Ros 7f5ff30f97 Speex: unmarkdown 2026-03-05 19:16:54 -08:00
Enrico Ros 38e1708e91 AIX: Gemini: Parser: improve finish reason reporting 2026-03-05 18:36:12 -08:00
Enrico Ros fe4e755304 AIX: Dispatch: nit 2026-03-05 18:36:09 -08:00
Enrico Ros 67f1c87d3a AIX: OpenAI Responses: infer image type 2026-03-05 18:36:09 -08:00
Enrico Ros eef88ffae2 AIX: OpenAI Responses: Queued 2026-03-05 18:36:08 -08:00
Enrico Ros 319965c55c FormChipGroupControl: must stretch 2026-03-05 18:36:05 -08:00
Enrico Ros 1f309b5c81 Speex: future northbridge nav 2026-03-05 16:55:58 -08:00
Enrico Ros 5273352ae9 Speex: Engine: pass labels 2026-03-05 16:45:59 -08:00
Enrico Ros 5a48256d77 AIX: OpenAI: small fixes 2026-03-05 16:45:46 -08:00
Enrico Ros 1d41294c1d LLMs/Sweep: OpenAI GPT-5.4, -Pro, and non-thinking (with temperature control) 2026-03-05 16:27:55 -08:00
Enrico Ros ff76229706 LLMs: Bedrock: respell 2026-03-04 22:13:07 -08:00
Enrico Ros b0f4b30ebe ChipGroupControl: single chip multiple options 2026-03-04 16:31:31 -08:00
Enrico Ros 7be8f6c6a7 OptimaPanelGroupedList: absorb collapsed pad 2026-03-04 16:28:28 -08:00
Enrico Ros b003993961 No mdashes in comments 2026-03-04 14:29:22 -08:00
Enrico Ros 4878f361b5 CLAUDE.md: no emdashes 2026-03-04 14:27:56 -08:00
Enrico Ros a82a3899c5 Beam: strip reasoning traces per user's thinking policy. Fixes #1003 2026-03-04 13:28:05 -08:00
Enrico Ros ff0685e6e8 Nit 2026-03-04 13:19:24 -08:00
Enrico Ros a597489526 Merge pull request #1011 from Blacksuan19/fix-sherpa-ssr
store-logic-sherpa: guard usage count increment against SSR
2026-03-04 13:03:07 -08:00
Enrico Ros 32e8890f62 LLMs: Sync Sweep params 2026-03-04 12:44:50 -08:00
Enrico Ros 211a43eab4 Parameters sweep: 2026-03-04.2 2026-03-04 12:42:10 -08:00
Enrico Ros 8c28df77cc Parameters sweep: resorting 2026-03-04 12:23:22 -08:00
Enrico Ros 4e82a12899 AIX: Gemini: Disable URL Context for Nano Banana models 2026-03-04 12:20:04 -08:00
Enrico Ros 8d0e0dea89 Parameters sweep: 2026-03-04 2026-03-04 12:09:13 -08:00
Enrico Ros 5703f23b99 Roll AIX 2026-03-04 11:37:46 -08:00
Enrico Ros 196d08b4fd CLAUDE.md: try stopping compound 2026-03-04 11:37:38 -08:00
Enrico Ros 2f9738f6fb LLMs: Gemini: Nano Banana 2 (aka 3.1 flash image) and 3.1 Flash-Lite 2026-03-04 11:34:51 -08:00
Enrico Ros d4db225d1e LLMs: OpenAI: remove shut down 2026-03-04 11:30:10 -08:00
Enrico Ros efff785713 LLMs: OpenAI: 5.3 Instant 2026-03-04 11:29:40 -08:00
Enrico Ros 234accad3f LLMs: ANT: Sync retired 2026-03-04 11:15:57 -08:00
blacksuan19 588b4b2c64 store-logic-sherpa: guard usage count increment against SSR
The useLogicSherpaStore.setState() call at module level ran during
server-side rendering where localStorage is unavailable, causing a
hydration crash. Wrap with isBrowser so it only executes in the
browser context.

Signed-off-by: blacksuan19 <abubakaryagob@gmail.com>
2026-03-04 12:49:46 -06:00
Enrico Ros 7de34d8478 InReferenceToBubble: fix h-compression 2026-03-03 23:46:42 -08:00
Enrico Ros 741980adfc Allow new attachments for previous messages in a chat. Fixes #945 2026-03-03 20:18:07 -08:00
Enrico Ros 2690380bfd ChatMessage: support changing attachments in mesages. #945 2026-03-03 18:43:12 -08:00
Enrico Ros b482b07335 Composer: use the standard Attachment hanlders 2026-03-03 18:43:06 -08:00
Enrico Ros 03b4c6f941 Attachments: standard handlers 2026-03-03 18:43:06 -08:00
Enrico Ros b7fd1b13de Remove setLabsEnhanceCodeLiveFile 2026-03-03 10:47:02 -08:00
Enrico Ros 10a6f2d3c7 Rename getLabsHighPerformance 2026-03-03 10:03:21 -08:00
Enrico Ros ba149d3b43 Remove labsEnhanceCodeBlocks - always on now 2026-03-03 10:03:08 -08:00
Enrico Ros f175d071c4 Remove labsShowCosts - always on now 2026-03-03 10:00:16 -08:00
Enrico Ros 874d0bca05 Attachments: by default use the Menu on desktop, not the inlines 2026-03-03 09:53:50 -08:00
Enrico Ros 81ad0328b7 Remove labsAttachScreenCapture/labsCameraDesktop - always on now 2026-03-03 09:53:50 -08:00
Enrico Ros 5198fa66cf Attachments: consolidated/unified menu 2026-03-03 09:53:50 -08:00
Enrico Ros a807bdd6b6 InlineTextArea: remove the alt key - only usage 2026-03-02 21:18:05 -08:00
Enrico Ros 2b209bb679 LLMParametersEditor: improve config. Fixes #1004 2026-03-02 20:04:02 -08:00
Enrico Ros 2f018dce9f AIX: do not set a default fox max anymore - as the underlying APIs may change and it's a user param now. #1004 2026-03-02 20:03:33 -08:00
Enrico Ros 2eb77f532a FormNumberInput: add number|undefined input 2026-03-02 20:03:30 -08:00
Enrico Ros 69063bb544 ExpanderControlledBox - allow compression (issue introduced by f21fe411 on the ChatPanelModelParameters with log model names) 2026-03-02 20:03:30 -08:00
Enrico Ros 7fad2f8790 LLMs/AIX: Parameters: Anthropic: max Fetch/Search depth. #1004 2026-03-02 14:58:46 -08:00
Enrico Ros 620275a1f5 Attachments: move GDrive/Web sources 2026-03-02 14:36:55 -08:00
Enrico Ros ba583fc448 Attachments: move buttons 2026-03-02 14:28:29 -08:00
Enrico Ros 0b96870644 Camera: share and rationalize use 2026-03-02 13:40:25 -08:00
Enrico Ros eb2b682eb5 Attachments: centralize components, make composible 2026-03-02 11:59:52 -08:00
Enrico Ros 577b52120a Update #984 2026-03-01 20:33:07 -08:00
Enrico Ros b69ae3edae Beam: raise max rays to 24, add 16 to presets. Fixes #1001 2026-03-01 20:30:43 -08:00
Enrico Ros 624b177996 Merge pull request #999 from enricoros/dependabot/github_actions/actions/upload-artifact-7.0.0
chore(deps): bump actions/upload-artifact from 6.0.0 to 7.0.0
2026-03-01 20:30:07 -08:00
Enrico Ros bbf01b49c0 Merge pull request #998 from enricoros/dependabot/github_actions/actions/download-artifact-8.0.0
chore(deps): bump actions/download-artifact from 7.0.0 to 8.0.0
2026-03-01 20:29:42 -08:00
Enrico Ros 86b2d8ae71 LLMs: Anthropic PowerPoint -> PPT 2026-03-01 15:41:07 -08:00
dependabot[bot] d18af42d43 chore(deps): bump actions/upload-artifact from 6.0.0 to 7.0.0
Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 6.0.0 to 7.0.0.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/b7c566a772e6b6bfb58ed0dc250532a479d7789f...bbbca2ddaa5d8feaa63e36b76fdaad77386f024f)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-version: 7.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-02-28 11:32:12 +00:00
dependabot[bot] 4f6e110bf9 chore(deps): bump actions/download-artifact from 7.0.0 to 8.0.0
Bumps [actions/download-artifact](https://github.com/actions/download-artifact) from 7.0.0 to 8.0.0.
- [Release notes](https://github.com/actions/download-artifact/releases)
- [Commits](https://github.com/actions/download-artifact/compare/37930b1c2abaa49bbe596cd826c3c89aef350131...70fc10c6e5e1ce46ad2ea6f2b72d43f7d47b13c3)

---
updated-dependencies:
- dependency-name: actions/download-artifact
  dependency-version: 8.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-02-28 11:32:08 +00:00
Enrico Ros 62cf334e2f AIX: Z.ai: handle their network errors 2026-02-28 02:12:02 -08:00
Enrico Ros 8bd6fd40fd Focus-mode for mobile 2026-02-28 01:59:16 -08:00
Enrico Ros f21fe41188 ExpanderControlledBox - fix lagging of content vs parent reveal
Instead of clipping on the Collapsee box, we just use it as the FR target
with a minHeight of 0; have the parent take the correct height, and clip all to the parent.
2026-02-28 01:29:08 -08:00
Enrico Ros cfff23164c Claude.md: CSF 2026-02-26 14:12:13 -08:00
Enrico Ros a8d9233dc4 Claude.md: improve in structure 2026-02-26 14:03:54 -08:00
Enrico Ros 9c973efbbf LLMs: Bedrock: support Converse API for Nova models 2026-02-26 03:39:44 -08:00
Enrico Ros e2c4255920 LLMs: Bedrock: hide inputs on prio 2026-02-26 02:52:47 -08:00
Enrico Ros e01b9ff6a9 LLMs: Bedrock: improve sort 2026-02-26 02:52:22 -08:00
Enrico Ros 0084a635f1 AIX: Debugger: fix URL display 2026-02-26 02:18:24 -08:00
Enrico Ros 0cd20b8d48 Update claude.md 2026-02-26 00:13:42 -08:00
Enrico Ros 7c4094b4c2 OpenAI Service config: rename provider when selecting the host 2026-02-25 23:52:44 -08:00
Enrico Ros acd8430d51 Models List: show free only 2026-02-25 23:50:15 -08:00
Enrico Ros 6ae2195d10 LLMs: add LLMAPI via OpenAI-Compatible and custom host. Fixes #993, Fixes #989. 2026-02-25 23:38:43 -08:00
Enrico Ros 6bcc0dd177 LLMs: Bedrock: auto-interfaces frmo model enumeration 2026-02-25 21:27:55 -08:00
Enrico Ros 2de42c2010 AIX/LLMs: Bedrock: support Mantle (OpenAI-compatible) including model enumeration. Fixes #965 2026-02-25 21:11:27 -08:00
Enrico Ros a231ccb492 LLMs: remove IF_OAI_Complete 2026-02-25 18:27:06 -08:00
Enrico Ros 35875d5837 AIX/LLMs: Bedrock: default to us-east-1 2026-02-25 17:13:59 -08:00
Enrico Ros c36ff1edfa AIX/LLMs: Bedrock: support Bedrock Long-term API Keys 2026-02-25 17:13:59 -08:00
Enrico Ros ed35d5b541 tRPC fetchers: improve local debug output 2026-02-25 17:13:59 -08:00
Enrico Ros 2b2a2d84a9 LLMs: Bedrock: report listModels issues up 2026-02-25 17:13:59 -08:00
Enrico Ros a645a4066c docs: bit 2026-02-25 17:13:58 -08:00
Enrico Ros 508a3beff7 CC: patch cd chaining 2026-02-25 14:26:37 -08:00
Enrico Ros df0c133056 AIX: OpenAI: fix return code 2026-02-24 23:25:06 -08:00
Enrico Ros 2da3942ce2 LLMs: OpenAI: Update models 2026-02-24 23:24:32 -08:00
Enrico Ros 26547dec0d Docs: update 2026-02-24 22:56:00 -08:00
Enrico Ros aa4804bdd5 Docs: update for bedrock 2026-02-24 22:46:00 -08:00
Enrico Ros eafa1f02cb AIX: Bedrock: update msg 2026-02-24 21:53:17 -08:00
Enrico Ros 836533a8c2 AIX: Bedrock: update icon 2026-02-24 21:49:30 -08:00
Enrico Ros cfeb134c20 AIX: Bedrock: disclaimer about unsupported functionality 2026-02-24 21:44:01 -08:00
Enrico Ros 35798b5568 AIX: Bedrock: bolster transformer 2026-02-24 21:43:47 -08:00
Enrico Ros 7a250f0848 AIX: Bedrock: chat generate. #965, #170, #980 2026-02-24 21:05:51 -08:00
Enrico Ros 0a4e6d5142 AIX: Anthropic: reuse model to beta 2026-02-24 20:45:22 -08:00
Enrico Ros f4254a5ffb LLMs: Bedrock: list models. #965 2026-02-24 20:35:45 -08:00
Enrico Ros 7b7718e578 LLMs: Anthropic: review headers 2026-02-24 20:35:39 -08:00
Enrico Ros c261b2b156 Bedrock: sigining utility (client and server compatible) 2026-02-24 17:44:24 -08:00
Enrico Ros 237065553e AIX: Anthropic: make beta headers reusable 2026-02-24 17:44:24 -08:00
Enrico Ros 6116af42df AIX: make createChatGenerateDispatch async 2026-02-24 17:44:24 -08:00
Enrico Ros 08b28cfde8 LLMs: IModelVendor: slight csf mention 2026-02-24 17:26:00 -08:00
Enrico Ros b019655518 LLMs: listModels: update dispatch 2026-02-24 17:14:40 -08:00
Enrico Ros 1264a2ebaf Icons: crab svg 2026-02-24 16:32:37 -08:00
Enrico Ros 1960b4f618 Wire: bits 2026-02-24 16:32:14 -08:00
Enrico Ros c75fbd89e6 Shortcuts: new symbols 2026-02-23 22:38:55 -08:00
Enrico Ros 3e67201665 Shortcuts: new modal 2026-02-23 22:34:52 -08:00
Enrico Ros b60e2bae65 LLM Params: bits2 2026-02-23 21:02:31 -08:00
Enrico Ros 19c7fa4285 LLM Params: bits 2026-02-23 20:58:56 -08:00
Enrico Ros f450dd3eac Models List: improve looks, content 2026-02-23 20:58:41 -08:00
Enrico Ros d366cdd542 BlockPartModelAux: render markdown and buttons appear at the end 2026-02-23 20:24:12 -08:00
Enrico Ros c1ba83fddb ViewDocPartModal/RenderCodePanelFrame: fix properties render on mobile (ellipsize) 2026-02-23 20:12:33 -08:00
Enrico Ros 617d6038b1 LLMs: LocalAI: restore n+1 render 2026-02-23 20:08:53 -08:00
Enrico Ros 0abee15c30 LLMs: LocalAI: safer parsing 2026-02-23 19:57:34 -08:00
Enrico Ros 1aa2e68e4a Merge pull request #982 from enricoros/dependabot/github_actions/docker/build-push-action-6.19.2
chore(deps): bump docker/build-push-action from 6.18.0 to 6.19.2
2026-02-23 15:49:53 -08:00
Enrico Ros cd692218ce Bits 2026-02-23 15:00:15 -08:00
Enrico Ros a5b7191185 DEV Mode: fully remove 2026-02-23 15:00:15 -08:00
Enrico Ros 56baba4cae DEV Mode: remove hardcoded leftover 2026-02-23 15:00:15 -08:00
Enrico Ros b696447be4 DEV Mode: graduated streaming 2026-02-23 15:00:15 -08:00
Enrico Ros e1ef2e72d7 ModelsList: Modal Submenus + DC-all config 2026-02-23 15:00:14 -08:00
Enrico Ros e85905e63c AIX Inspector: option to disable streaming for the current session. #980 2026-02-23 15:00:14 -08:00
Enrico Ros c6208a2900 CSF: global DC status 2026-02-23 12:14:04 -08:00
Enrico Ros 01299e4f19 CloseablePopup: workaround to keep the popup 2026-02-23 12:14:04 -08:00
Enrico Ros 1771575641 LLMs: services: type fix 2026-02-23 12:14:03 -08:00
Enrico Ros 88a796fd87 Tools: sweep: sync openai 2026-02-19 19:00:36 -08:00
Enrico Ros e403467d6d LLMs: Gemini 3.1 Pro. Fixes #987 2026-02-19 19:00:06 -08:00
Enrico Ros 1914a2a8a3 Tools: sweep: add sweeps for oai-thinking-depentent-temp 2026-02-18 17:19:37 -08:00
Enrico Ros 683892afef Tools: sweep: disable the no-temperature fix, as by default we don't set it, and it prevents our sweep with it 2026-02-18 17:19:37 -08:00
Enrico Ros 470f8aab70 LLMs: Together updates 2026-02-18 17:19:36 -08:00
Enrico Ros 7a561d6b42 LLMs: OpenPipe updates 2026-02-18 17:19:36 -08:00
Enrico Ros affff0df4a LLMs: Groq updates 2026-02-18 17:19:36 -08:00
Enrico Ros f5a81bdc94 LLMs: Gemini small updates 2026-02-18 17:19:36 -08:00
Enrico Ros 818ed53b53 LLMs: Sweep Alignment 2026-02-18 17:19:36 -08:00
Enrico Ros 12c875f4e3 AIX: OpenAI responses: fix for the older Deep Research models 2026-02-18 17:19:33 -08:00
Enrico Ros 6ff715c0f0 AIX: aixChatGenerateContent_DMessage_FromConversation: classify an errored outcome when the message is interrupted 2026-02-18 17:19:31 -08:00
Enrico Ros c4a89822d8 LLMs: typo 2026-02-18 15:51:18 -08:00
Enrico Ros a8a917f786 Roll AIX 2026-02-18 15:35:44 -08:00
Enrico Ros 3aa9a71a4b LLM Effort: split definition for UI namings with unified backend. #940 2026-02-18 14:55:00 -08:00
Enrico Ros 3758612ed6 LLMs: improve (Registry's) initialValue 2026-02-17 23:49:30 -08:00
Enrico Ros b71a4265f8 LLMs: dissolve requiredFallback 2026-02-17 23:07:55 -08:00
Enrico Ros 870cdb67cf Tools: sweep: update script and results 2026-02-17 22:21:03 -08:00
Enrico Ros 902c9dc3f4 AIX/LLMs: support search disablement client/server correctly 2026-02-17 22:20:59 -08:00
Enrico Ros 0d1db0a360 AIX: OpenAI Responses: remove forcing of no temperature, LLM_IF_HOTFIX_NoTemperature works well 2026-02-17 22:20:44 -08:00
Enrico Ros ddd784f041 LLM Effort: client-side domain check 2026-02-17 20:09:40 -08:00
Enrico Ros 830d45c06d LLM Effort: server-side dev check 2026-02-17 20:09:40 -08:00
Enrico Ros 6e27a31013 LLM Effort: Unified definition. #944, #940 2026-02-17 20:09:40 -08:00
Enrico Ros ed87595e17 LLMs: Anthropic: bit 2026-02-17 19:17:51 -08:00
Enrico Ros da01b59ae3 AIX: Anthropic: Effort is GA - no header needed 2026-02-17 19:17:51 -08:00
Enrico Ros 79046b808b AIX: Gemini: do not use alpha any longer 2026-02-17 19:17:51 -08:00
Enrico Ros 5a71153390 Custom Names: reset with warning. #970 2026-02-17 13:50:17 -08:00
Enrico Ros 94056cdf4b AutoBlocks: #983 option which does not improve things 2026-02-17 13:23:55 -08:00
Enrico Ros 41cb35c6b9 Custom Names: lingering. #970 2026-02-17 12:42:45 -08:00
Enrico Ros e133fc81f6 Custom Names: preserve. #970 2026-02-17 12:16:26 -08:00
Enrico Ros 418c2e496c LLMs: Anthropic: dMessageUtils 2026-02-17 12:01:46 -08:00
Enrico Ros 3690202b38 LLMs: Anthropic: Sonnet 4.6 2026-02-17 11:51:46 -08:00
Enrico Ros f069c2e5ab Fix: safe iteration over navItems.links in mobile nav
Fixes #984
2026-02-17 11:06:44 -08:00
dependabot[bot] 97bf6ca276 chore(deps): bump docker/build-push-action from 6.18.0 to 6.19.2
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 6.18.0 to 6.19.2.
- [Release notes](https://github.com/docker/build-push-action/releases)
- [Commits](https://github.com/docker/build-push-action/compare/263435318d21b8e681c14492fe198d362a7d2c83...10e90e3645eae34f1e60eeb005ba3a3d33f178e8)

---
updated-dependencies:
- dependency-name: docker/build-push-action
  dependency-version: 6.19.2
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-02-14 11:32:12 +00:00
Enrico Ros a1390b152f CC: .gitignore 2026-02-13 18:55:05 -08:00
Enrico Ros 4e8c7d46f6 Cleanup: remove ems 2026-02-13 18:44:35 -08:00
Enrico Ros 02944d2015 AIX: Add no-op method for setting provider infra label 2026-02-13 18:41:49 -08:00
Enrico Ros 58726f0425 AIX: OpenRouter: provider infra label 2026-02-13 17:30:26 -08:00
Enrico Ros 85f796fb1d AIX: ContentReassembler: note 2026-02-13 16:46:47 -08:00
Enrico Ros 311a9c2bf2 Roll AIX 2026-02-13 15:58:27 -08:00
Enrico Ros 6768917d44 Bits 2026-02-13 15:56:52 -08:00
Enrico Ros 7beb412738 AIX: Report broken messages. #980 2026-02-13 15:56:50 -08:00
Enrico Ros cf724625cc AIX: CSF: emulate tRPC's client-side abort as a response to the abortSignal being fired. #980
This is because the exception gets actually trapped locally in the deeper layers
due to client-side processing, which then created a particle for the abort,
which then is never used because the outer will discard it without notice
2026-02-13 15:56:49 -08:00
Enrico Ros f60b2410dd AIX: do not fake logical ends. #980 2026-02-13 15:56:07 -08:00
Enrico Ros bbdc16b06a LLMs: Together.AI: fix wire parser 2026-02-13 12:11:38 -08:00
Enrico Ros 0fa2d06725 AIX: logging: bits 2026-02-13 12:08:04 -08:00
Enrico Ros 36cdc4b55f AIX: Parser: capitalized STOP reason 2026-02-13 12:04:25 -08:00
Enrico Ros c2b4a50bfa AIX: Retriers: consolidated denylist 2026-02-13 12:02:05 -08:00
Enrico Ros 73f88d4715 AIX: OpenRouter: don't log on empty reasoning 2026-02-13 12:01:52 -08:00
Enrico Ros af919be2ac AIX: store end reason - for further debug. #980 2026-02-12 16:31:41 -08:00
Enrico Ros facffbc6c8 AIX: require clean connection ends. #980 2026-02-12 16:31:41 -08:00
Enrico Ros dd5b7cb8c2 AIX: dispatch: increase debugging vendor-initiated disconnect. #980 2026-02-12 14:12:25 -08:00
Enrico Ros 3dc61109d7 AIX: Server: debug recovered packets 2026-02-12 01:34:10 -08:00
Enrico Ros 9ef84260b0 Z.ai: no bits 2026-02-11 22:09:54 -08:00
Enrico Ros cf2df7d7f9 Z.ai: dMessageUtils 2026-02-11 22:09:27 -08:00
Enrico Ros 16a883526b Z.ai: readme 2026-02-11 17:44:33 -08:00
Enrico Ros 7b66b1a2eb Z.ai: readme 2026-02-11 17:44:22 -08:00
Enrico Ros a4adce5c79 Z.ai: AIX: fix reasoning effort 2026-02-11 17:43:23 -08:00
Enrico Ros 9e4174df53 Z.ai: AIX: fix dispatch 2026-02-11 17:36:04 -08:00
Enrico Ros b5975713a3 Z.ai: OCR does not support WebP 2026-02-11 17:26:23 -08:00
Enrico Ros 0cd04266b7 Z.ai: improve model spec 2026-02-11 17:26:23 -08:00
Enrico Ros 5cbd162454 Z.ai: Reasoning settings support 2026-02-11 17:26:23 -08:00
Enrico Ros bea1600358 AIX: OpenAI ChatCompletions: empty reasoning_content yields to non-empty content 2026-02-11 17:26:22 -08:00
Enrico Ros 6a2e201cf5 Z.ai: discovered + curated models support 2026-02-11 17:26:22 -08:00
Enrico Ros 960551933e Z.ai LLM vendor support
Note: we don't include server-side config anymore starting from this. To stress test the config system.
2026-02-11 17:26:22 -08:00
Enrico Ros 8b38b6416d Z.ai: icon & sprite 2026-02-11 17:26:22 -08:00
Enrico Ros fac4c39f48 Fix copying of message Sources. Fixes #977. Fixes #978. 2026-02-11 13:02:32 -08:00
Enrico Ros 4c930efbf0 Fix GC on Beams with reference collectors. 2026-02-11 12:59:36 -08:00
Enrico Ros 5a2a47cb87 AIX: Anthropic: Fast mode - unsupported message 2026-02-10 13:31:28 -08:00
Enrico Ros 4912a03250 LLMs: Anthropic: Fast mode research preview 2026-02-10 13:22:47 -08:00
Enrico Ros 3b13580613 LLMs: parameter-value-based enum price multipliers 2026-02-10 13:04:05 -08:00
Enrico Ros 95905113ac LLMs: cached isLLMChatFree_cached 2026-02-10 12:17:21 -08:00
Enrico Ros c6b34bb252 LLMs: Parameters: type guard enums 2026-02-10 11:53:31 -08:00
Enrico Ros e5387c2323 AIX: Moonshot: remove empty messages 2026-02-10 11:07:09 -08:00
Enrico Ros d3b4447669 CLAUDE.md: update 2026-02-10 01:51:56 -08:00
Enrico Ros d5c5eac9ec CC: allow git mv 2026-02-10 01:51:56 -08:00
Enrico Ros 49b61495d0 LLMs: Vendor Settings: unbreak hide advanced despite initially in CSF. Fixes #969 2026-02-09 23:30:41 -08:00
Enrico Ros e8298e9d30 workflows: CC: enable auth 2026-02-09 13:41:23 -08:00
Enrico Ros b29681e1f7 workflows: CC: cleanups 2026-02-09 13:30:35 -08:00
Enrico Ros 1e0b9a2f0c workflows: CC: do not trigger triage on assignment 2026-02-09 13:08:45 -08:00
Enrico Ros 442b8e95b1 workflows: CC: lock in the dm 2026-02-09 12:53:10 -08:00
Enrico Ros 27090d9e28 -Spaces 2026-02-09 05:41:55 -08:00
Enrico Ros c37b4fa076 Chat: option to discard all reasoning traces 2026-02-09 04:51:42 -08:00
Enrico Ros 83161bbe98 AIX: Anthropic: Parser: hotfix for 4.6 to elide the double-newline at the beginning when present 2026-02-09 04:50:14 -08:00
Enrico Ros 4b166120e6 AIX: Anthropic: Dispatch: hotFix for 4.6 interleaved reasoning blocks back-to-back 2026-02-09 04:23:28 -08:00
Enrico Ros 04494ac752 AIX: Anthropic: Dispatch: hotFix for empty text blocks produced by 4.6 - incoming from the Anthropic API 2026-02-09 04:23:27 -08:00
Enrico Ros 979809ddb1 AIX: Anthropic: Parser: rename hotFix 2026-02-09 04:23:26 -08:00
Enrico Ros 5d797c3339 AIX: Anthropic: warn if blocks come out of order, now that Anthropic has fixed it 2026-02-09 04:22:35 -08:00
Enrico Ros 2ff74f6b80 Wire: separate debug wire request and response 2026-02-09 04:22:33 -08:00
Enrico Ros 06b1195f9a workflows: CC: triade with workarounds: restore some 2026-02-09 01:48:51 -08:00
Enrico Ros c337b70a42 LLMs: Anthropic: copy 2026-02-09 01:40:11 -08:00
Enrico Ros 5047354892 CC: /code:review-inflight bits 2026-02-09 01:40:11 -08:00
Enrico Ros ce4e405fc6 workflows: CC: r/o triage 2026-02-09 01:40:11 -08:00
Enrico Ros 30c8d66cd1 workflows: CC: update model 2026-02-09 01:38:37 -08:00
Enrico Ros fb5c8aad29 workflows: CC: update dm 2026-02-09 01:38:19 -08:00
Enrico Ros 08d221d00f Attachments: Text: warn if empty 2026-02-08 17:31:40 -08:00
Enrico Ros af918178f6 Attachments: Markdown table conversion issue fallback 2026-02-08 17:31:40 -08:00
Enrico Ros ed19896e3c LLMs: llms.parameters: remove 'as const' 2026-02-08 17:31:39 -08:00
Enrico Ros 47ad135e4b CC: slashcommands: update-models catch-all 2026-02-08 17:27:08 -08:00
Enrico Ros 0eff7825c8 CC: slashcommands: xAI Reponses API sync 2026-02-08 17:27:08 -08:00
Enrico Ros 5c8baee390 CC: /code:review-inflight 2026-02-07 13:46:01 -08:00
Enrico Ros 3f71facb49 CLAUDE.md: update 2026-02-07 13:46:01 -08:00
Enrico Ros eba42cc8f2 CLAUDE.md: dev env 2026-02-07 13:46:01 -08:00
Enrico Ros 53092cee51 CC: allow tsc, eslint 2026-02-07 13:46:01 -08:00
Enrico Ros 4bf621f128 LLMs: OpenAI GPT-5.3-Codex speculative support 2026-02-07 13:42:12 -08:00
Enrico Ros 33505dbb8e LLMs: Anthropic/OpenRouter: align behavior, align UI #962 2026-02-06 22:40:55 -08:00
Enrico Ros c81e1f144f AIX: OpenRouter: protocol bits 2026-02-06 20:56:39 -08:00
Enrico Ros ee788b967b Roll AIX 2026-02-06 20:11:10 -08:00
Enrico Ros 38ac8733f6 AIX: OpenRouter: comment on debug: too risky 2026-02-06 20:10:48 -08:00
Enrico Ros 737a20ee06 AIX: OpenRouter: enable the stricter 'require_parametrs' mode. #948 2026-02-06 20:05:05 -08:00
Enrico Ros 19f48b8001 AIX: OpenRouter: wires for OR debug parameters 2026-02-06 19:51:50 -08:00
Enrico Ros 3471d6b4f5 Roll AIX 2026-02-06 19:30:49 -08:00
Enrico Ros 2dc7ba72b3 AIX/LLMs: bits 2026-02-06 19:30:18 -08:00
Enrico Ros e12279dab0 AIX: Anthropic: show the US inference setting when on 2026-02-06 19:24:04 -08:00
Enrico Ros 2e0c79cb64 LLMs: OpenRouter: also inherit the initial temperature from upstreams 2026-02-06 19:19:33 -08:00
Enrico Ros aa697edb8c AIX: Anthropic: minor API changes 2026-02-06 19:18:54 -08:00
Enrico Ros c72e3c58dd AIX: Anthropic: allow US servers 2026-02-06 19:17:01 -08:00
Enrico Ros 1de30c8bd5 AIX: Anthropic: accomodate some API changes 2026-02-06 18:52:58 -08:00
Enrico Ros 3a8eea6fb7 Roll AIX 2026-02-06 18:37:05 -08:00
Enrico Ros b7fd0bdba7 LLMs: OpenRouter: auto-inherit configurable parameters from Anthropic, Gemini and OpenAI.
Fixes #948: OpenAI-through-OR verbosity is sync'd with OpenAI models.

Fixes #893: Gemini-through-OR parameters are synchronized with Gemini models

Fixes #940: OpenAI-through-OR reasoning effort is synced with OpenAI models and much improved. We will have to still fix #944 for OpenAI levels to be fully sync'd with upstream (in progress)
2026-02-06 18:27:38 -08:00
Enrico Ros 58457cac50 LLMs: OR/Anthropic: support effort and adaptive.
Fixes #962
2026-02-06 18:27:38 -08:00
Enrico Ros 0fbacee7dc LLMs: Anthropic: editable Max effort. #962 2026-02-06 18:27:38 -08:00
Enrico Ros a498f28d14 LLMs: Anthropic: support for max effort. #962 2026-02-06 18:26:07 -08:00
Enrico Ros 5b9c6a2d0e LLMs: Anthropic: support adaptive thinking correctly. #962 2026-02-06 18:26:07 -08:00
Enrico Ros 4c7f50ab98 LLMs: Anthropic: inline thinking budget 2026-02-06 18:26:07 -08:00
Enrico Ros ef03d33bbf LLMs: Anthropic: GA skills 2026-02-06 18:26:07 -08:00
Enrico Ros 22c9fc56c0 LLMs: Opus 4.6: naming 2026-02-06 18:26:07 -08:00
Enrico Ros c952fd734f LLMs: Opus 4.6: remove forcing 2026-02-06 18:26:07 -08:00
Enrico Ros 310e99af23 LLMs: Opus 4.6: sort order, unhide 4.5 2026-02-06 18:26:07 -08:00
Enrico Ros e78446904a Docker: remove broken command directive. Fixes #964 2026-02-06 18:25:24 -08:00
Enrico Ros 760e9d8279 CC: Anthropic: update sources of info 2026-02-06 18:25:24 -08:00
Enrico Ros 61a60c5b9f Markdown: bundle in main chunk instead of lazy-loading 2026-02-06 12:41:41 -08:00
Enrico Ros 3054e1b88d Node 24: add .nvmrc, drop 26 from engines 2026-02-06 12:41:41 -08:00
Enrico Ros 6f4fabf147 Claude Opus 4.6 baseline support 2026-02-05 12:02:21 -08:00
Enrico Ros b0c791a055 Sweep: bits 2026-02-05 03:35:40 -08:00
Enrico Ros 748991249a LLMs: OpenAI: Update tooling availabiltiy across models 2026-02-05 02:36:28 -08:00
Enrico Ros 1aea7122cc Sweep: improve detection of connection issues 2026-02-05 02:35:47 -08:00
Enrico Ros 9a83b428f1 AppBreadcrumbs: auto-ellipsize 2026-02-05 01:21:46 -08:00
Enrico Ros 2cd38bc02b Sweep: update baseline with improved OpenAI chatCompletion values. remove verbosity when the only value is medium (aka, no parameter) 2026-02-05 00:44:48 -08:00
Enrico Ros e586142190 AIX: OpenAI-compatible: ChatCompletions: support verbosity for all (not just openrouter) 2026-02-05 00:07:36 -08:00
Enrico Ros a10d0dcf5d LLMs: auto-inject image output 2026-02-05 00:07:36 -08:00
Enrico Ros 6fdff488a9 Sweep: neutered values 2026-02-05 00:07:36 -08:00
Enrico Ros 8af0d78127 Sweep: adapt to the interfaces like aix.client.ts 2026-02-04 23:07:21 -08:00
Enrico Ros 177686a7fc Sweep: add option to merge models instead of wiping the file 2026-02-04 23:01:40 -08:00
Enrico Ros 09b6e47036 Sweep: fix Responses interface application 2026-02-04 21:14:27 -08:00
Enrico Ros 704187ba3e Models Modal: change visibility 2026-02-04 20:49:39 -08:00
Enrico Ros 4ea8a06503 LLMs: auto-inject web search 2026-02-04 20:49:39 -08:00
Enrico Ros 80fcc7d3e3 Security: client-dominated credential isolation for OpenAI access 2026-02-04 20:09:16 -08:00
Enrico Ros a04c62da6f LLMs: OpenAI: fix verbosity (automated). Fixes #947 2026-02-04 19:57:50 -08:00
Enrico Ros fcb518a050 Security: prevent key exfil 2026-02-04 19:43:09 -08:00
Enrico Ros a222626933 CC: sweep: small note 2026-02-04 19:31:41 -08:00
Enrico Ros a3ceade738 Security: anti-dns-spoofing anthropic 2026-02-04 19:26:57 -08:00
Enrico Ros 51d58223b4 Sweep: more succinct output 2026-02-04 19:12:50 -08:00
Enrico Ros d37a603db2 LLMs: OpenAI: Auto 0-day Responses suport. Fixes e458bca1a. #937 2026-02-04 19:04:13 -08:00
Enrico Ros ea984f3ddf Security: anti-dns-spoofing matching 2026-02-04 18:49:31 -08:00
Enrico Ros a9d3e3dead CC: llms: verify-parameters 2026-02-04 18:49:31 -08:00
Enrico Ros 5499e57205 Tools: sweep: json: fold some sweeps into a 'tools' array 2026-02-04 17:45:50 -08:00
Enrico Ros 6f8ee0247f Tools: sweep: baselines 2026-02-04 17:33:23 -08:00
Enrico Ros 05ee5cc3d1 Tools: sweep: merge id-based parameters 2026-02-04 17:12:36 -08:00
Enrico Ros cb6b569330 Tools: sweep: remove unnecessary configs 2026-02-04 17:05:30 -08:00
Enrico Ros 53073ff109 Tools: sweep: remove opanti summary 2026-02-04 17:05:16 -08:00
Enrico Ros 26d362d7a6 Tools: sweep: partition per-dialect 2026-02-04 16:40:35 -08:00
Enrico Ros 91d99e1a63 Tools: sweep: improvements for Gemini and Anthropic, and to save/load of results 2026-02-04 16:17:19 -08:00
Enrico Ros a20917c971 Tools: sweep: incremental output save 2026-02-04 15:23:00 -08:00
Enrico Ros af9bf9e5b3 Tools: sweep: parallel support 2026-02-04 15:13:39 -08:00
Enrico Ros 46b473b8a0 Tools: sweep: Gemini sweeps. #953 2026-02-04 15:03:31 -08:00
Enrico Ros e2b4028223 Tools: sweep: only select from the predefined sweeps inside the config file, #944, #947, #953 2026-02-04 14:52:09 -08:00
Enrico Ros bac2a31782 Tools: sweep: add opeanai image generation and search tool presence, #944, #947, #953 2026-02-04 14:51:57 -08:00
Enrico Ros 3d20e6bf91 Tools: llm parameter sweep. #944, #947, #953 2026-02-04 14:12:44 -08:00
Enrico Ros 9337216092 tRPC fetchers: console logging on connect/response/parsing can be disabled via env 2026-02-04 14:12:44 -08:00
Enrico Ros cd35d0ca55 Add TSX as a dev dependency 2026-02-04 10:54:44 -08:00
Enrico Ros 6d591b98b8 Roll packages (deep) 2026-02-04 10:53:53 -08:00
Enrico Ros 486381ab9d Sprites: run the gen node native, as module 2026-02-04 10:34:14 -08:00
Enrico Ros c619b4debb ListItemGroupCollapser: sm everywhere 2026-02-04 01:35:55 -08:00
Enrico Ros 383a3085ec Chat Dropdown: adapt Optima Dropdown. #955 2026-02-04 01:03:18 -08:00
Enrico Ros 5a3bb3d817 Chat Dropdown: adapt llmSelect. #955 2026-02-04 01:03:02 -08:00
Enrico Ros d1ba758887 Chat Dropdown: reuse toggleable set and Collapser. #955 2026-02-04 00:55:39 -08:00
Enrico Ros 6fef149997 Sprites: port models-modal 2026-02-03 23:38:50 -08:00
Enrico Ros aad3b16ff2 Sprites: port useLLMSelect, Beam 2026-02-03 23:38:50 -08:00
Enrico Ros 819ba14523 Sprites: Generate and wire 2026-02-03 23:38:50 -08:00
Enrico Ros d3c25ca16a Sprites: update generator with class 2026-02-03 23:38:27 -08:00
Enrico Ros 99a65f72ac Sprites: generator update 2026-02-03 22:35:55 -08:00
Enrico Ros be9080d392 Sprites: generator 2026-02-03 22:35:55 -08:00
Enrico Ros f32d991413 Chat Dropdown: reusable parts. #955 2026-02-03 22:34:12 -08:00
Enrico Ros 94b68ebefa CloseablePopup: memo. #955 2026-02-03 22:33:35 -08:00
Enrico Ros 0450eaaceb CC: rel:release-open 2026-02-03 09:20:11 -08:00
Enrico Ros 408c5ce088 Readme: update counter 2026-02-02 17:13:13 -08:00
Enrico Ros d936629ead 2.0.3: update readme 2026-02-02 15:48:02 -08:00
Enrico Ros 9bd1a66208 2.0.3: update news 2026-02-02 15:43:30 -08:00
Enrico Ros 1a0c029ee8 2.0.3: update package 2026-02-02 15:26:53 -08:00
Enrico Ros e7be228703 Roll AIX 2026-02-02 15:16:57 -08:00
Enrico Ros 0ab4dc972f Dockerfile: suppress CopyIgnoredFile warning for whitelist-style .dockerignore 2026-02-02 15:16:15 -08:00
Enrico Ros 5f1ca8954f Force touch to doubleClick support (disabled) 2026-02-02 15:09:11 -08:00
Enrico Ros 3ec1b033ce BlockEdit_TextFragment: support 'xs' editing. #961 2026-02-02 14:47:56 -08:00
Enrico Ros 0caf27af9b LLMs: skip prod warning for connection errors, they're still shown in the router warnings in dev 2026-02-02 12:55:38 -08:00
Enrico Ros bd67e14fa4 Debug: Wire: off 2026-02-02 12:37:13 -08:00
Enrico Ros 494c3b542c AIX/LLMs: LMStudio: generic conversion 2026-02-02 12:26:36 -08:00
Enrico Ros 8e0884eb64 AIX/LLMs: LMStudio: convert WebP -> JPG instead 2026-02-02 12:21:54 -08:00
Enrico Ros 73c4dc4ac8 AIX/LLMs: support for WebP -> PNG conversion at the hotfix (pre-CGR) stage 2026-02-02 12:18:31 -08:00
Enrico Ros d77274058d LLMs: LMStudio: use native API for detailed model information 2026-02-02 12:07:45 -08:00
Enrico Ros 0c8460419b AIX: ImageContentPart: allow not detail 2026-02-02 11:34:10 -08:00
Enrico Ros eabb589390 AIX: relax error parsing to .error: { message: '..' } 2026-02-02 11:29:52 -08:00
Enrico Ros 62f860ae93 Debug: Wire: clip curl to 4096 inner 2026-02-02 11:26:57 -08:00
Enrico Ros 605aae873c Roll packages with net removal 2026-01-31 16:40:23 -08:00
Enrico Ros 62e9ee5b05 Roll react hook form which shall be even lighter 2026-01-31 16:32:02 -08:00
Enrico Ros d686f5d143 Roll verified changes 2026-01-31 16:26:44 -08:00
Enrico Ros 3922f232ae Roll some types and prettier 2026-01-31 16:22:31 -08:00
Enrico Ros 6735b438d3 Roll Next to fix CVE 2026-01-31 16:15:57 -08:00
Enrico Ros fb1e30ab32 Roll PostHog-node 2026-01-31 16:11:32 -08:00
Enrico Ros 0ec06edb57 Roll PostHog-js 2026-01-31 16:10:21 -08:00
Enrico Ros 2a52673c56 Merge pull request #959 from enricoros/dependabot/github_actions/docker/login-action-3.7.0
chore(deps): bump docker/login-action from 3.6.0 to 3.7.0
2026-01-31 16:09:49 -08:00
Enrico Ros cc20d00d8a Drive picker: improve token handling with expiration 2026-01-31 15:52:24 -08:00
Enrico Ros 3d9201f7dc Drive picker: add a button to close and reset 2026-01-31 15:12:26 -08:00
dependabot[bot] 176732a6c0 chore(deps): bump docker/login-action from 3.6.0 to 3.7.0
Bumps [docker/login-action](https://github.com/docker/login-action) from 3.6.0 to 3.7.0.
- [Release notes](https://github.com/docker/login-action/releases)
- [Commits](https://github.com/docker/login-action/compare/5e57cd118135c172c3672efd75eb46360885c0ef...c94ce9fb468520275223c153574b00df6fe4bcc9)

---
updated-dependencies:
- dependency-name: docker/login-action
  dependency-version: 3.7.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-01-31 11:32:23 +00:00
Enrico Ros 39815b3af3 CustomMarkdownRenderer: reduce log 2026-01-31 03:13:46 -08:00
Enrico Ros bcce517089 Drive: token provider interface 2026-01-31 03:01:36 -08:00
Enrico Ros a4b50d0d97 OpenAI: Raise instancelimit to 10 2026-01-31 03:01:36 -08:00
Enrico Ros 2a124e7588 Drive: text bits 2026-01-30 20:04:24 -08:00
Enrico Ros a85556ab5b Attach content (docs, images, pdf, etc.) from Google Drive. Fixes #943 2026-01-30 19:18:58 -08:00
Enrico Ros cef93d6084 Attachments: cloud types 2026-01-30 17:00:57 -08:00
Enrico Ros 207e257778 AIX: Demuxers: add recovery of incomplete buffers and server-side logging 2026-01-30 14:23:29 -08:00
Enrico Ros 12203daa22 LLMs: New Heuristic model enumeration for the OpenAI protocol 2026-01-30 12:59:14 -08:00
Enrico Ros 27f8e9248d ModelsList: hidden LLMs get faded chips 2026-01-30 12:42:37 -08:00
Enrico Ros 51384dc984 AIX: remove unused OAI Realtime IF 2026-01-30 12:36:44 -08:00
Enrico Ros bc76cbb5ad Server-side fetchers: opt-in optional headers map 2026-01-30 11:55:18 -08:00
Enrico Ros 5a1ca83f6d Server-side fetchers: debug print headers 2026-01-30 11:50:27 -08:00
Enrico Ros c9f585f808 Server/Client Wire improvement, including listModels 2026-01-30 11:39:35 -08:00
Enrico Ros 9f559e1dbf LLMs: Groq: re-add SDAIA 2026-01-30 11:10:52 -08:00
Enrico Ros e458bca1a7 Revert "LLMs: OpenAI Responses API by default for new models. Fixes #937"
This caused all unknown models to default to the responses API.
We need heuristics for determining OpenAI vs OpenAI-compatible.

This reverts commit a6862d8c58.
2026-01-30 03:09:16 -08:00
Enrico Ros 43d2226019 AIX: Fix LiteLLM breakage 2026-01-30 02:01:21 -08:00
Enrico Ros 122bc34701 AIX: don't retry insufficient balance errors 2026-01-29 23:46:19 -08:00
Enrico Ros e01358e268 LRS: add back mmmu 2026-01-29 23:00:59 -08:00
Enrico Ros 847c84c3e6 LLMs: remove cbaMmmu 2026-01-29 22:39:47 -08:00
Enrico Ros b11cac4328 LLMs: sync CB 2026-01-29 22:31:09 -08:00
Enrico Ros f617b06109 LLMs: copy over initialtemperature if/when available on a per-model basis for 'fromManualMapping' 2026-01-29 22:31:08 -08:00
Enrico Ros 345ccf3369 DLLM/ModelDescription: remove trainigDataCutoff: not used anymore in this day and age 2026-01-29 21:43:03 -08:00
Enrico Ros d111b8af62 LLMs: Update XAI verified 2026-01-29 21:23:43 -08:00
Enrico Ros 8f964c5c49 LLMs: Update OpenPipe 2026-01-29 21:22:34 -08:00
Enrico Ros b6f3f4538f LLMs: Update OpenAI Instant models 2026-01-29 21:22:02 -08:00
Enrico Ros f6dd30d5d8 LLMs: Update Mistral 2026-01-29 21:21:12 -08:00
Enrico Ros af8b79f849 LLMs: Update Groq 2026-01-29 21:20:24 -08:00
Enrico Ros 0cfccc423b LLMs: Update Gemini (deprecations) 2026-01-29 21:17:40 -08:00
Enrico Ros f9a5d582d4 LLMs: Update Ollama 2026-01-29 21:14:25 -08:00
Enrico Ros 684e00d594 Model Services: bits2 2026-01-29 21:08:20 -08:00
Enrico Ros 3cd2df0b50 Model Services: Duplicates removal 2026-01-29 21:01:44 -08:00
Enrico Ros 02197f4ee6 OpenAI: fix sequence number validation - broken by LiteLLM. Fixes #957 2026-01-29 13:49:46 -08:00
Enrico Ros f9049a3fea ModelsWizard: fix build 2026-01-29 13:36:05 -08:00
Enrico Ros 462bddc271 Model Service: Service menu 2026-01-29 13:16:25 -08:00
Enrico Ros f79000cf39 Model Service: 3 dots button 2026-01-29 13:16:25 -08:00
Enrico Ros 1d95273f4d Models Refetch: update icon 2026-01-29 13:16:25 -08:00
Enrico Ros 6c4579f434 LLM Duplication copy 2026-01-29 13:16:25 -08:00
Enrico Ros 4ef56ade21 LLMs: OpenAI: default 5.2 to medium, since we have the no-thinking model too 2026-01-29 13:05:16 -08:00
Enrico Ros 7c1369d6e9 LLM Store: allow for removal/reset service-wide 2026-01-29 13:05:16 -08:00
Enrico Ros 533d54b106 LLM Options: reflect status elsewhere 2026-01-29 12:36:33 -08:00
Enrico Ros cce0ca6560 LLM Options: Clone advanced. #941 2026-01-29 12:36:01 -08:00
Enrico Ros e87ce2593c LLM Options: bits 2026-01-29 12:32:32 -08:00
Enrico Ros 431dc8b667 LLM Temperature: allow for Vendor-set no temperature (future) 2026-01-29 12:32:32 -08:00
Enrico Ros 5caf614bf7 LLM Clones: correctly group services when updating, inserting custom models 2026-01-29 12:32:32 -08:00
Enrico Ros ecf9703570 LLM Clones: User can create clones. #941 2026-01-29 12:32:32 -08:00
Enrico Ros e7641393a0 LLMParametersEditor: fix controlled component 2026-01-29 12:32:31 -08:00
Enrico Ros 2201f6ff5a ModelsList: memo feature chips 2026-01-29 12:31:59 -08:00
Enrico Ros 557e1ce293 Clipboard Content Transform: fix copying of code 2026-01-29 11:10:56 -08:00
Enrico Ros cbe9a6b9a5 LLMOptions: style a bit 2026-01-29 00:36:16 -08:00
Enrico Ros 9bbcb038d4 LLMOptions: launch purpose (not on) 2026-01-29 00:28:28 -08:00
Enrico Ros 3602204420 LLMOptions: add commands 2026-01-28 23:52:10 -08:00
Enrico Ros 6f485e5589 GoodModal: shrink on xs 2026-01-28 23:49:29 -08:00
Enrico Ros 2f46a3dfaf LLM Options: Details/Override 2026-01-28 17:45:39 -08:00
Enrico Ros 267845bba3 LLM Options: LLM actions 2026-01-28 17:21:52 -08:00
Enrico Ros 6f33a8eebf LLMs: improve variants handling across the board. #941 2026-01-28 17:03:59 -08:00
Enrico Ros b0d2b09a2e AIX: Fix show injector 2026-01-28 17:03:58 -08:00
Enrico Ros c699b6b16b OpenAI: add a no-thinking variant for GPT-5.2, which allows to change temperature. #941 2026-01-28 17:03:58 -08:00
Enrico Ros 1789bac28d Roll AIX 2026-01-28 03:21:30 -08:00
Enrico Ros 60c05f615f AIX Debugger: unfiler 2026-01-28 03:02:12 -08:00
Enrico Ros bd84523671 LLMs: Gemini: removed models 2026-01-28 02:48:58 -08:00
Enrico Ros eb21b9c770 Speex: open config if closed 2026-01-28 02:42:24 -08:00
Enrico Ros ff3ac11afb LLMParametersEditor: survive undefined temperature 2026-01-28 02:23:39 -08:00
Enrico Ros 1ef8c3d02b LLMs: Gemini: temperature defaults to 1.0 and can be changed 2026-01-28 02:01:55 -08:00
Enrico Ros 2ebaf6279b AIX: Injector: works well 2026-01-28 01:42:50 -08:00
Enrico Ros a5ee40e184 Speex: fix build 2026-01-28 01:15:56 -08:00
Enrico Ros b17a97eac7 AIX: request body injection. #953 2026-01-28 01:09:32 -08:00
Enrico Ros 63908bfaf6 Speex: update README 2026-01-27 23:53:27 -08:00
Enrico Ros 3f9a419a19 Speex: +Inworld Config 2026-01-27 23:50:33 -08:00
Enrico Ros bae691e33e Speex: +Inworld 2026-01-27 23:50:33 -08:00
Enrico Ros 91539346ee Speex: Fix #624 2026-01-27 22:08:39 -08:00
Enrico Ros 4842ca81b3 Speex: Cancellable preview 2026-01-27 22:08:38 -08:00
Enrico Ros 9c77a1a4ab Speex: Chunk test: remove button 2026-01-27 22:08:36 -08:00
Enrico Ros 4af284be42 Speex: prevent voice/engine mismatch 2026-01-27 22:08:15 -08:00
Enrico Ros 6aec68bb3c Speex: fix unlimited chunking 2026-01-27 22:08:15 -08:00
Enrico Ros d4e2b0834f Speex: allow inner calls, for bytes access 2026-01-27 20:23:10 -08:00
Enrico Ros 24c2702f96 Speex: ghost key to 'tts' for remembering the pass 2026-01-27 20:18:05 -08:00
Enrico Ros 4691fc9bad Speex: pre-wrap errors 2026-01-27 20:17:58 -08:00
Enrico Ros 8c6c60b6f1 Speex: fix webspeech voice selection 2026-01-27 20:17:02 -08:00
Enrico Ros bc482407fe Speex: overhaul for chunking and future synchroniciy / controllability 2026-01-27 20:11:49 -08:00
Enrico Ros ff05593db8 AudioAutoPlayer: handler for any streaming or full-file play, with stop and await ending 2026-01-27 20:11:49 -08:00
Enrico Ros 3d304d9374 AudioLivePlayer: extra safety 2026-01-27 20:11:49 -08:00
Enrico Ros 1734f0c2f1 AudioLivePlayer: extra safety, if stop was called already, waitForPlaybackEnd would return immediately 2026-01-27 19:38:48 -08:00
Enrico Ros 1b25e5df85 AudioLivePlayer: anti-leak the objectUrl and resolve wait on stop 2026-01-27 19:38:48 -08:00
Enrico Ros ea8eb32b0b AudioLivePlayer: await until done 2026-01-27 19:38:48 -08:00
Enrico Ros 614a1f95de AudioPlayer: improve straight play function 2026-01-27 19:38:48 -08:00
Enrico Ros d36bc28914 blobUtils: combine arrayBuffers 2026-01-27 19:38:48 -08:00
Enrico Ros deec48d7c1 CC: enable gh issues list 2026-01-27 19:38:48 -08:00
Enrico Ros b318ec8d39 Merge pull request #951 from jayrepo/patch-3
Add middleware.ts to docker image
2026-01-26 23:11:08 -08:00
Jay Chen b4b0e2befc Update .dockerignore 2026-01-27 14:22:22 +08:00
Enrico Ros 51d3fe13da Roll AIX 2026-01-26 19:51:41 -08:00
Enrico Ros 58220216d3 LLMs/AIX: support for Kimi Thinking On/Off 2026-01-26 19:43:06 -08:00
Enrico Ros cac75cca42 LLMs: Detail Kimi-K2.5 2026-01-26 19:24:09 -08:00
Enrico Ros 47f247907f LLMs: Add Kimi-K2.5 2026-01-26 18:57:50 -08:00
Enrico Ros 81e04b7322 ChatDrawer: buckets: fewer splits, more stability 2026-01-25 19:39:05 -08:00
Enrico Ros 56a964b700 AIX debugger: highlight non-conversation frames 2026-01-25 19:37:44 -08:00
Enrico Ros 458341d79f AIX debugger: don't auto-advance frame for support operations 2026-01-25 19:37:43 -08:00
Enrico Ros d1d212b075 Copy: intercept to others too 2026-01-24 20:24:52 -08:00
Enrico Ros 59c9996489 LLM Params update: TS improvements 2026-01-24 19:33:25 -08:00
Enrico Ros bf8221a2f1 LLM Params update: MDS with int (not float) 2026-01-24 19:33:25 -08:00
Enrico Ros 787a11a040 LLM Params system: Improve types definition 2026-01-24 19:33:25 -08:00
Enrico Ros 05d114be2f Copy: redo visual copy and copy interception (Ctrl+c, etc) for Plain text and HTML 2026-01-24 19:33:25 -08:00
Enrico Ros 3c04a7dbac Copy: also disable copy of collapsed Expanders 2026-01-24 19:33:25 -08:00
Enrico Ros 1673e1148d Copy: annotate what to Not copy 2026-01-24 19:33:25 -08:00
Enrico Ros de416b035d Copy: remove overlay copy button on the message 2026-01-24 19:33:25 -08:00
Enrico Ros 08aaf2989d Beam: always show model selector for Custom fusion 2026-01-24 15:49:54 -08:00
Enrico Ros a50964060c Stop ResizeObserver issue notifications 2026-01-24 15:41:41 -08:00
Enrico Ros 54b6108719 Beam: hide Merge Model selector for Custom fusion 2026-01-24 15:32:58 -08:00
Enrico Ros 585e5c254a Roll AIX 2026-01-23 11:42:32 -08:00
Enrico Ros 477808c9bb AIX: OAI Responses: allow 'failed' on web_search_call 2026-01-23 11:42:22 -08:00
Enrico Ros 6c58a2b688 AIX: OAI Responses: allow 'failed' on web_search_call 2026-01-23 11:42:10 -08:00
Enrico Ros c9854bf30f LLMs: OpenAI: partial Reasoning Effort updates. #944 2026-01-23 11:28:03 -08:00
Enrico Ros cfed4bbd41 LLMs: OpenAI: remove restore markdown on GPT-5 models 2026-01-23 11:20:53 -08:00
Enrico Ros 2dd6485b0e LLMParametersEditor: align to XAI X search default off 2026-01-23 11:19:32 -08:00
Enrico Ros bf1dd5b860 LLMs: Toggle code execution in Params Editor only (not quick) 2026-01-23 10:52:32 -08:00
Enrico Ros 765c373f7d LLMs: OAI: allow code execution 2026-01-23 10:52:32 -08:00
Enrico Ros 32d752e82b LLMs: OAI Responses: (unused) parameter for OpenAI code execution 2026-01-23 10:35:58 -08:00
Enrico Ros 4623e438fa AIX: OAI Responses: code interpreter Hosted tool def 2026-01-23 10:35:08 -08:00
Enrico Ros 8a44ff396f AIX: XAI: relax annotation title presence 2026-01-23 10:16:59 -08:00
Enrico Ros 086d7ecae4 Speex: TTS character limit settings. Fixes #942 2026-01-23 10:05:35 -08:00
Enrico Ros d6adebb711 Attachment buttons: full name in tooltip. Fixes #946 2026-01-23 09:55:29 -08:00
Enrico Ros 8325fe7b3c Roll AIX 2026-01-23 09:29:41 -08:00
Enrico Ros 7cf83f878b AIX: XAI: Response API Request + wiretypes. Fixes #938 2026-01-23 04:29:13 -08:00
Enrico Ros 597ba26424 AIX: Code Executor wires 2026-01-23 04:29:06 -08:00
Enrico Ros 7bccea47f5 AIX: OpenAI Responses: parse Code Execution and Custom Tools 2026-01-23 04:29:06 -08:00
Enrico Ros 5770116779 DMessage: Code Executor 2026-01-23 04:29:06 -08:00
Enrico Ros 0679144f69 LLMs/AIX: XAI new parametrization 2026-01-23 04:29:05 -08:00
Enrico Ros c9fd288b52 AIX: OpenAI chatCompletions: remove obsolete X search params 2026-01-23 04:16:52 -08:00
Enrico Ros 9ae449fcfd LLMs: type check server params against the client params def 2026-01-23 00:46:07 -08:00
Enrico Ros 249f67f796 AIX: improve dispatch messaging validation for 4 protocols 2026-01-23 00:46:07 -08:00
Enrico Ros e91c0bb554 AIX: stripUndefined 2026-01-23 00:28:05 -08:00
Enrico Ros 5e306d9598 AIX: XAI: models update 2026-01-23 00:25:33 -08:00
Enrico Ros 42ebc81cbb AIX: XAI: models update 2026-01-22 23:56:11 -08:00
Enrico Ros f624c37db5 AIX: XAI: models update 2026-01-22 17:51:03 -08:00
Enrico Ros 22b6f42936 AIX: OAI: Responses wiretypes changes 2026-01-22 17:25:54 -08:00
Enrico Ros 760c66cac8 Attachments: reposition menu to see the tokens bar 2026-01-22 17:18:08 -08:00
Enrico Ros 1d91e9da03 Attachments: client-side Markdown conversion and Text/HTML cleanup & Markdown conversion 2026-01-22 17:17:57 -08:00
Enrico Ros 7eac409ec6 AIX: XAI: model removal 2026-01-22 13:54:36 -08:00
Enrico Ros 128558420c AIX: XAI: model validation 2026-01-22 13:54:17 -08:00
Enrico Ros ca3e664690 AIX: spill part type cleanup 2026-01-22 13:12:29 -08:00
Enrico Ros 7eb37462d7 LLMs: update ollama models 2026-01-22 03:06:17 -08:00
Enrico Ros 31e02c2d39 CC: slashcommands: update llms:ollama 2026-01-22 03:02:55 -08:00
Enrico Ros 003a68b9b8 CustomMarkdownRenderer.tsx: allow for <br/> tags inside Table Cells. Fixes #939 2026-01-22 02:14:42 -08:00
Enrico Ros f418708389 AIX: Inspector: improve render 2026-01-22 01:55:14 -08:00
Enrico Ros d23a564035 AIX: Inspector: reassembler 'transport' data and ui 2026-01-22 01:45:20 -08:00
Enrico Ros 7fe586244c AIX: Inspector: debugger object client sync 2026-01-22 01:38:33 -08:00
Enrico Ros f1a597cdc6 AIX: move Inspector frames selector 2026-01-22 01:36:54 -08:00
Enrico Ros 9b68c8f58c CC: slashcommands: update llms:ollama 2026-01-22 01:06:11 -08:00
Enrico Ros be5b57ea71 LLMs: shared model definition validators 2026-01-22 00:54:00 -08:00
Enrico Ros 425c82f26d CC: slashcommands: improve parsing of ollama models 2026-01-22 00:08:07 -08:00
Enrico Ros 942421c1fb LLMs: together: fix key validation 2026-01-21 23:52:15 -08:00
Enrico Ros b1184f6928 AIX: CSF: also support client-side exceptions 2026-01-21 23:39:10 -08:00
Enrico Ros ffeb6d1b98 LLMs: models bits 2026-01-21 23:34:31 -08:00
Enrico Ros b2718b56b7 CC: shashcommands: improve kimi 2026-01-21 23:16:57 -08:00
Enrico Ros 455f834957 CC: allow llms:update 2026-01-21 23:05:56 -08:00
Enrico Ros 8a14c80ff8 CC: slashcommands: llms:groq update 2026-01-21 23:04:09 -08:00
Enrico Ros e268e733c7 LLMs: Groq: overlap check 2026-01-21 23:03:29 -08:00
Enrico Ros 8933a8dfb3 LLMs: Gemini: deprecations 2026-01-21 22:53:15 -08:00
Enrico Ros 9796cc525c LLMs: xAI verified 2026-01-21 22:37:11 -08:00
Enrico Ros cdbf9a9190 Speex: CSF support - auto-detected from linked services - client-unbundled-dynamic 2026-01-21 18:38:43 -08:00
Enrico Ros c26792292d Speex: extract rpc common core 2026-01-21 18:14:03 -08:00
Enrico Ros 4698e0ee03 LLMs: OpenAI: remove /v1/ from the user input, to immedialy give feedback on the correct way of doing it 2026-01-21 16:58:57 -08:00
Enrico Ros 68afcb2f4b AIX: OpenAI: disable reasoning summaries when disabling Streaming as well. Fixes #932 2026-01-21 16:52:36 -08:00
Enrico Ros e8f61e46e3 AIX: Fetchers: don't retry on 'request too large' 2026-01-21 16:22:50 -08:00
Enrico Ros 317bb2b7c8 useLLMSelect: preserve scrolling on cat toggle 2026-01-21 16:00:43 -08:00
Enrico Ros d1b3c6b468 ContentFragments: improve zero state add text look. #934 2026-01-21 16:00:42 -08:00
Enrico Ros b35eccc984 AIX: increase resilience to new values across 3 parsers. Fixes #918
Note: relaxed throws on Anthropic and Gemini (all throws), while on OAI-Responses we are raising to throws in dev.
2026-01-21 15:08:17 -08:00
Enrico Ros a780c92047 AIX: speculative support for keepalives on chatCompletions API. Fixes #918 2026-01-21 14:36:40 -08:00
Enrico Ros 5fc65698ba Collapsible Model Groups, #936 2026-01-21 14:10:30 -08:00
Enrico Ros c923b5ec4c Restyle llm select model groups #936 2026-01-21 13:36:43 -08:00
Enrico Ros 609b2b9a7b Group models correctly by service in Beam drop down list. Fixes #936 2026-01-21 13:36:31 -08:00
Enrico Ros a257278004 dMessageUtils: OSB 2026-01-21 13:15:26 -08:00
Enrico Ros 273daed634 Chat Messages: label heuristic to ignore the vendor-N- 2026-01-21 13:13:49 -08:00
Enrico Ros a6862d8c58 LLMs: OpenAI Responses API by default for new models. Fixes #937 2026-01-21 13:07:58 -08:00
Enrico Ros 323e5b4ea7 LLMs: OpenAI OSB speculative support 2026-01-21 13:06:47 -08:00
Enrico Ros 89217a5308 CC: allow gh issue view 2026-01-21 13:03:05 -08:00
Enrico Ros a45e995d2f AIX: OpenRouter: improve reasoning through OpenAI-completions across models. Fixes #893 2026-01-20 11:32:11 -08:00
Enrico Ros 8700b4c8ca Roll AIX 2026-01-20 02:07:55 -08:00
Enrico Ros 1f7f5fb488 Data (personas): more concise default 2026-01-20 01:43:45 -08:00
Enrico Ros afde8ee864 LLMs: OpenRouter: 'verbosity' support for OpenAI gpt-5 models, and Anthropic Claude Opus 4.5 (remapped to reasoning effort by OpenRouter). Fixes #927 2026-01-20 01:42:54 -08:00
Enrico Ros 3884c26b15 LLMs: OpenAI: Reviewed 5.1 Instant / 5.2 Params support. Closes #930 2026-01-20 01:21:45 -08:00
Enrico Ros 24dce7eae9 LLMs: further improve the LLMParametersEditor's sync with the dialog. #926 2026-01-20 01:09:04 -08:00
Enrico Ros 1db4e9b771 LLMs: Anthropic: Fix Effort in the LLM Options Dialog. Fixes #926 2026-01-20 00:31:39 -08:00
Enrico Ros b2ed7eae00 CC: CMDs: llms:align-params-uis 2026-01-20 00:31:39 -08:00
Enrico Ros 3169fd67e8 LLMs: OpenRouter: fix Antrhopic thinking models. Fixes #925 2026-01-20 00:15:00 -08:00
Enrico Ros 773ceb1396 Tools: data/llm: remove obsolete registry 2026-01-19 23:40:56 -08:00
Enrico Ros 8c62ee1720 LLMs/AIX: fully remove moderation
Doesn't seem to be used anymore by anyone, nor was active in the code
2026-01-19 23:32:15 -08:00
Enrico Ros 5fa1f52922 LLMs: openai service setup: bits 2026-01-19 23:09:20 -08:00
Enrico Ros d2180c010c LLMs: Helicone: start unformize 2026-01-19 22:42:16 -08:00
Enrico Ros b73df7b2ce LLMs: OpenAI: Autocomplete + suggest hosts for Chutes, Fireworks, Novita. #921 2026-01-19 22:42:16 -08:00
Enrico Ros 971f737846 LLMs: support Novita.ai models with capability auto-detection. #921 2026-01-19 22:39:32 -08:00
Enrico Ros a393353907 LLMs: Azure: rename custom field 2026-01-19 22:39:32 -08:00
Enrico Ros 751f609554 LLMs: OpenAI: unify paths 2026-01-19 22:39:31 -08:00
Enrico Ros e8cd5c6552 LLMs: Ant: unify paths 2026-01-19 22:39:31 -08:00
Enrico Ros 86e387b270 LLMs: allow OpenAI/Azure OpenAI services renaming. Fixes #922 2026-01-19 15:54:13 -08:00
Enrico Ros 32f15aa621 FormTextField: allow for end decorator 2026-01-19 15:54:13 -08:00
Enrico Ros bfc889a9e5 LLMs: fix reset of non-declared params at models update. Fixes #924 2026-01-19 14:53:46 -08:00
Enrico Ros bd907625a8 UpDate 2026-01-15 17:48:00 -08:00
Enrico Ros 60004926d7 Recommend DC (CSF) for Local services 2026-01-15 16:28:00 -08:00
Enrico Ros ac751dfd1a Roll AIX 2026-01-14 16:42:06 -08:00
Enrico Ros 6828eee17f LLMs: Perplexity: sync 2026-01-14 16:40:01 -08:00
Enrico Ros 19c97f397b LLMs: OpenAI: sync a GPT Audio model 2026-01-14 16:38:32 -08:00
Enrico Ros 0167a8bdd8 LLMs: Mistral: update 2026-01-14 16:36:31 -08:00
Enrico Ros 93e5044603 LLMs: Groq: strings 2026-01-14 16:36:19 -08:00
Enrico Ros 024d930677 LLMs: Gemini: small update 2026-01-14 16:35:22 -08:00
Enrico Ros 98873446a8 LLMs: Ollama: update tags 2026-01-14 16:35:15 -08:00
Enrico Ros 5318b7a406 OCR: cache tesseract's import 2026-01-14 16:30:57 -08:00
Enrico Ros 4a6c3cbcd2 Roll AIX 2026-01-14 15:41:58 -08:00
Enrico Ros ac0a39c202 LLMs: OpenAI: GPT-5.2 Codex support 2026-01-14 15:40:40 -08:00
Enrico Ros 88d39345a5 Attachments: PDF: add a PDF->OCR (via interim images) and an 'Auto' (default)
The Auto mode tries plain Text, then Text to Images to OCR, then falls back to pure images.
2026-01-14 15:09:40 -08:00
Enrico Ros 7aa9cb07b2 OCR: extract util functions 2026-01-14 13:18:44 -08:00
Enrico Ros ef30c8d28d Tesseract: roll package 2026-01-14 13:17:11 -08:00
Enrico Ros 2727f690b4 Attachments: PDFs: use Density to attach images 2026-01-14 10:59:43 -08:00
Enrico Ros 5945c24301 Speex: RPC: fallback to full buffer play if streaming is unavailable (Firefox) 2026-01-14 10:32:41 -08:00
Enrico Ros 7b6aff1f95 AudioLivePlayer: Firefix doesn't support MPEG streaming 2026-01-14 10:32:39 -08:00
Enrico Ros cb0fe3aadd AIX: OpenAI Responses: support for keepalive packets 2026-01-13 19:39:47 -08:00
Enrico Ros 4f9d69f9c2 AIX: Anthropic: Fix for out-of-order block start. Fixes #917 2026-01-12 16:40:16 -08:00
Enrico Ros c18aeabe06 CC: changelog cmd update 2026-01-12 04:18:02 -08:00
Enrico Ros 550742323a CC: changelog cmd 2026-01-12 04:02:17 -08:00
Enrico Ros c71f789a08 Draw PromptComposer: fix for CJK 2026-01-12 03:44:00 -08:00
Enrico Ros a9b4b195bf Extend #916 to InlineTextArea and editing of a Text Fragment 2026-01-12 03:37:02 -08:00
Enrico Ros 52e8177f42 Simplify #916 2026-01-12 03:36:33 -08:00
Enrico Ros b0743efc48 Merge branch 'fork/tantanorange/feat/bug-Input-Method-Editor' 2026-01-12 03:30:25 -08:00
Enrico Ros 6dfd652dac LLMParametersEditor: shrink label 2026-01-12 03:23:42 -08:00
Enrico Ros 3f93cb2e6d Phone: work over an empty conversation id 2026-01-12 03:11:49 -08:00
Enrico Ros 8f7b9b7f19 Optima Page Heading: support disabled 2026-01-12 02:10:38 -08:00
Enrico Ros abff89ab6b CC: ignore temp files 2026-01-12 02:10:20 -08:00
tantanorange d4f03f743a bug(issue-784): fixed 'Enter' trigger unexpected Chat under Iput Method Editor. 2026-01-11 18:32:55 -08:00
Enrico Ros c3714f6651 Camera Capture: darker 2026-01-10 15:20:30 -08:00
Enrico Ros 9b4d0ddf2f Merge pull request #913 from enricoros/dependabot/github_actions/actions/download-artifact-7.0.0
chore(deps): bump actions/download-artifact from 4.3.0 to 7.0.0
2026-01-10 14:46:32 -08:00
Enrico Ros 2c9ac2f549 Merge pull request #914 from enricoros/dependabot/github_actions/actions/upload-artifact-6.0.0
chore(deps): bump actions/upload-artifact from 4.6.2 to 6.0.0
2026-01-10 14:46:20 -08:00
dependabot[bot] c1292de2a0 chore(deps): bump actions/upload-artifact from 4.6.2 to 6.0.0
Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.6.2 to 6.0.0.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/ea165f8d65b6e75b540449e92b4886f43607fa02...b7c566a772e6b6bfb58ed0dc250532a479d7789f)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-version: 6.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-01-10 11:54:05 +00:00
dependabot[bot] 21d5e4cd29 chore(deps): bump actions/download-artifact from 4.3.0 to 7.0.0
Bumps [actions/download-artifact](https://github.com/actions/download-artifact) from 4.3.0 to 7.0.0.
- [Release notes](https://github.com/actions/download-artifact/releases)
- [Commits](https://github.com/actions/download-artifact/compare/d3f86a106a0bac45b974a628896c90dbdf5c8093...37930b1c2abaa49bbe596cd826c3c89aef350131)

---
updated-dependencies:
- dependency-name: actions/download-artifact
  dependency-version: 7.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2026-01-10 11:54:01 +00:00
Enrico Ros a9495a3e15 Roll eslint 2026-01-09 19:47:24 -08:00
Enrico Ros bff5b3d765 Roll posthog 2026-01-09 19:46:49 -08:00
Enrico Ros a4ff37eecc Roll safe packages 2026-01-09 19:40:14 -08:00
Enrico Ros 460209f486 GH Actions: fix manifest format 2026-01-09 17:36:28 -08:00
Enrico Ros 96c68c86a4 GH Actions: fix tag case 2026-01-09 17:16:22 -08:00
Enrico Ros 8b152fdff8 GH Actions: improve parallelism of the docker image build 2026-01-09 17:10:53 -08:00
Enrico Ros 25c9a52873 Dependabot: basic configuration 2026-01-09 16:40:28 -08:00
Enrico Ros 44302d903c CC Actions: checkout v6 2026-01-09 16:24:31 -08:00
Enrico Ros c7b8668609 GH Actions: docker-image: pin versions 2026-01-09 14:36:28 -08:00
Enrico Ros 7d60df6266 Docker: save 0.5GB 2026-01-09 14:27:04 -08:00
Enrico Ros b7f898a5e5 Docker: move to Node 24 and cleanup. #907 2026-01-09 13:55:02 -08:00
Enrico Ros 04c4dbe4b8 Docker: remove compose version 2026-01-09 13:53:31 -08:00
Enrico Ros 8d04c494df Docker: negate .dockerignore 2026-01-09 13:53:31 -08:00
Enrico Ros a6aadf76f3 Revert "Fix Node 25 build..." - breaks other Node versions.
This reverts commit b70d57d878.
2026-01-09 13:53:30 -08:00
Enrico Ros a685ef97bf AIX: chatGenerate executor: object-string-ellipsize when replaying input (received from the llm) 2026-01-09 13:36:06 -08:00
Enrico Ros d46c29689f AIX: OpenRouter: support image generation through OpenAI's OR-extended API, including supporting advanced Gemini params. Fixes #906 2026-01-09 13:36:06 -08:00
Enrico Ros 65ce07395b Fix drag-to-update on mobile 2026-01-09 13:36:06 -08:00
Enrico Ros cc1542fe95 Wire: improve debug print with object-ellipsize-strings 2026-01-09 13:36:06 -08:00
Enrico Ros b70d57d878 Fix Node 25 build, by using --no-webstorage with node - otherwise Zustand's persist middleware will break the build 2026-01-09 10:45:50 -08:00
Enrico Ros 5aa857362b Merge pull request #912 from enricoros/claude/issue-909-20260109-1034
feat(deepseek): add API Host field to DeepSeek settings
2026-01-09 02:42:29 -08:00
Enrico Ros c92fc34051 Merge pull request #911 from enricoros/claude/issue-902-20251229-1701
docs: remove Midori AI Subsystem section from installation guide
2026-01-09 02:39:46 -08:00
claude[bot] b01e66f12a feat(deepseek): add API Host field to DeepSeek settings
Add the ability for users to configure a custom API host for DeepSeek,
allowing them to use alternative endpoints like https://api.deepseek.com/beta.

Changes:
- Add `deepseekHost` to DDeepseekServiceSettings interface
- Wire deepseekHost to oaiHost in transport layer
- Add API Host form field visible in advanced settings

Closes #909

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2026-01-09 10:38:11 +00:00
Enrico Ros a88d20784a Roll AIX 2026-01-09 02:27:54 -08:00
Enrico Ros 63486ed6cf LLMs/AIX: support 4 levels for Gemini 3 Flash. Fixes #905 2026-01-09 02:27:53 -08:00
Enrico Ros 3ceec773f2 LLMs: DeepSeek: remove 3.2 Speciale 2026-01-09 02:08:54 -08:00
Enrico Ros 817fa56ec4 LLMs: Anthropic: remove Claude 3 Opus 2026-01-09 02:05:33 -08:00
Enrico Ros 088fb21a90 LLMs: Gemini: update cache pricing 2026-01-09 01:55:53 -08:00
Enrico Ros 79c755a469 Wire: ellipsize AixDemuxers.DemuxedEvents 2026-01-09 01:31:39 -08:00
Enrico Ros a091d3f011 OpenAI: support for gpt-image-1.5 2026-01-09 01:22:26 -08:00
Enrico Ros c7c01a5d7c AIX: Gemini: sync API: FC with multimodal responses, validated FC-Config, retrieval config, multiple voices, seed, responseId 2026-01-09 01:03:29 -08:00
Enrico Ros cdc0f48973 AIX: Gemini: support MISSING_THOUGHT_SIGNATURE 2026-01-09 00:46:10 -08:00
Enrico Ros e884f6b962 LLMs: Gemini: mark Deep Research Pro Preview as hidden for now - Interactions API coming 2026-01-08 20:46:15 -08:00
Enrico Ros 485a9bea71 LLMs: Gemini: remove removed models 2026-01-08 20:44:57 -08:00
Enrico Ros f3c3b667ca LLMs: OpenAI: remove chatgpt-image from llms 2026-01-08 17:51:58 -08:00
Enrico Ros 3b0c4f31b6 LLMs: Gemini: add 3-flash-preview 2026-01-08 17:47:06 -08:00
Enrico Ros 5e54600766 Deps: set peers 2026-01-08 17:43:41 -08:00
claude[bot] c3e54f69b7 docs: remove Midori AI Subsystem section from installation guide
The Midori AI Subsystem is being sunset as announced in issue #902.
This removes the deployment section from the installation documentation.

Closes #902

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2025-12-29 17:02:49 +00:00
Enrico Ros c4022d1c9b Roll small 2025-12-20 21:53:18 +01:00
Enrico Ros 6e13a78a24 Roll next 2025-12-20 21:40:15 +01:00
Enrico Ros c7cacd9727 LLMs: Gemini: remove 'medium' thinking level support - too early. Fixes #900 2025-12-20 21:08:41 +01:00
Enrico Ros a77110f704 LLMs: remove parameters set to a former enum value if then removed 2025-12-20 21:07:38 +01:00
Enrico Ros 83a6069de5 CC: update workflows 2025-12-14 03:45:26 +01:00
Enrico Ros e9a1890e54 Beam: rays/fusions can edit, delete 2025-12-11 23:50:06 +01:00
Enrico Ros bf928aa06e AIX: 'searching' item state with 5.2 Pro 2025-12-11 20:38:54 +01:00
Enrico Ros b2dc50590c LLMs: OpenAI GPT 5.2 Pro: enable web search 2025-12-11 20:33:53 +01:00
Enrico Ros 229e53ac32 Roll AIX 2025-12-11 20:20:26 +01:00
Enrico Ros 51e8a47615 OpenAI: Support X-Hight reasoning modes for 5.2-class models 2025-12-11 20:19:57 +01:00
Enrico Ros e80b58a412 UI: support 52 / 52Pro X-High/None reasoning efforts 2025-12-11 20:19:37 +01:00
Enrico Ros 48ced8b079 LLMs: support 52 / 52Pro X-High/None reasoning efforts 2025-12-11 20:19:30 +01:00
Enrico Ros c07e2aea1e AIX: support 52 / 52Pro X-High/None reasoning efforts 2025-12-11 20:19:19 +01:00
Enrico Ros f3194aa30e LLMs: Add GPT 5.2 2025-12-11 19:30:23 +01:00
Enrico Ros cb3e4cd951 LLMs: Mistral: update models 2025-12-09 13:33:01 -08:00
Enrico Ros f5d8d029ea CC: update workflows 2025-12-08 16:54:27 -08:00
Enrico Ros 7c946c4126 CC: add cherry-pick 2025-12-07 12:12:30 -08:00
Enrico Ros ded4ea0d69 Personas: disable YouTube transcript (unsupported API) 2025-12-07 12:10:27 -08:00
Enrico Ros c180c549fe BYOM: improve message 2025-12-07 11:53:49 -08:00
Enrico Ros 1f30f1168f Friction: Model Wizard: also warn if some keys are not saved 2025-12-07 11:52:51 -08:00
Enrico Ros 9446f15922 Friction: better remember Wizard model data 2025-12-07 11:51:35 -08:00
Enrico Ros e13b2c9cd9 Tutorial: 'BYOM' message 2025-12-07 11:49:18 -08:00
Enrico Ros e9e14e0292 LLMs: OpenRouter: add to the wizard 2025-12-07 11:46:04 -08:00
Enrico Ros added19656 Roll posthogs 2025-12-05 19:37:01 -08:00
Enrico Ros 4fa3c4d479 Remove old material (wrong) path 2025-12-05 19:32:34 -08:00
Enrico Ros 690738de9a Fix CVE-2025-55182 2025-12-05 18:41:13 -08:00
Enrico Ros cb31d27e68 Copy: strip background/colors on copy (keep font size and structure) 2025-12-05 11:17:17 -08:00
Enrico Ros e6658df123 Attachment: show dl issues on console 2025-12-05 11:15:15 -08:00
Enrico Ros 0b7154a14c LLMs: OpenAI: remove obsoleted models 2025-12-04 15:52:42 -08:00
Enrico Ros 02c1838de5 LLMs: OpenAI: add gpt-5.1-codex-max 2025-12-04 15:52:18 -08:00
Enrico Ros fc455fceb8 LLMs: Mistral: rmeove obsoleted models 2025-12-02 07:56:29 -08:00
Enrico Ros 8d40cdd234 LLMs: Mistral: sort 2025-12-02 07:53:15 -08:00
Enrico Ros 40145c669a LLMs: Mistral: add Mistral-Large and Ministral 2025-12-02 07:53:09 -08:00
Enrico Ros 34d2fc233f LLMs: Mistral: adapt wires 2025-12-02 07:52:55 -08:00
Enrico Ros 670ec0381a Speex: collapse configure - recollapse with shift 2025-12-01 09:11:48 -08:00
Enrico Ros 2128f255fe Speex: collapse configure when not used 2025-12-01 09:09:50 -08:00
Enrico Ros b717bd9a9a Settings: max-height to not jump around too much 2025-12-01 08:58:20 -08:00
Enrico Ros 8aab9311f5 Roll AIX 2025-12-01 08:33:32 -08:00
Enrico Ros ff3e16ea67 DeepSeek: still images are not supported 2025-12-01 08:30:16 -08:00
Enrico Ros 1de039c315 AIX: OpenAI ChatCompletion: remove multipart hotfix (not needed anymore) 2025-12-01 08:27:55 -08:00
Enrico Ros d05e1786d7 Model namings: speciale 2025-12-01 08:16:44 -08:00
Enrico Ros e34b5a7372 AIX: support Deepseek Speciale 2025-12-01 08:16:29 -08:00
Enrico Ros a1b3d1b508 DeepSeek: 3.2 and Speciale 2025-12-01 08:16:18 -08:00
Enrico Ros 1ebccdf420 Speex: Readmes 2025-12-01 02:59:56 -08:00
Enrico Ros e5f674509c 2.0.2 News 2025-11-30 16:54:56 -08:00
Enrico Ros 197a4ae5c0 2.0.2 Package 2025-11-30 16:53:25 -08:00
Enrico Ros 64d2dcf39c AudioLivePlayer: tryfix for the persistent android notification 2025-11-30 15:05:17 -08:00
Enrico Ros caf54c736b Speex: do not stop the playback too early 2025-11-30 14:31:43 -08:00
Enrico Ros 423c2cce28 speakText: port to Speex 2025-11-30 12:51:55 -08:00
Enrico Ros a1af51efcb Call: port to Speex 2025-11-30 06:55:51 -08:00
Enrico Ros ffc1bf9c58 Remove src/modules/elevenlabs 2025-11-30 06:55:51 -08:00
Enrico Ros a54bfdb342 Settings: port to Speex 2025-11-30 06:55:51 -08:00
Enrico Ros 03861d2dbd Speex: map instead of array 2025-11-30 06:38:14 -08:00
Enrico Ros 8c080da6bf Speex: Autoconfig WebSpeech best 2025-11-30 06:38:14 -08:00
Enrico Ros a8c98056b6 Speex: Config UI Done 2025-11-30 06:38:14 -08:00
Enrico Ros 78e663f955 Speex: important fixes 2025-11-30 06:38:14 -08:00
Enrico Ros 70546a5039 Speex: Almost Done 2025-11-30 06:38:14 -08:00
Enrico Ros 30f78b33cb Speex: diable Azure 2025-11-30 06:38:14 -08:00
Enrico Ros 712e8c1f16 Speex: UI update: Selects and Persona Voice changer 2025-11-30 06:38:14 -08:00
Enrico Ros 933dfdfb53 Speex: improve types 2025-11-30 06:38:14 -08:00
Enrico Ros 9ce86b029f Speex: UI settings modal 2025-11-30 06:38:14 -08:00
Enrico Ros 13580cc69d Speex: UI config improvements 2025-11-30 06:38:14 -08:00
Enrico Ros a7dee0002d Speex: debug instrumentation 2025-11-30 06:38:14 -08:00
Enrico Ros c84b2df3fa Speex: fix elevenlabs 2025-11-30 06:38:14 -08:00
Enrico Ros d9471a8684 Speex: fix types 2025-11-30 06:38:14 -08:00
Enrico Ros ef630c2272 Speex: improve UI and errors 2025-11-30 06:38:14 -08:00
Enrico Ros e188c71652 Speex: RPC: shared downstreaming 2025-11-30 06:38:14 -08:00
Enrico Ros 910260c2c8 Speex: UI: credentials edit and add new 2025-11-30 06:38:14 -08:00
Enrico Ros 22752abc38 Speex: relax engine validation 2025-11-30 06:38:14 -08:00
Enrico Ros 92bc3a5d64 Speex: DVoice -> wire_Voice 2025-11-30 06:38:14 -08:00
Enrico Ros 1383752cc1 Speex: reduce logging 2025-11-30 06:38:13 -08:00
Enrico Ros 66af16fb81 Speex: manual refactor 2025-11-30 06:38:13 -08:00
Enrico Ros fc019d7b46 Speex: client cleanups 2025-11-30 06:38:13 -08:00
Enrico Ros ac4f0fcb12 Speex: LocalAI: Preview 2025-11-30 06:38:13 -08:00
Enrico Ros a6c2bc663d Speex: arrange files 2025-11-30 06:38:13 -08:00
Enrico Ros e62ffa02e9 Speex: LocalAI vendor 2025-11-30 06:38:13 -08:00
Enrico Ros a003600839 Speex: some UI 2025-11-30 06:38:13 -08:00
Enrico Ros ea73feb06d Speex: remove elevenlabs, with key migration 2025-11-30 06:38:13 -08:00
Enrico Ros 3bdf69e1b7 Speex: ui: begin 2025-11-30 06:38:13 -08:00
Enrico Ros 590fe78bd1 Speex: client cleanup 2025-11-30 06:38:13 -08:00
Enrico Ros 76187ba0e7 Speex: rpc backend 2025-11-30 06:38:13 -08:00
Enrico Ros 5eba375f4d Speex: add webspeech (with detection) and synthesize-openai 2025-11-30 06:38:13 -08:00
Enrico Ros 8fa6a8251f Speex: vendors, engine store, client, router, skel-synthesize 2025-11-30 06:38:13 -08:00
Enrico Ros 75fa046f30 Speex: centralize capability 2025-11-30 06:38:13 -08:00
Enrico Ros 08a8cd1430 Speex: Types & Client 2025-11-30 06:38:13 -08:00
Enrico Ros 3afbb78a39 Icons: port to PhVoice 2025-11-30 06:38:12 -08:00
Enrico Ros fca6ccd816 Badge: transparent BG to not overlap text. Fixes #889 2025-11-29 14:52:13 -08:00
Enrico Ros 8d351822c1 Niy 2025-11-29 13:25:36 -08:00
Enrico Ros 7d274a31fe AIX: CGR: use shared objectUtils 2025-11-29 12:40:04 -08:00
Enrico Ros e36dde0d25 objectUtils: estimate JSON size, deep clone with string limit, find largestStringPaths 2025-11-29 12:17:28 -08:00
Enrico Ros 51cc6e5ae5 CSF: only show the option for server-side (not client-side) disconnect 2025-11-29 11:12:30 -08:00
Enrico Ros 28d911c617 ElevenLabsIcon: add icon 2025-11-28 05:49:33 -08:00
Enrico Ros b1e9fe58fb objectUtils: add stripUndefined 2025-11-28 04:23:11 -08:00
Enrico Ros 16ba014ade GoodBadge: for 'new' 2025-11-28 04:23:11 -08:00
Enrico Ros e9d5a20c1a FormTextField: support inputSx 2025-11-28 04:23:11 -08:00
Enrico Ros 6e0036f9c4 FormSecretField: crystal clear keys input 2025-11-28 04:23:11 -08:00
Enrico Ros d7e189aa1c FormSliderControl: allow sliderSx 2025-11-28 04:23:11 -08:00
Enrico Ros ea2b444fb2 FormChipControl: alignEnd 2025-11-28 04:23:11 -08:00
Enrico Ros cd1efaf26e FormChipControl: support descriptions 2025-11-28 04:23:11 -08:00
Enrico Ros e47f0e5d43 LanguageSelect: imrove select 2025-11-28 04:23:11 -08:00
Enrico Ros 5284d37984 AudioLivePlayer: ignore a closure error 2025-11-28 04:23:11 -08:00
Enrico Ros 1bf6fa0e4d Browse service: improve error reporting 2025-11-27 19:12:08 -08:00
Enrico Ros fc294c82f1 Pdfjs: lock to 5.4.54
more recent 5.4 have trouble with await import('pdfjs-dist'), throwing.
2025-11-27 18:33:20 -08:00
Enrico Ros 7b1dc49dda Roll pdfjs 2025-11-27 18:19:51 -08:00
Enrico Ros d15ddeea24 Roll react-player 2025-11-27 18:15:19 -08:00
Enrico Ros eaac213859 Ph: add Voice 2025-11-27 18:07:54 -08:00
Enrico Ros 02c1460351 Roll posthog 2025-11-27 18:04:06 -08:00
Enrico Ros 2fff35b7d9 Roll superjson 2025-11-27 18:03:37 -08:00
Enrico Ros c5b9072bde LLMs: LocalAI publish interface 2025-11-26 19:01:44 -08:00
Enrico Ros 8a570e912a CSF: docs 2025-11-26 07:37:56 -08:00
Enrico Ros 1dcc40afb8 CSF: Propagate everywhere 2025-11-26 07:37:09 -08:00
Enrico Ros c2092f8035 BlockPartError: vendor name 2025-11-26 06:50:11 -08:00
Enrico Ros 886c4b411e Revert "Test Edge on node"
This reverts commit 8888fd40cd.
2025-11-26 06:13:28 -08:00
Enrico Ros 8888fd40cd Test Edge on node 2025-11-26 04:56:26 -08:00
Enrico Ros 31cd01bccf BlockPartError: CSF enabled 2025-11-26 04:42:50 -08:00
Enrico Ros c59b221004 BlockPartError: allow retrying disconnected errors too 2025-11-26 04:27:52 -08:00
Enrico Ros cb3cc3e74c PostHog: disable the info level 2025-11-26 04:05:03 -08:00
Enrico Ros 9e90015fcc PostHog: disable the info level 2025-11-26 03:56:55 -08:00
Enrico Ros 95e0517056 60s - disable any maxDuration 2025-11-26 03:56:25 -08:00
Enrico Ros 2b2f47915f AIX: OpenAI: Fix CSF! 2025-11-26 03:11:12 -08:00
Enrico Ros 9acd178ce1 AudioPlayer: safe end of stream 2025-11-26 03:11:08 -08:00
Enrico Ros f381f80184 AIX: Anthropic: add strict to tool defs on wiretypes 2025-11-24 16:44:13 -08:00
Enrico Ros c83be61343 AIX: Anthropic: newlines for text broken by tool calls 2025-11-24 16:05:44 -08:00
Enrico Ros f6e49d31ec PWA-Desktop detect. Fixes #887 2025-11-24 15:48:50 -08:00
Enrico Ros cc0429a362 Update readme 2025-11-24 15:14:49 -08:00
Enrico Ros b35901d94c 2.0.1 Roll AIX 2025-11-24 15:06:39 -08:00
Enrico Ros c0df1a23f4 2.0.1 Update news 2025-11-24 15:05:40 -08:00
Enrico Ros 495619af2c LLMs: Interfaces fix 2025-11-24 15:00:09 -08:00
Enrico Ros 72dfadf106 AIX: Anthropic: auto-header for programmatic tool calling (calling from code, not just llm) 2025-11-24 14:58:34 -08:00
Enrico Ros 5825909e45 AIX: Anthropic: programmatic tool calling support 2025-11-24 14:42:20 -08:00
Enrico Ros d3f6d87ee0 AIX: remove legacy unconstrained 'json mode' 2025-11-24 14:29:36 -08:00
Enrico Ros c4f4c5ddad AIX: cross-vendor json output | strict tool invocation 2025-11-24 14:23:25 -08:00
Enrico Ros 2921d7ca27 Anthropic: Structured Outputs | Strict Tools 2025-11-24 13:59:20 -08:00
Enrico Ros 2021cbc988 Anthropic: MessageDeltaUsage 2025-11-24 13:58:57 -08:00
Enrico Ros e9e29861b2 Anthropic: cleanup models 2025-11-24 13:21:25 -08:00
Enrico Ros 8e6da36059 LLM interface types - relax for bw comp 2025-11-24 13:21:13 -08:00
Enrico Ros 5e1469e12e Anthropic: Tool Search Tool 2025-11-24 13:20:58 -08:00
Enrico Ros bd7465f8b1 Roll packages 2025-11-24 12:34:52 -08:00
Enrico Ros 570397a616 Anthropic: add Effort parameter 2025-11-24 12:34:39 -08:00
Enrico Ros b3b5f1daef Anthropic: raise default thinking to 16384 2025-11-24 12:13:50 -08:00
Enrico Ros 25ec3ae47c Anthropic: add Opus 4.5 2025-11-24 12:09:41 -08:00
Enrico Ros 5ba5e3da58 2.0.1 Roll AIX, news 2025-11-24 04:11:39 -08:00
Enrico Ros 9296c14ca0 2.0.1 News 2025-11-24 04:11:39 -08:00
Enrico Ros 310b5d3422 2.0.1 Package 2025-11-24 03:57:17 -08:00
Enrico Ros 1c5967112e Rolled posthog as there's still no local min 2025-11-24 03:19:08 -08:00
Enrico Ros 49a3d8ee71 Roll deep 2025-11-24 03:14:58 -08:00
Enrico Ros cf8b61e8d9 Packages: locked dexie 2025-11-24 03:11:26 -08:00
Enrico Ros 967ae5723e Roll posthog - next.config.ts fix 2025-11-24 02:39:28 -08:00
Enrico Ros 03421acf2f Roll posthog - security fix (details below)
https://helixguard.ai/blog/malicious-sha1hulud-2025-11-24
2025-11-24 02:39:15 -08:00
Enrico Ros d43896cc5a Package: cmd to fix sharp on win32/x64 2025-11-24 02:33:50 -08:00
Enrico Ros b283124a2f Roll packages 2025-11-24 02:05:37 -08:00
Enrico Ros 8c39be01f8 Roll packages 2025-11-24 02:04:23 -08:00
Enrico Ros fb2bd4ccd8 Error Hints: nit 2025-11-23 23:34:36 -08:00
Enrico Ros 5b826ffc45 Error Hints: control AI Service advanced setup 2025-11-23 23:26:56 -08:00
Enrico Ros 0b2ab365d3 Error Hints: Render Reconnect 2025-11-23 23:26:56 -08:00
Enrico Ros 93fc54992c Error Hints: AIX Client and Reassembler -> Fragment 2025-11-23 23:26:56 -08:00
Enrico Ros 60b7326deb Error Hints: Fragments 2025-11-23 23:26:56 -08:00
Enrico Ros d6e6139244 AIX: Gemini: change log 2025-11-23 23:26:56 -08:00
Enrico Ros 0892911ddc Next config: conditional strict mode 2025-11-23 23:26:56 -08:00
Enrico Ros 30267ac50c LLMs: Nano Banana message names 2025-11-23 23:16:43 -08:00
Enrico Ros ffef0ef31d PWA detect. Fixes #887 2025-11-23 23:15:56 -08:00
Enrico Ros fc047087ce CSF: direct connect actions hook 2025-11-23 23:15:03 -08:00
Enrico Ros 81d4966535 CSF: improve vendors 2025-11-23 20:02:06 -08:00
Enrico Ros 004d63fda1 FormRadioControl: gap 1 2025-11-23 16:23:35 -08:00
Enrico Ros 23e2dbb354 tRPC fetchers: increase error message to 240 2025-11-23 16:19:25 -08:00
Enrico Ros 28e9899b97 Settings: looks 2025-11-23 16:19:22 -08:00
Enrico Ros 7441d41550 FormRadioControl: descriptions 2025-11-23 16:19:11 -08:00
Enrico Ros 99e2d5597a LLMs: CSF: OpenAI 2025-11-23 02:56:08 -08:00
Enrico Ros 74321a44ca LLMs: Client-side .listModels 2025-11-23 02:46:28 -08:00
Enrico Ros 7b664affb7 AIX: Client-side .chatGenerate 2025-11-23 02:46:28 -08:00
Enrico Ros c411835f3b LLMs: listModels dispatch: cleanup 2025-11-23 02:36:08 -08:00
Enrico Ros 7b62c946a5 LLMs: Vendors: type the access 2025-11-23 02:35:59 -08:00
Enrico Ros 252e2fcd29 LLMs: Access extraction rewires 2025-11-23 02:28:38 -08:00
Enrico Ros aa2731bccc LLMs: Access extraction 2025-11-23 02:28:38 -08:00
Enrico Ros 282c439963 LLMs: CSF: UI configs 2025-11-23 02:28:38 -08:00
Enrico Ros e99459aba0 LLMs: CSF: vendors 2025-11-23 02:28:38 -08:00
Enrico Ros 4c35cbbe34 LLMs: CSF: access 2025-11-23 02:28:38 -08:00
Enrico Ros cab3537ae2 CSF: activate stubs 2025-11-23 02:28:38 -08:00
Enrico Ros c3f211389b AIX: edgeProcedure rename 2025-11-23 02:28:38 -08:00
Enrico Ros a4de84a842 AIX: dispatch: extract debugger / move security 2025-11-23 02:28:38 -08:00
Enrico Ros 2bf1eaaa0f Partially remove app.config 2025-11-23 01:57:29 -08:00
Enrico Ros 7f5ddd1629 Client stubs: env with log 2025-11-23 01:35:12 -08:00
Enrico Ros ed798fec65 Client stubs: PostHog 2025-11-22 19:09:38 -08:00
Enrico Ros 90386f5794 Client stubs: Env 2025-11-22 19:09:36 -08:00
Enrico Ros 8ada8811bf Build/Env: remove superfluous function 2025-11-22 19:09:33 -08:00
Enrico Ros b24badabef Revert "PostHog: server-client-safe import"
This reverts commit 2c8b713ff3.
2025-11-22 18:58:21 -08:00
Enrico Ros 4e20cb12cd Env: server-only naming 2025-11-22 16:38:05 -08:00
Enrico Ros 245da9e6cc App: server-client-safe vercel import 2025-11-22 16:30:36 -08:00
Enrico Ros a800b34aa7 App: prioritize posthog 2025-11-22 16:30:36 -08:00
Enrico Ros 50c3941f42 Posthog: client: cleanup 2025-11-22 16:30:36 -08:00
Enrico Ros 6e5d5ee36c Posthog: server: exceptions: trpc change 2025-11-22 16:25:50 -08:00
Enrico Ros 2c8b713ff3 PostHog: server-client-safe import 2025-11-22 16:06:00 -08:00
Enrico Ros 8162a6706d PostHog: server: add templates 2025-11-22 16:06:00 -08:00
Enrico Ros 952f6883fa PostHog: server: exceptions 2025-11-22 16:06:00 -08:00
Enrico Ros 373f3e3698 PostHog: server: cleanups 2025-11-22 16:06:00 -08:00
Enrico Ros 17791f631f AIX: extract consts for client-side bundles 2025-11-22 16:06:00 -08:00
Enrico Ros 6987c67cc7 AIX: Images: further resize the last generated image - can be very large, so >0.5M we resize 2025-11-21 03:11:08 -08:00
Enrico Ros 65a59e5d2d Blocks: Reasoning: mx 1.5 2025-11-21 01:58:34 -08:00
Enrico Ros 05b9a6d412 AIX: OpenAI: avoid prefixing with [summary] the reasoning 2025-11-21 01:41:16 -08:00
Enrico Ros 6608f4f164 AIX: Gemini: collapse empty text in between reasoning 2025-11-21 01:40:52 -08:00
Enrico Ros 93378ad6b0 VoidFragments -> ContentFragments 2025-11-21 01:19:23 -08:00
Enrico Ros bd4a60203e In-order rendering: transfer aux 2025-11-21 00:52:23 -08:00
Enrico Ros c9e6a62641 Fragments: Aux: add delete aux 2025-11-21 00:35:08 -08:00
Enrico Ros 68d797fa99 Fragments: Aux: show reasoning progress (assumed, not in the Aux fragment) and disable the 'realize' button while incomplete 2025-11-21 00:35:05 -08:00
Enrico Ros 08011d8cf2 ChatMessage: fix Void Fragment to text 2025-11-21 00:34:43 -08:00
Enrico Ros 2f91bf7f52 In-order rendering: cleanups 2025-11-21 00:27:52 -08:00
Enrico Ros d5182c05c1 In-order rendering of parts 2025-11-20 23:09:29 -08:00
Enrico Ros 8e0947a833 AIX: GG Transmitter: also faster body size estimation 2025-11-20 22:56:15 -08:00
Enrico Ros 1d88fc37b0 AIX: GG Transmitter: smaller echo request 2025-11-20 20:22:43 -08:00
Enrico Ros 46bd8e6f4d AIX debugger: wrap 2025-11-20 20:22:43 -08:00
Enrico Ros b95b427331 AIX debugger: unbreak layout 2025-11-20 20:22:43 -08:00
Enrico Ros 9b574c60eb Roll AIX 2025-11-20 19:52:32 -08:00
Enrico Ros a8b39cc0a4 LLMs: Gemini: support image size 2025-11-20 19:51:51 -08:00
Enrico Ros cdbc7dd9b8 AIX: Gemini: parse and recreate thought signatures 2025-11-20 19:11:42 -08:00
Enrico Ros 08dfec4fcf AIX: expend parts to include per-part vendor opaque information 2025-11-20 19:11:42 -08:00
Enrico Ros 7f4553225b AIX: transfer/reassemble per-fragment opaque vendor data 2025-11-20 19:11:42 -08:00
Enrico Ros f37e65a91e DMessageFragments: per-fragment vendor-specific 2025-11-20 19:11:42 -08:00
Enrico Ros c022f8a68c LLMs: improve params editing 2025-11-20 19:11:42 -08:00
Enrico Ros daa7a506a5 AIX: Gemini: include thoughts when thinking is requested 2025-11-20 19:11:42 -08:00
Enrico Ros f3dcf39c15 LLMs: Gemini: update cba 2025-11-20 17:01:39 -08:00
Enrico Ros 06cbef16d4 LLMs: Gemini: add Nano Banana Pro
LLMs: Gemini: show thoughts
2025-11-20 17:01:39 -08:00
Enrico Ros ab31bcd3e3 LLMs: Gemini: remove obsolete 2025-11-20 17:01:39 -08:00
Enrico Ros 563a99864f LLMs: showThoughts: default to false 2025-11-20 17:01:39 -08:00
Enrico Ros 39b8abc2c6 Fix: LLMs: cleanup stale userParameters 2025-11-20 17:01:39 -08:00
Enrico Ros f3dd837076 AIX: Gemini: UrlContext (fetching website) not supported by Nano Bananas 2025-11-20 17:01:39 -08:00
Enrico Ros d6b3a5259d fix eslint warnings 2025-11-20 17:01:39 -08:00
Enrico Ros 9fea1d5c64 LLMs: xAI: add Grok 4.1 Fast 2025-11-19 16:24:46 -08:00
Enrico Ros 0adb5355c7 Debug Wire: mark sessions 2025-11-19 15:44:13 -08:00
Enrico Ros 01d807b61e Browsing: allow time for screenshot 2025-11-19 15:44:10 -08:00
Enrico Ros 285bb812d0 LocalAI: fix list and virtualize 2025-11-19 14:43:56 -08:00
Enrico Ros d897155d6e LocalAI: fix gallery parsing 2025-11-19 14:43:53 -08:00
Enrico Ros 7154426279 packages: add react-virtual 2025-11-19 14:43:50 -08:00
Enrico Ros 4526084e4d roll packages 2025-11-19 12:20:18 -08:00
Enrico Ros 0c5c786ae3 Beam: starring selects 2025-11-19 12:14:47 -08:00
Enrico Ros 8a2c4aa356 useLLMSelect: show starred models only 2025-11-19 12:14:16 -08:00
Enrico Ros 4cba819edd State: show starred models only 2025-11-19 12:11:24 -08:00
Enrico Ros 4db42a2b29 StarIcons: improve 2025-11-19 12:11:21 -08:00
Enrico Ros fc0ee5b698 LLMs: fix OpenAI models overlap check 2025-11-19 12:11:11 -08:00
Enrico Ros 2c0c3f1c70 CC: zustand fix 2025-11-19 12:11:01 -08:00
Enrico Ros 3f3976b73c eslint: migrate to flat conf 2025-11-19 12:09:53 -08:00
Enrico Ros 82d5dcced5 LLMs: Gemini 3: fixed temperature, as Google Strongly Recommends to keep it at 1 2025-11-18 20:29:31 -08:00
Enrico Ros f4eaed694a LLMParametersEditor: allow code execution 2025-11-18 20:19:26 -08:00
Enrico Ros 05d9869326 Improve CG tool appearance 2025-11-18 20:19:09 -08:00
Enrico Ros 2675934ff8 LLMs: Gemini 3 - enable code execution 2025-11-18 20:10:01 -08:00
Enrico Ros fb6e19d3ea Roll AIX 2025-11-18 19:51:56 -08:00
Enrico Ros f1151d54e1 LLMs: Gemini 3.0 with Thinking Level, media Resolution, Google Search. Fixes #884 2025-11-18 19:51:55 -08:00
Enrico Ros 6a0fa4f9fa LLMs: Gemini Parameters 2025-11-18 19:44:16 -08:00
Enrico Ros 20d96fffc8 AIX: Gemini: wires upgrade 2025-11-18 19:42:29 -08:00
Enrico Ros ad6c06308a AIX: Gemini: -> thinkingLevel, -> mediaResolution, -> codeExecution, -> urlContext 2025-11-18 19:36:40 -08:00
Enrico Ros 84ee4171a4 AIX: Gemini: thought signature logger 2025-11-18 19:36:40 -08:00
Enrico Ros 6bc4f8a1e4 AIX: Gemini: wiretypes for thinkingLevel (param), urlContext (tool), thoughtSignature (fc invocation arg) 2025-11-18 19:36:40 -08:00
Enrico Ros 8876aa0866 RenderCode: reload button when html is rendered. Fixes #885 2025-11-18 19:36:39 -08:00
Enrico Ros 691d2e7228 Starring: models on top 2025-11-18 05:34:04 -08:00
Enrico Ros 7a12755de9 T2I: LocalAI: temporary mappings of models names and sizes to OpenAI gpt-image (GI) family equivalents
Maps OpenAI model IDs to LocalAI models:
- gpt-image-1 → stablediffusion
- gpt-image-1-mini → dreamshaper
- dall-e-3 → sd-3.5-large-ggml
- dall-e-2 → sd-3.5-medium-ggml
2025-11-18 04:36:09 -08:00
Enrico Ros 8573f56d03 T2I: Enable LocalAI, over 'stablediffusion' 2025-11-18 04:05:16 -08:00
Enrico Ros 8f3e683321 T2I: Azure disable WebP 2025-11-18 02:59:02 -08:00
Enrico Ros 64867b0b67 T2I: Azure support - LocalAI likely. Fixes #832 2025-11-18 02:59:01 -08:00
Enrico Ros e42d060e57 T2I: OpenAI: honor options 2025-11-18 02:55:47 -08:00
Enrico Ros 2ca9ab8a0c T2I: add options 2025-11-18 02:54:43 -08:00
Enrico Ros fdc0c6b371 T2I: propagate abort signal 2025-11-18 02:48:01 -08:00
Enrico Ros 8f8779c3cd Selection Operations: improve with fuzzy matching: matches more. Helps #881 2025-11-18 02:43:11 -08:00
Enrico Ros 851877ad8b LLMs: Azure: send the image_generation request anyway, for future compatibility - #832 2025-11-18 01:19:21 -08:00
Enrico Ros 8df74529ad LLMs: Azure: still inline image generation does not work - #832 2025-11-18 01:16:13 -08:00
Enrico Ros 353f51ebf0 LLMs: Azure: more explicitly named no-web-search var 2025-11-18 01:07:31 -08:00
Enrico Ros 6c5cb08118 ModelsList: fix list jumpiness on starring. Fixes #880. 2025-11-18 00:49:35 -08:00
Enrico Ros 54fee92b15 ModelsList: improve performance in large lists (e.g. OpenRouter) 2025-11-18 00:49:05 -08:00
Enrico Ros 776431c801 LLMs: Mistral: silence the off-by-2 warning 2025-11-17 14:58:49 -08:00
Enrico Ros 9f893ce999 LLMs: Groq: update models 2025-11-17 14:58:49 -08:00
Enrico Ros 820447670c LLMs registry: move 2025-11-17 05:05:27 -08:00
Enrico Ros b43c49cd64 Type annotation 2025-11-17 03:58:40 -08:00
Enrico Ros f9c3558975 Remove warning 2025-11-17 03:49:03 -08:00
Enrico Ros 1b75250824 LLMs: enumeration 2025-11-17 03:43:10 -08:00
Enrico Ros 3fa3bb5d03 LLMs: Central Dispatch 2025-11-17 03:29:40 -08:00
Enrico Ros ef0ff55f1f LLMs: extract LocalAI, LMStudio 2025-11-17 01:42:26 -08:00
Enrico Ros 66aa8ed177 Composer: autoFocus only on Desktop 2025-11-17 00:27:52 -08:00
Enrico Ros 519286bc69 DocumentAttachmentFragments: doc number limiter 2025-11-17 00:13:46 -08:00
Enrico Ros 9882f45fd2 DocumentAttachmentFragments: rename into pane 2025-11-17 00:09:31 -08:00
Enrico Ros 634f6216a0 Roll packages 2025-11-17 00:07:08 -08:00
Enrico Ros 69574a7d1c AIX: OpenRouter: fix reasoning summaries 2025-11-16 04:36:41 -08:00
Enrico Ros eddd4b9be8 ModelsServiceSelector: improve +1 2025-11-16 04:36:26 -08:00
Enrico Ros 9a9c31ff53 DocumentAttachmentFragments: performance: memo 2025-11-16 04:36:06 -08:00
Enrico Ros 41ee7a1c85 Nav support: remessage 2025-11-15 13:35:26 -08:00
Enrico Ros 2f9bbf373c Nav support: with AI triage 2025-11-15 13:26:30 -08:00
Enrico Ros d662e10ebb Support: with AI triage 2025-11-15 13:23:42 -08:00
Enrico Ros cd31092333 Update AI triage 2025-11-15 13:23:31 -08:00
Enrico Ros 1eae7ab6f3 Update AI triage 2025-11-15 13:15:51 -08:00
Enrico Ros ba378f852f Tryout AI triage 2025-11-15 12:51:56 -08:00
Enrico Ros 5cfd1e557d Update README 2025-11-15 12:46:39 -08:00
Enrico Ros df31d79eaf Update README 2025-11-15 12:21:54 -08:00
Enrico Ros 12d7304325 Update README 2025-11-15 12:19:38 -08:00
Enrico Ros 41424cbdfd Update README 2025-11-15 12:17:06 -08:00
Enrico Ros 05dda519a2 CC: add tree 2025-11-15 12:04:07 -08:00
Enrico Ros 120d39282e Add screenshot section to README
Added a section for screenshots and usage examples.
2025-11-15 11:40:34 -08:00
Enrico Ros 8e7d0fd13b Update README 2025-11-15 11:32:21 -08:00
Enrico Ros 3d979fdfbb Update Docker deployment link in README 2025-11-15 01:58:19 -08:00
Enrico Ros 6ab47ae3cb Sweet readme
Updated README to enhance project visibility and features.
2025-11-15 01:56:56 -08:00
Enrico Ros a4977b4924 AI Inspector: 5Hz updates limit 2025-11-14 17:42:43 -08:00
Enrico Ros bac9c692b8 AI Inspector: toggle visibility on shortcut 2025-11-14 17:27:23 -08:00
Enrico Ros 6ab15356e1 AI Inspector: show Desktop shortcut 2025-11-14 17:19:21 -08:00
Enrico Ros 73cc7121c3 CC: add npx eslint 2025-11-14 15:40:34 -08:00
Enrico Ros 1aeef06f49 AppBeam: nits 2025-11-14 01:51:45 -08:00
Enrico Ros 3b16bcf01d OpenAI: officially updated 5.1 models 2025-11-14 01:15:09 -08:00
Enrico Ros f6351fda41 Ph: +Megaphone 2025-11-13 01:49:34 -08:00
Enrico Ros 007e91480d Roll posthog 2025-11-13 01:49:00 -08:00
Enrico Ros 163ef9296e HBWA: note 2025-11-13 01:48:18 -08:00
Enrico Ros fa042f7d68 AIX: prenorm 2025-11-12 22:53:50 -08:00
Enrico Ros 8a11040dde optimization: faster 'hasKeys' Object non-emptiness check (avoids allocation) 2025-11-12 22:40:46 -08:00
Enrico Ros a88971d557 Models: Vendor (service) selection: vastly improve 2025-11-12 17:45:34 -08:00
Enrico Ros 5867e5fcc5 OpenRouter: more config beauty 2025-11-12 17:32:43 -08:00
Enrico Ros 20e587d6d3 OpenRouter: more config cleanup 2025-11-12 16:58:14 -08:00
Enrico Ros 6bfa8471cd Models Modal: option for fullscreen 2025-11-12 16:01:14 -08:00
Enrico Ros 5c10bce2f4 ModelsList: disable the large popups 2025-11-12 15:53:52 -08:00
Enrico Ros f1663f6668 DataStreamViz: optimize with intersection observers for 1k Beams 2025-11-12 15:39:15 -08:00
Enrico Ros 90c27e0e74 LLMs: add displayGroup 2025-11-12 15:31:32 -08:00
Enrico Ros b5eac0d907 OpenRouter: more config improvement 2025-11-12 15:20:30 -08:00
Enrico Ros 4eabe2cb3a Roll AIX 2025-11-12 15:11:17 -08:00
Enrico Ros a1c0d30a06 LLMs: GPT-5 optimistic updates (coming later this week) 2025-11-12 15:10:50 -08:00
Enrico Ros 63c9f65040 Merge remote-tracking branch 'opensource/claude/issue-879-20251112-2245' 2025-11-12 15:05:01 -08:00
Enrico Ros f58a066bff OpenRouter: improve config 2025-11-12 15:02:21 -08:00
Enrico Ros 952ea6357a tRPC: newline on dev warnings 2025-11-12 14:57:59 -08:00
Enrico Ros 6695973035 tRPC: raise the correct server-side exceptions 2025-11-12 14:57:44 -08:00
claude[bot] 3dc28635f4 Add support for GPT-5.1 models
- Add gpt-5.1-2025-11-12 (GPT-5.1 Thinking) with adaptive reasoning
- Add gpt-5.1-chat-latest (GPT-5.1 Instant) with adaptive reasoning
- Both models include full feature set: chat, vision, function calling, JSON, prompt caching, reasoning, web search
- Pricing set to match GPT-5 (to be updated when official pricing is announced)
- Added models to manual ordering list for proper UI sorting

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2025-11-12 22:47:51 +00:00
Enrico Ros 0bde01a85f Llms: vector ops 2025-11-12 14:43:26 -08:00
Enrico Ros b9840c2074 Warn downgrade 2025-11-12 14:43:23 -08:00
Enrico Ros 8228a76875 LLMs: LocalAI enumeration less verbose 2025-11-11 18:40:52 -08:00
Enrico Ros 46b370a2e3 AIX: OpenAI ChatGenerate: fixup for malformed NS objects - best-effort 2025-11-11 18:37:34 -08:00
Enrico Ros 820e9513ba Fragments: Void reasoning priority 2025-11-11 18:22:14 -08:00
Enrico Ros bd71d64db3 LLMs: OpenRouter: fix context window max_tokens issue 2025-11-11 18:09:50 -08:00
Enrico Ros 9d4baf827c LLMs: OpenRouter: auto-detection of capabilities (i/o modalities, features, etc). Thanks OpenRouter, you're the best! 2025-11-11 17:47:31 -08:00
Enrico Ros d6843d7fcf AIX: OpenRouter: write/parse the new reasoning request / reasoning_details response
Removes older system(s) from OpenRouter.
2025-11-11 17:46:58 -08:00
Enrico Ros babb1dd962 LLMs: OpenRouter: parser 2025-11-11 16:46:13 -08:00
claude[bot] aa32e396a7 Tryfix speech recognition on Android 2025-11-11 14:07:51 -08:00
Enrico Ros 1068efcb49 CC: commands: openrouter 2025-11-11 14:01:08 -08:00
Enrico Ros 576c7f1458 CC: improve triage 3 2025-11-11 13:48:20 -08:00
Enrico Ros 37c857b055 CC: improve triage 2 2025-11-11 13:37:46 -08:00
claude[bot] 794dfb44d1 Add date to README
Added today's date (2025-11-11) to README.md as requested in issue #876.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-11 21:34:04 +00:00
Enrico Ros 929bb6dc66 CC: improve triage 2025-11-11 13:32:34 -08:00
Enrico Ros 28337e31eb Token Badge: show for half a cent+ 2025-11-11 12:48:32 -08:00
Enrico Ros 09a38c0e4b UserInputChecklistComponent: fix crash 2025-11-11 12:30:02 -08:00
Enrico Ros 645b8fb9cd LLMs: OpenRouter: fix 400 2025-11-11 12:14:37 -08:00
Enrico Ros 541588948c LLMs: OpenRouter: add future PKCE 2025-11-11 12:13:09 -08:00
Enrico Ros bdd6fcfbbc Ph: add Gift/Key 2025-11-11 12:13:08 -08:00
Enrico Ros 9e50286c66 LLMs: Moonshot: remove images from the request for those models 2025-11-10 02:00:42 -08:00
Enrico Ros 418e4649dc Analytics: Client send 2025-11-10 00:59:28 -08:00
Enrico Ros 4a70f20f4a Roll packages
Note: fix Posthog/nextjs-config to 1.3.2 because starting from 1.3.3 there are new dependencies not evaluated yet
2025-11-10 00:33:28 -08:00
Enrico Ros d6eabfcb6d Roll posthog 2025-11-10 00:27:07 -08:00
Enrico Ros d88889d760 Extra: disable 0.1 opacity; makes the code less usable 2025-11-10 00:07:27 -08:00
Enrico Ros 85146d8af0 LLMs: Moonshot: native search parameter (disabled here and in the model - upstream not good enough) 2025-11-09 20:16:39 -08:00
Enrico Ros 9612572f07 AIX: Moonshot: native search dispatch/parse support 2025-11-09 20:15:02 -08:00
Enrico Ros 4bb1dddf4d ApproximateCosts: only display on Extra 2025-11-09 20:15:02 -08:00
Enrico Ros b066a86962 AIX: fix exhaustive check 2025-11-09 20:15:02 -08:00
Enrico Ros 6086455782 LLMs/AIX: Moonshot AI support 2025-11-09 20:15:02 -08:00
Enrico Ros 9020b3cbad AIX: Fix Mistral's response breakage in NS mode - Fixes #873. 2025-11-09 17:23:07 -08:00
Enrico Ros 5822dea270 AIX: Further breakdown logging levels - with Retry entry points. 2025-11-09 17:14:55 -08:00
Enrico Ros c445f59664 AIX: Fix Logging, with warn only for real server-side solvable issues. Fixes #872. Fixes #874.Fixes #875 2025-11-09 17:07:14 -08:00
Enrico Ros 737e4cb4f9 AIX - fix Gemini parser. Fixes #871 2025-11-09 16:31:52 -08:00
Enrico Ros dba7368d01 Rename platform format 2025-11-07 23:35:29 -08:00
Enrico Ros 314c4cd8cc Error correction: render placeholder notification 2025-11-07 23:11:31 -08:00
Enrico Ros 3e46f99e14 AIX: Error correction: client-side reporting 2025-11-07 22:45:11 -08:00
Enrico Ros e0cc552b8d AIX: Error correction: operation reporting (Anthropic only for now, but generic) 2025-11-07 22:45:11 -08:00
Enrico Ros 6b5be403af AIX: Error correction: dispatch reporting 2025-11-07 22:45:11 -08:00
Enrico Ros 269d5989bc Fetchers: Error correction: callback before retry 2025-11-07 22:43:12 -08:00
Enrico Ros edfe3d9b65 DFragments: Error correction: keep results in placeholder fragments (backward-comp) 2025-11-07 22:43:12 -08:00
Enrico Ros ffb2c42a26 imageUtils: non-aliased rescaling, multi-pass for downscales 2025-11-07 14:11:23 -08:00
Enrico Ros b7de19b020 AIX: turn on operation-retrier for Anthropic's error-in-stream. Fixes #869 2025-11-07 12:03:16 -08:00
Enrico Ros 77cd659b39 AIX: support operation-level retrier with reassembly wipe #869 2025-11-07 12:03:12 -08:00
Enrico Ros fbba9d8357 Block parts: show the retry-followup 2025-11-07 12:02:13 -08:00
Enrico Ros f464a9efdf LLMs: listModels: openAI, deprio 401. Fixes #870 2025-11-07 11:42:22 -08:00
Enrico Ros 7ec4290582 AIX: Anthropic: retriable errors: extended parsing 2025-11-07 11:04:21 -08:00
Enrico Ros 3f887a1d3a AIX: Gemini: fix other proxy errors. Fixes #868 2025-11-07 10:50:25 -08:00
Enrico Ros ffd76dc587 *Image Captioning with a dedicated (configurable) model. Fixes #862 2025-11-05 14:15:50 -08:00
Enrico Ros d7f3594a73 FormInputKey: smaller key 2025-11-05 12:32:18 -08:00
Enrico Ros 32fa5f206b AudioGenerator: underwater 2025-11-05 10:41:53 -08:00
Enrico Ros 70d2c09e81 Env: drop requirements on non-staging/non-prod 2025-11-05 09:01:47 -08:00
Enrico Ros 17f03806d0 AIX: rename DMessage gen to throw 2025-11-05 07:07:05 -08:00
Enrico Ros b6aba0efa4 AIX: type change to DMessageGuts for correctness 2025-11-04 17:01:26 -08:00
Enrico Ros 65a5e06935 DMessage: Generator helpers 2025-11-03 14:53:53 -08:00
Enrico Ros f459cb9805 PhPencilSimple: add 2025-11-03 09:04:28 -08:00
Enrico Ros f5470aca5d AIX: OpenAI Responses: remove patch for deep-research 2025-11-02 20:12:07 -08:00
Enrico Ros c26af97fe7 LLMs: OpenAI: deep research with standard params path 2025-11-02 20:11:17 -08:00
Enrico Ros 766ec458a2 CC: add permission 2025-11-02 15:43:28 -08:00
Enrico Ros 48ff78580c typos 2025-11-02 13:49:12 -08:00
Enrico Ros 396f7524d7 Panel Resize: rename inset 2025-11-01 18:08:31 -07:00
Enrico Ros da19ef42f5 AIX: OpenAI Responses: use the web_search (non preview) tool 2025-11-01 17:19:57 -07:00
Enrico Ros 91abe5aa43 DLLM: FC/FR: rendering improvement 2025-11-01 12:05:37 -07:00
Enrico Ros 682435321b DLLM: FC/FR: rendering improvement 2025-11-01 12:01:23 -07:00
Enrico Ros 76f0d60224 LLMs/AIX: Gemini: computer use comment 2025-11-01 11:28:34 -07:00
Enrico Ros 628b88ef9f LLMs/AIX: Gemini: computer use test 2025-11-01 11:18:12 -07:00
Enrico Ros 6a792814ce LLMs/AIX: Gemini: computer use mixed-tool (hosted + client) test. 2025-11-01 11:09:41 -07:00
Enrico Ros 05ce15d677 LLMOptionsModal: auto-open if user pricing is set or custom tokens are set 2025-11-01 11:02:26 -07:00
Enrico Ros 4a9d0d4f8e LLMs: fix post user-pricing, #860 2025-11-01 10:54:55 -07:00
Enrico Ros 16f0552682 CC: unbreak settings 2025-11-01 10:34:04 -07:00
claude[bot] 9e3819b9c7 feat: Add user pricing override for hypothetical cost tracking
Add userPricing field to DLLM interface following the established pattern
for user overrides (similar to userContextTokens and userMaxOutputTokens).

This enables users to set custom pricing for local models (Ollama, LM Studio, etc.)
to track "what if" costs and compare with cloud models.

Changes:
- Added userPricing field to DLLM interface (llms.types.ts)
- Added getLLMPricing() getter function with override precedence
- Updated store to preserve userPricing during model updates
- Updated all llm.pricing access points to use getLLMPricing()
- Added pricing override UI in LLMOptionsModal (Details section)
  - Input price ($/M tokens)
  - Output price ($/M tokens)
  - Reset buttons for each field
- Cost calculations automatically use user pricing when set
- Existing cost display in tooltips works with user pricing

Resolves #860

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2025-10-31 08:37:58 +00:00
Enrico Ros 233a0d4b35 LLMs: xAI: rerank 4 2025-10-29 15:53:41 -07:00
Enrico Ros bd95b808ae InlineTextareaEditable: click to edit 2025-10-29 13:53:59 -07:00
Enrico Ros 96132c4585 Export: subordinate tslug warn to the downgrade 2025-10-29 09:50:12 -07:00
Enrico Ros 3edacef572 Export: messaging 2025-10-29 09:43:34 -07:00
Enrico Ros 36889c1695 Export: robustness 2025-10-29 09:30:45 -07:00
Enrico Ros cd2c6c1d8f Export: nits 2025-10-29 09:29:06 -07:00
Enrico Ros d8c78b1a00 Export: disable when beam open, as it's not exported for now 2025-10-29 09:28:18 -07:00
Enrico Ros 74a22c26cf Export: relax file export 2025-10-29 09:26:55 -07:00
Enrico Ros f742eba4c1 Ph: Terminal Icon 2025-10-29 09:11:01 -07:00
Enrico Ros 36c2812157 Export: warn on import from a downgrade or different tenant 2025-10-29 09:07:07 -07:00
Enrico Ros d353fc4c63 Export: save file variant 1 2025-10-29 08:54:05 -07:00
Enrico Ros 98bd3d6da0 LLMs: Ollama: Update models 2025-10-28 16:36:43 -07:00
Enrico Ros cd5ec8d295 LLMs: Perplexity: Update models 2025-10-28 16:34:24 -07:00
Enrico Ros f91c6456bd LLMs: xAI: Update models 2025-10-28 16:31:53 -07:00
Enrico Ros 67af87968e workflows: CC: ollama update 2025-10-28 16:30:48 -07:00
Enrico Ros 58ea3e1b35 workflows: CC: permissions 2025-10-28 16:27:15 -07:00
Enrico Ros a9435c10e8 LLMs: OpenPipe: Update models 2025-10-28 16:23:40 -07:00
Enrico Ros a86860fe76 LLMs: Groq: Update models 2025-10-28 16:19:40 -07:00
Enrico Ros a3d707f78a LLMs: Mistral: Update models 2025-10-28 16:17:34 -07:00
Enrico Ros c502426249 LLMs: Anthropic: Update models 2025-10-28 16:17:06 -07:00
Enrico Ros 2fb5ffcecf LLMs: Anthropic: remove retired Claude 2 models 2025-10-28 16:09:36 -07:00
Enrico Ros 6d995c1253 LLMs: Anthropic: remove retired Sonnet 3 models 2025-10-28 16:08:39 -07:00
Enrico Ros a860c1c490 LLMs: Anthropic: remove retired Sonnet 3.5 models - So long and thanks!! 2025-10-28 16:06:42 -07:00
Enrico Ros 481d9cc745 LLMs: Anthropic: only display 'obsoleted models' in 2025-10-28 16:03:02 -07:00
Enrico Ros 7e53a7bc2b Server: tRPC: Retriers: carve0out 429 quota 2025-10-28 15:59:05 -07:00
Enrico Ros 4df10e3782 Lint 2025-10-28 15:59:05 -07:00
Enrico Ros 396da65178 AIX: OpenRouter: don't display processing messages 2025-10-28 15:49:37 -07:00
Enrico Ros 87e8faf383 workflows: docker: limit to 1hr 2025-10-28 13:11:28 -07:00
Enrico Ros 9eb3e6d398 workflows: CC: raise to 30min 2025-10-28 13:11:21 -07:00
Enrico Ros 332c4fdf82 2 Open - Release 2025-10-27 19:26:18 -07:00
Enrico Ros 4d247344d5 2 Open: end changelog 2025-10-27 18:42:48 -07:00
Enrico Ros 4e4738d4f6 2 Open: dissolve 1.17 2025-10-27 18:31:31 -07:00
Enrico Ros dbfa7b0932 2 Open: TechLevels 2025-10-27 18:26:49 -07:00
Enrico Ros e90231d58d Roll AIX 2025-10-27 18:26:08 -07:00
Enrico Ros 9bc7d40425 2 Open: 200 2025-10-27 18:25:36 -07:00
Enrico Ros d2d5c0621b CC update 2025-10-27 18:24:50 -07:00
Enrico Ros e41d57c914 Update README.md 2025-10-27 18:24:50 -07:00
Enrico Ros 7c5336cba3 2 Open: Link to live changes. 2025-10-27 16:48:42 -07:00
Enrico Ros d041e4e2bf AIX: message the tRPC < 11.6 edge disconnections 2025-10-27 15:24:09 -07:00
Enrico Ros 7fba6255ff AIX: operation-level retry shall loop with abortSignal (to let the next iteration respond appropriately) rather than re-throw the RequestRetry which the tRPC router wouldn't know what to do with 2025-10-27 15:19:20 -07:00
Enrico Ros dc226d9ac0 AIX: Anthropic: support to retry on overloaded 2025-10-27 15:08:10 -07:00
Enrico Ros c01a937d7d AIX: operation-level retry (chatGenerate) with RequestRetry errors which can be thrown by parsers, if allowed 2025-10-27 15:07:41 -07:00
Enrico Ros ee6646a66f Server: abortable delay 2025-10-27 14:32:43 -07:00
Enrico Ros b73aa16001 tRPC: lock to 11.5.1 for the time being, because of #857 2025-10-27 13:55:46 -07:00
Enrico Ros 92c875459a Keep sourcemaps for Open debugging 2025-10-27 11:57:25 -07:00
Enrico Ros 011fbbe834 Try bisecting to a 11.4.4 tRPC in the context of #857 2025-10-27 11:41:34 -07:00
Enrico Ros a921ea6fe5 Try upgrading to a 11.7 tRPC in the context of #857 2025-10-27 11:36:22 -07:00
Enrico Ros 82bcc6d5d5 Try reverting to a 11.6 tRPC in the context of #857 2025-10-27 11:27:42 -07:00
Enrico Ros f6d52da034 Try reverting to a pre-11.7 tRPC in the context of #857 2025-10-27 11:15:37 -07:00
Enrico Ros cd3159cacf Sherpa: reduce debug 2025-10-27 10:54:38 -07:00
Enrico Ros 1af4e18cb3 Help debugging #857 2025-10-27 10:31:31 -07:00
Enrico Ros 7b6eb94bf7 Help debugging #857 2025-10-27 10:30:37 -07:00
Enrico Ros 8cc6d65dd4 Help debugging #857 2025-10-27 10:16:23 -07:00
Enrico Ros 54e5f9a1bc Server: listModels: improve print 2025-10-27 10:11:27 -07:00
Enrico Ros fa28305141 AIX: un-warn dispatch-fetch issues, as they're Error Corrected extensively 2025-10-27 09:59:00 -07:00
Enrico Ros 1e56b36eae Server: move retriers, as it outgrew the original AIX.chatGenerate purpose 2025-10-27 09:58:46 -07:00
Enrico Ros e2253cde7f Server: tRCP fetchers & retrier: downgrade logging severity 2025-10-27 09:44:21 -07:00
Enrico Ros 6a4bfc1cf2 server: improve message 2025-10-25 15:27:25 -07:00
Enrico Ros dfc0d5088d AIX: retrier: reduce to 2 retries 2025-10-25 13:24:09 -07:00
Enrico Ros 8f154305e9 AIX: server: activate logging for auto-retry 2025-10-24 19:58:17 -07:00
Enrico Ros 09b96a01bf AIX: server: upstream auto-retry 2025-10-24 19:57:55 -07:00
Enrico Ros 1ce0c631b4 Browse: strings 2025-10-24 15:52:52 -07:00
Enrico Ros 61a5b6d5eb Improve strings 2025-10-24 15:52:36 -07:00
Enrico Ros ca62bad217 LLMs: OpenRouter: improve only-free 2025-10-24 14:40:31 -07:00
Enrico Ros 13f352a901 Setups: upgraded descriptions 2025-10-24 14:40:30 -07:00
Enrico Ros 775af756fd Button Beam Mobile: outlined 2025-10-24 13:51:12 -07:00
Enrico Ros 5c4545877d Composer: 0.5 shorter 2025-10-24 13:51:08 -07:00
Enrico Ros 9c820dcaf1 AppChat: breathing room on mobile/bottom 2025-10-24 13:51:06 -07:00
Enrico Ros 49f0bf4802 AIX: Gemini: finish reason typesafety 2025-10-24 13:03:23 -07:00
Enrico Ros fbb2f106f0 tRPC: edge procedure (semantics) 2025-10-24 12:41:15 -07:00
Enrico Ros cb46d3d536 AIX: extract the CG executor 2025-10-24 12:32:22 -07:00
Enrico Ros 84289c4ade AIX: dispatch: move IParticleTransmitter to the parsers, while impl is up 2025-10-24 11:06:52 -07:00
Enrico Ros b35ffd9983 AIX: router: move echo out of connection loop 2025-10-24 11:05:15 -07:00
Enrico Ros 8197fed036 server: fetchers: explained the tRPC error 2025-10-24 10:57:34 -07:00
Enrico Ros f6c40cdce6 AIX: Gemini: finish reason parser: compress 2025-10-24 10:53:30 -07:00
Enrico Ros b8cca72cf1 server: fetchers: errors: remove cause for security 2025-10-24 10:36:39 -07:00
Enrico Ros d20cafa22b server: fetchers: errors: match Vercel's edge runtime error messages (absence of _cause) 2025-10-24 10:31:51 -07:00
Enrico Ros 421a5ae681 server: report the error cat/codes to the client 2025-10-23 20:10:33 -07:00
Enrico Ros 49157b9efa server: fetchers: redo all with good error detection 2025-10-23 20:10:03 -07:00
Enrico Ros c11684a9cf server: improve error formatting, removing unneeded stacks 2025-10-23 20:08:02 -07:00
Enrico Ros 12aa812b37 server: improve safeErrorString 2025-10-23 20:08:02 -07:00
Enrico Ros 3667425c61 AIX: Refactor - Client - Retry State Machine 2025-10-23 05:25:10 -07:00
Enrico Ros fd0ab93744 AIX: Refactor - Client - Retry & Resume
This refactor allows for low-level looping on the client side.

This can be used for network errors between server<>upstream reported as particles,
as well as for client<>server connections.

One special case of this is the OpenAI system to reattach to detached (background) requests,
or as an alternative to re-fetch them from the server once completed.
2025-10-23 04:26:06 -07:00
Enrico Ros a0b549855f AIX: Refactor - Router - Composable & Resumable
This refactor decomposes the chatGeneration procedure into composable blocks.

Allows for instance chatGeneration-like outputs from different inputs,
allowing for instance `resumability` of a background connection.

Moreover this reorganizes the phases of a CG operation, and includes a generic executor
that takes creator functions for Dispatchers.
2025-10-23 04:11:05 -07:00
Enrico Ros c70c89c2e8 AIX: Client: error as message in ContentReassembler promise chain 2025-10-23 04:11:05 -07:00
Enrico Ros 32c5c00d55 AIX: Client: error classification 2025-10-23 04:11:05 -07:00
Enrico Ros 013d0e0217 AIX: pre-refactor nits 2025-10-23 04:11:05 -07:00
Enrico Ros f0bf866654 Anthropic Skills: on extra 2025-10-23 01:47:40 -07:00
Enrico Ros 2c14cb1113 nit 2025-10-23 01:32:13 -07:00
Enrico Ros 15abecfbb6 LLMs: OpenRouter: add the haiku 4.5 thinking variant 2025-10-23 01:02:52 -07:00
Enrico Ros 827d64d49a remove icon 2025-10-23 01:02:52 -07:00
Enrico Ros 01c45b2286 Anthropic Skills: improve config 2025-10-23 01:02:24 -07:00
Enrico Ros d3e5c196f9 LLMs: remove vendorspec from the params editor 2025-10-23 00:36:22 -07:00
Enrico Ros 71978b94f2 Fragments: support placeholders of 'code-exec' type 2025-10-23 00:11:18 -07:00
Enrico Ros 79da87d823 AIX/LLMs: Anthropic: Skills: improve reporting of steps 2025-10-23 00:06:58 -07:00
Enrico Ros 1c19f36783 AIX/LLMs: Anthropic: improve spec 2025-10-22 23:58:13 -07:00
Enrico Ros a4d4e351e5 AIX/LLMs: OpenRouter search 2025-10-22 23:18:55 -07:00
Enrico Ros 45ef2afccb LLM Options: support 'all from vendor X' 2025-10-22 23:10:12 -07:00
Enrico Ros 9ef5b61722 AIX: Anthropic: Skills: parser fix 2025-10-22 22:53:57 -07:00
Enrico Ros ff008d1034 AIX: Anthropic: Parser: event sequence debugger 2025-10-22 22:39:01 -07:00
Enrico Ros 3cd38f471e DMessage: session draft 2025-10-22 22:32:44 -07:00
Enrico Ros 1581d46be7 AIX: Anthropic Skills dispatch / parse 2025-10-22 22:32:43 -07:00
Enrico Ros 32571e15eb LLMs: Anthropic: Custom Skills CRUD 2025-10-22 22:13:36 -07:00
Enrico Ros d69adaa6af LLMs: Anthropic Skills model params editor 2025-10-22 22:13:36 -07:00
Enrico Ros 246968098a LLMs: Anthropic Skills headers 2025-10-22 21:45:51 -07:00
Enrico Ros 861c4ef370 LLMs: Anthropic Skills model params 2025-10-22 21:45:35 -07:00
Enrico Ros bfe94e98f2 Anthropic: fix old-school get/post 2025-10-22 20:55:30 -07:00
Enrico Ros 9152318ef6 Merge pull request #855 from enricoros/claude/issue-829-20251022-2344
fix(call): propagate speech recognition errors to UI
2025-10-22 17:02:53 -07:00
claude[bot] 302694bdad fix(call): propagate speech recognition errors to UI
- Read recognitionState.errorMessage in Telephone component
- Pass error message to CallStatus component
- Display specific error messages instead of generic fallback
- Matches error handling pattern used in Chat/Composer

This ensures users see detailed error messages instead of generic
Browser may not support text.

Fixes #829 by making speech recognition errors visible to users.

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2025-10-22 23:47:26 +00:00
claude[bot] 14602a1411 LLMs: add user override for context window and max output tokens. Fixes #853 2025-10-22 14:57:55 -07:00
Enrico Ros 044baa5fc2 Starring: improve starring in models modal 2025-10-22 14:10:38 -07:00
Enrico Ros 3fa09194a7 LLM Options: reset to defaults on Mobile 2025-10-22 14:10:14 -07:00
Enrico Ros d3aa10f9d1 LLM Options: reset to defaults 2025-10-22 14:09:47 -07:00
Enrico Ros e2b2d5974f AIX: Gemini: detect internal 503s 2025-10-22 04:16:19 -07:00
Enrico Ros d99668aa40 AIX: fix Openrouter parsing 2025-10-22 03:56:12 -07:00
Enrico Ros 5f8d5678fa AIX: server: improve listModel errors 2025-10-22 03:07:39 -07:00
Enrico Ros 14f245df2b AIX: server: improve listModel errors 2025-10-22 02:44:24 -07:00
Enrico Ros f104fb64fd LLMs: Anthropic: update cache costs for 1M models 2025-10-21 02:06:43 -07:00
Enrico Ros 3c2d7a636a LLMs: Perplexity: remove globes from models 2025-10-21 01:35:45 -07:00
Enrico Ros 31b215e58b Roll AIX 2025-10-21 01:24:52 -07:00
Enrico Ros 53ae177396 LLMs: net-dependent Context computation 2025-10-21 01:24:52 -07:00
Enrico Ros 3e1bb3bb3d LLMs: Anthropic: show search caps 2025-10-21 01:24:52 -07:00
Enrico Ros eac150f590 LLMs: Anthropic: sort correctly 2025-10-21 01:24:52 -07:00
claude[bot] 5466b8a265 *LLMs: Add support for Anthropic 1M token context window
Fixes #852
2025-10-21 01:24:52 -07:00
Enrico Ros c3d10c355f *Improved keyboard/focus navigation on menus 2025-10-20 18:44:47 -07:00
Enrico Ros d96a8c14b9 CloseablePopup: assume trueish 2025-10-20 18:44:47 -07:00
Enrico Ros be94f31a85 AIX: Perplexity: parse the new undocumented chunk-like object. Fixes #851 2025-10-20 15:09:32 -07:00
Enrico Ros f7ce349125 AIX: Perplexity: parse and show costs. #851 2025-10-20 15:08:52 -07:00
Enrico Ros a4516b5fa6 Wire/server: pretty-print server-side Zod errors in return messages #851 2025-10-20 14:43:35 -07:00
Enrico Ros 7c1f30c3c7 workflows: CC: npm i hint 2025-10-19 14:51:38 -07:00
Enrico Ros df67be4b03 GC: identified issue with open beams 2025-10-19 14:49:22 -07:00
Enrico Ros 578bb93d8b Env: production helper fix 2025-10-19 14:45:50 -07:00
Enrico Ros b4c5a24864 Env: production helper 2025-10-19 12:53:27 -07:00
Enrico Ros c4a38a6cf6 LLMs: Anthropic: caching config below the fold 2025-10-19 12:40:44 -07:00
Enrico Ros e58f6cc48e Hidden state: use visible accelerator 2025-10-19 12:40:30 -07:00
claude[bot] 8a0c4747c7 fix: preserve model visibility across updates (complete fix)
Fixes #850 - Model visibility was being reset after app updates.

Root cause: User visibility changes were stored in `hidden` field instead of
`userHidden`, but the preservation logic only looked for `userHidden`. This
caused user preferences to be lost during model updates.

Changes:
- Added isLLMHidden() helper to compute effective visibility (userHidden ?? hidden)
- Fixed all write paths to set userHidden instead of hidden (3 files)
- Fixed all read paths to use isLLMHidden() (7 files, 14 locations)

This ensures:
- User preferences persist across updates
- Vendor visibility changes still propagate for untouched models
- Bulk operations work correctly

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2025-10-19 19:17:41 +00:00
Enrico Ros 8bef4b9aae FormSelectControl: minWidth can be redefined 2025-10-18 16:50:17 -07:00
Enrico Ros 66382ed980 AIX: Anthropic: Search/Fetch - done
NOTEs: this works without saving the server-side tool invocation and the subsequent responses
to AIX particles, and consequently to DMessageFragments of the opportune type.
-> Shall do it with execution graph fragments.
2025-10-18 14:51:21 -07:00
Enrico Ros 8984b65a51 AIX: Anthropic: Search: do not cite websites - too noisy 2025-10-18 14:30:49 -07:00
Enrico Ros efea6dafbd AIX: Anthropic: Fetch/Search: use placeholders until Tool execution graph abstraction 2025-10-18 14:20:55 -07:00
Enrico Ros 6d4d05e8f7 Roll AIX 2025-10-18 14:18:53 -07:00
Enrico Ros 560a07b4fe LLMs: Anthropic: compress 2025-10-18 14:18:24 -07:00
Enrico Ros fbaff3bde3 AIX: Anthropic: LLM param edit 2025-10-18 13:51:24 -07:00
Enrico Ros 2a01f929f1 AIX: Anthropic: wires nits 2025-10-18 13:20:16 -07:00
Enrico Ros d1d0c32a92 AIX: Anthropic: improve merge of #842 2025-10-18 12:37:45 -07:00
Enrico Ros 3a513e2a4d Merge remote-tracking branch 'opensource/claude/issue-842-20251018-0728' into m2 2025-10-18 12:37:41 -07:00
Enrico Ros 9b32c4b8c5 AIX: Anthropic: headers improvement 2025-10-18 12:37:30 -07:00
Enrico Ros 64542af5af Starring: also codify Emoji 2025-10-18 11:42:14 -07:00
Enrico Ros 1db35feeca Starring: also in useLLMSelect 2025-10-18 11:38:34 -07:00
Enrico Ros 7392063e25 Starring: centralized with styles and memo 2025-10-18 11:38:28 -07:00
Enrico Ros e6745b16f6 Panes: fix panes visibility 2025-10-18 10:41:32 -07:00
Enrico Ros be09b452f0 Panes: persist optima group expanded states 2025-10-18 10:41:07 -07:00
Enrico Ros 42588444a5 Optima Panels: controllable grouped list 2025-10-18 10:36:01 -07:00
Enrico Ros dc48bd1222 OptimaPanelGroupedList: nits 2025-10-18 10:34:37 -07:00
claude[bot] b59eb6cbfb feat: Add Anthropic web search and web fetch tools support
Implements comprehensive support for Anthropic's web search (web_search_20250305) and web fetch (web_fetch_20250910) tools.

- Add llmVndAntWebSearch and llmVndAntWebFetch parameters with ['auto', 'off'] options
- Enable tools for Claude 4.5, 4.1, 4, 3.7, 3.5 Sonnet/Haiku/Opus models (including thinking variants)
- Inject web_search_20250305 and web_fetch_20250910 tools based on parameter values
- Configure web search with max_uses=5 for progressive searches
- Configure web fetch with max_uses=5 and citations enabled
- Add dynamic beta header injection for web fetch (web-fetch-2025-09-10)
- Add UI controls in model settings for easy parameter configuration

Parser already supports web_search_tool_result and web_fetch_tool_result blocks (no changes needed).

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Enrico Ros <enricoros@users.noreply.github.com>
2025-10-18 07:34:51 +00:00
Enrico Ros a75a31ff04 AIX: Anthropic: Parser: improve 2025-10-18 00:05:09 -07:00
Enrico Ros a0f97e9cd8 AIX: Anthropic: major protocol update 2025-10-17 23:40:31 -07:00
Enrico Ros fe6e7245de AIX: Anthropic: remove vnd.ant.tools for upgrade soon 2025-10-17 23:13:29 -07:00
Enrico Ros a46a9bf76c fix: #845 - no permissions on attachments on Edge 2025-10-17 15:45:04 -07:00
claude[bot] 925e500dc2 fix: simplify benign DOM error handling
- Use console.warn for benign removeChild errors
- Skip PostHog reporting for these errors
- More succinct implementation

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2025-10-17 15:00:26 -07:00
claude[bot] 22f0a70272 fix: suppress tiktoken WebAssembly loading errors from PostHog 2025-10-17 14:54:20 -07:00
Enrico Ros 220cc60f7d workflows: CC: DM to sonnet 2025-10-17 04:56:53 -07:00
Enrico Ros 3964fca4b2 cc: allow mcps 2025-10-16 12:18:23 -07:00
Enrico Ros 8fdbb21300 Roll AIX 2025-10-16 12:17:33 -07:00
Enrico Ros c42c9545d2 slashcommands: /aix:roll-aix 2025-10-16 12:16:56 -07:00
Enrico Ros 0de37e337b LLMs: OpenAI: added GPT-5'class models the no-stream option for unverified orgs 2025-10-16 11:37:27 -07:00
Enrico Ros 3ecf7f6016 Merge remote-tracking branch 'opensource/claude/issue-847-20251016-1807' 2025-10-16 11:35:16 -07:00
Enrico Ros da7a62945c workflows: CC: enable reviews and triaging, not DMs 2025-10-16 11:32:37 -07:00
Enrico Ros c876390e27 workflows: CC: enable other users 2025-10-16 11:14:22 -07:00
claude[bot] 9bbc2a2e00 fix: Add llmForceNoStream parameter to OpenAI models with Org ID streaming issues
- Added parameter to GPT-5, GPT-5 Pro, GPT-5 Codex, and GPT-5 Mini
- These models require Organization ID verification for streaming
- Kept existing parameter on o3 and o3-pro models
- Did not modify GPT-5 Nano, GPT-5 Chat Latest, GPT-4.1, or GPT-4o models which work fine

Fixes #847

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2025-10-16 18:09:53 +00:00
Enrico Ros 2b18cbc3b9 Errors: cleanup domains 2025-10-16 10:56:32 -07:00
Enrico Ros 388391ddae Errors: remove unused var 2025-10-16 10:46:03 -07:00
Enrico Ros 3e4e6b2f4b BackupRestore: fix a potential removal issue 2025-10-16 10:28:37 -07:00
Enrico Ros e6a65bdf8e BackupRestore: improve messaging 2025-10-16 10:05:05 -07:00
Enrico Ros 0e09cf3d84 Merge pull request #846 from enricoros/claude/issue-844-20251016-0857
fix: properly handle null and undefined in clipboard operations
2025-10-16 02:01:28 -07:00
claude[bot] 5634aa0cac fix: properly handle null and undefined in clipboard operations
- Add nullish coalescing (??) after optional chaining to ensure string return
- Prevents undefined from propagating through the promise chain
- Fixes potential TypeError when calling .startsWith() on undefined

Co-authored-by: Enrico Ros <enricoros@users.noreply.github.com>
2025-10-16 08:58:59 +00:00
Enrico Ros 07916be684 workflows: CC: dm to 100 2025-10-16 01:11:20 -07:00
Enrico Ros 8d20b4675b workflows: CC: allow DM to PR 2025-10-16 00:54:19 -07:00
Enrico Ros d906669ea4 Fix: roll posthog 2025-10-16 00:44:34 -07:00
Enrico Ros 5d7b00f0dc workflows: CC: up max turns 2025-10-16 00:07:52 -07:00
Enrico Ros 740d76c15c LLMs: OpenAI: add gpt-5-search-api 2025-10-15 19:36:38 -07:00
Enrico Ros ca4d21d4b8 LLMs: Gemini: improve logs 2025-10-15 19:19:30 -07:00
Enrico Ros e4defc1baf tRPC Fetchers: improve logging 2025-10-15 19:10:46 -07:00
Enrico Ros 9ea859081d Error Message test 2025-10-15 18:40:47 -07:00
Enrico Ros 87d8320b31 tRPC: Fetchers: show warning also on network connection lost 2025-10-15 18:28:56 -07:00
Enrico Ros 84aea90860 KB: remove AIX adapters 2025-10-15 18:26:38 -07:00
Enrico Ros 95f35cb5cf Roll AIX 2025-10-15 18:08:12 -07:00
Enrico Ros c79ba097c0 CC: slashcommands: idiosyncratic updates 2025-10-15 17:54:14 -07:00
Enrico Ros 8ea1f02c86 workflows: label triaged/reviewed issues/prs 2025-10-15 17:51:10 -07:00
Enrico Ros 674c9c8c25 LLMs: Ollama: update and remove all descriptions (not used) 2025-10-15 17:29:02 -07:00
Enrico Ros 98a3e7e185 LLMs: Alibaba: update: remove any model information and jut return the sorted list from the API 2025-10-15 17:29:02 -07:00
Enrico Ros ee00c53ada LLMs: Perplexity: update 2025-10-15 17:29:02 -07:00
Enrico Ros 0553f64fe8 LLMs: xAI: update 2025-10-15 17:29:02 -07:00
Enrico Ros ff06f6f04c LLMs: Groq: update 2025-10-15 17:29:02 -07:00
Enrico Ros 3f45617e06 LLMs: Deepseek: update 2025-10-15 17:29:02 -07:00
Enrico Ros 9d93c8c55a LLMs: Mistral: update all 2025-10-15 17:29:02 -07:00
Enrico Ros 73eaf740db LLMs: Anthropic: Haku 4.5 (Thinking and not) 2025-10-15 17:29:02 -07:00
Enrico Ros 48426d5022 AIX: OpenAI: [Mistral, 2025-10-15]: non-standard delta.content for thinking blocks 2025-10-15 17:29:02 -07:00
Enrico Ros c79237b419 CC: slashcommands: Models ollama automation 2025-10-15 17:29:02 -07:00
Enrico Ros b0abaf4d9e CC: slashcommands: Models sync - pass1 2025-10-15 17:29:02 -07:00
Enrico Ros ec92a8d31a CC: slashcommands: API syncs 2025-10-15 17:29:02 -07:00
Enrico Ros a4600a4d1d tRPC Fetchers: show content type on parse failures 2025-10-15 17:29:02 -07:00
Enrico Ros ad6a465ce7 workflows: allow slashcommands for DMs 2025-10-15 17:29:02 -07:00
Enrico Ros 0820bb5af6 workflows: elevate max turns 2025-10-15 12:59:39 -07:00
Enrico Ros 73f8488d22 AIX: enable Abort on GET 2025-10-15 12:58:27 -07:00
Enrico Ros 2b3c1c38f3 tRPC Fetchers: improve error reporting at all 3 stages: connect, get response, parse contents 2025-10-15 12:58:24 -07:00
Enrico Ros 59f379f46b tRPC Fetchers: improve JSON decoding errors 2025-10-15 12:58:21 -07:00
Enrico Ros 2bc6ecbe4c tRPC: improve Abort support 2025-10-15 12:58:18 -07:00
Enrico Ros 8274a34841 InlineError: support newlines 2025-10-15 12:58:14 -07:00
Enrico Ros 6e7197caa3 QueryClient: explicit option 2025-10-15 12:57:33 -07:00
Enrico Ros 7c78d48b6c workflows: reduce noise 2025-10-14 22:50:30 -07:00
Enrico Ros b149eb7fa2 workflows: Claude Code DMs 2025-10-14 21:15:07 -07:00
Enrico Ros ba79a3c42c workflows: Claude Code Issue Triage 2025-10-14 21:15:07 -07:00
Enrico Ros 4445ac295f workflows: Claude Code PR Reviews 2025-10-14 21:15:07 -07:00
Enrico Ros 09c2a8b072 KB: renamed app-routing 2025-10-14 18:19:02 -07:00
Enrico Ros 92e371837d Modals: fullscreen options on desktop 2025-10-14 17:54:40 -07:00
Enrico Ros 7fad41dc8a FormSliderControl: option to hide the track 2025-10-14 17:31:40 -07:00
Enrico Ros 0be8ac7e09 CLAUDE.md 2025-10-14 17:31:40 -07:00
Enrico Ros de6e8a047c KB: Index 2025-10-14 17:31:40 -07:00
Enrico Ros 92955f92bf KB: AIX supports 2025-10-14 17:31:40 -07:00
Enrico Ros 5327866836 KB: routing architecture 2025-10-14 17:31:40 -07:00
Enrico Ros 54b8836faa KB: Update the params system 2025-10-14 17:31:40 -07:00
Enrico Ros eb39db9974 KB: structure 2025-10-14 17:31:40 -07:00
Enrico Ros 087e6e2eaf AIX: Client: decimator cleanups 2025-10-14 02:52:28 -07:00
Enrico Ros 295d91b310 AIX: Client: improve disconnect message. 2025-10-14 02:50:50 -07:00
Enrico Ros f75bcb78d7 FIX: AIX: Anthropic: hotfix: disable thinking with forced tool use (policy of: any, tool) 2025-10-14 02:48:22 -07:00
Enrico Ros ffb32d8720 Roll AIX 2025-10-14 02:22:48 -07:00
Enrico Ros 879458d692 Roll packages 2025-10-14 02:22:27 -07:00
Enrico Ros 96eece3a3e Optima: MobilePanel: pad bottom 2025-10-14 01:56:16 -07:00
Enrico Ros dc75136131 LLMs: OpenAI: flush 2025-10-14 01:15:28 -07:00
Enrico Ros 57c43b3c4e LLMs: OpenAI: reduce model desc size 2025-10-14 01:12:03 -07:00
Enrico Ros 4c5b7677e6 LLMs: nit 2025-10-14 01:12:03 -07:00
Enrico Ros 43890150e5 AIX: OpenAI: fix NS function parsing 2025-10-14 01:11:19 -07:00
Enrico Ros bc86214c5e LLMs: xAI: grok names 2025-10-13 18:51:30 -07:00
Enrico Ros ef1f412019 LLMs: OpenAI: slimmer 2025-10-13 18:51:14 -07:00
Enrico Ros 1249efb53b LLMs: xAI: add the latest groks. Fixes #837 2025-10-13 18:50:21 -07:00
Enrico Ros 8bc81e45ce Models: UI hide hidden by default 2025-10-13 17:52:23 -07:00
Enrico Ros 810f316185 Nit 2025-10-13 17:23:34 -07:00
Enrico Ros 5b49e801d1 Roll AIX 2025-10-13 17:20:03 -07:00
Enrico Ros 3269e10da9 LLMs: OpenAI: enable Audio output models 2025-10-13 17:10:23 -07:00
Enrico Ros 53a57fd7ff AIX: OpenAI: support Audio output models S/NS 2025-10-13 17:10:09 -07:00
Enrico Ros dbbf25c3af LLMs: new model description builder 2025-10-13 17:09:17 -07:00
Enrico Ros a2ff00f53b LLMs: mandatory fallbacks 2025-10-13 13:49:08 -07:00
Enrico Ros 4904383838 BlockPartModelAnnotations: fix Favicons for Google Search icons (vertex) 2025-10-13 13:25:00 -07:00
Enrico Ros 8221444308 LLMs: Gemini: enable search 2025-10-13 13:24:31 -07:00
Enrico Ros 7cd94b3163 *AIX: Gemini: Google Search support 2025-10-13 13:24:16 -07:00
Enrico Ros 52cdf7da4e AIX: Hosted tools - bits of cleanup 2025-10-13 12:46:02 -07:00
Enrico Ros 6ff010ae0e AIX: Gemini: remove the old google_search_retrieval 2025-10-13 12:05:19 -07:00
Enrico Ros 6d81150975 AIX: Gemini: parse also finish message and url grounding 2025-10-13 11:55:44 -07:00
Enrico Ros 0fdcc4c64d LLMs: Gemini: remove 4:5 and 5:4 support, not in the API 2025-10-13 11:52:30 -07:00
Enrico Ros f272c9cb12 AIX: Gemini: expand to all finish reasons 2025-10-13 11:36:11 -07:00
Enrico Ros 5354f83736 AIX: OpenAI: Responses: uniform S/NS parsing 2025-10-13 11:26:57 -07:00
Enrico Ros f4b2f36ac0 AIX: OpenAI: Responses: handle streaming stop reasons 2025-10-13 10:49:59 -07:00
Enrico Ros 5fca834c20 AIX: OpenAI: Responses: handle max_tokens 2025-10-13 10:49:59 -07:00
Enrico Ros fff48335ae AIX: OpenAI: skip also markdown restoration when custom tools with restrictive policies are applied 2025-10-13 10:49:59 -07:00
Enrico Ros f39a1825cf AIX: OpenAI: skip hosted tools when custom tools with restrictive policies are applied 2025-10-13 10:17:21 -07:00
Enrico Ros c1b10405a5 AIX: Anthropic: document context_window_exceeded 2025-10-13 10:17:21 -07:00
Enrico Ros 37ba583cf2 AIX: Anthropic: parse new stop reasons 2025-10-13 10:17:21 -07:00
Enrico Ros 4beb7de83f AIX: add tokenStopReasons: ok-pause, and filter-refusal 2025-10-13 10:17:21 -07:00
Enrico Ros cb8202e327 dMessageUtils: pedantic 2025-10-13 10:17:20 -07:00
Enrico Ros 90c90f78b6 LLMs: OpenAI: exclude sora-2/pro from Language models 2025-10-11 20:39:22 -07:00
Enrico Ros e700c27256 presentErrorToHumans: fix 2025-10-11 20:39:18 -07:00
Enrico Ros 7372287b5c AIX: Resumability: Block Control 2025-10-10 18:33:47 -07:00
Enrico Ros d059948f62 useFragmentBuckets: check for error 2025-10-10 18:11:42 -07:00
Enrico Ros 1cb6491d17 AIX: Errors are not appended as Text anymore 2025-10-10 18:04:26 -07:00
Enrico Ros 3a6e8a5f27 AIX: Resumability: OpenAI: request and parse 2025-10-10 17:49:50 -07:00
Enrico Ros c0cd820880 AIX: Resumability: enablement downstream 2025-10-10 17:49:50 -07:00
Enrico Ros 7b5655dd6d AIX: Resumability: enablement upstream 2025-10-10 17:49:50 -07:00
Enrico Ros 0f4c108614 AIX: Client: improve reuse 2025-10-10 17:35:54 -07:00
Enrico Ros 86f4cc66d1 DMessage: upstream handle support 2025-10-10 17:33:04 -07:00
Enrico Ros ca38e7f160 AIX: OAI: Responses: typo 2025-10-10 17:33:04 -07:00
Enrico Ros 99bd54ca79 BlockPartPlaceholder: fix mx: 1.5 2025-10-10 17:32:57 -07:00
Enrico Ros 9a3ef83078 README: Link images 2025-10-10 09:17:43 -07:00
Enrico Ros c1d3c5d350 AIX: Inspector: link in DesktopNav > Tools 2025-10-10 09:17:36 -07:00
Enrico Ros a36e202c80 AIX: Inspector: Quick Toggle 2025-10-10 09:06:14 -07:00
Enrico Ros b713b65a35 AIX: Inspector: show body size 2025-10-10 08:53:33 -07:00
Enrico Ros 925445c729 README: bits 2025-10-10 08:26:34 -07:00
Enrico Ros ce8140ce22 README: add and section out some changes 2025-10-10 02:17:15 -07:00
Enrico Ros d2f60e51c7 Add mascot section to README
Added a section for the mascot with an image.
2025-10-10 02:15:55 -07:00
Enrico Ros c66885d25c News: fix style 2025-10-10 01:24:16 -07:00
Enrico Ros 8d4ca7b547 2.0 update package 2025-10-10 01:06:08 -07:00
Enrico Ros 280b32b3a9 2.0 Adjusted news item 2025-10-10 00:49:43 -07:00
Enrico Ros 522bd890c1 2.0 Simple news items 2025-10-10 00:38:31 -07:00
Enrico Ros 88e1f51099 Model Configuration: don't fullscreen on mobile 2025-10-10 00:34:51 -07:00
Enrico Ros 8774b222d9 2.0 release graphics 2025-10-09 23:58:48 -07:00
Enrico Ros b9ef1d608c News: rename Callout 2025-10-09 22:02:28 -07:00
Enrico Ros a0d25a1d48 News: wire Dev up top 2025-10-09 21:41:05 -07:00
Enrico Ros 92cd9e5930 News: add Dev item 2025-10-09 21:39:41 -07:00
Enrico Ros 3099b0d0ec News: extract Card 2025-10-09 21:19:27 -07:00
Enrico Ros 4a5ce94d29 Fix removal of Release.App 2025-10-09 20:57:15 -07:00
Enrico Ros b47a1fd562 Remove Release.App/News 2025-10-09 20:51:11 -07:00
Enrico Ros 10bef4f75c Dissolve app.version 2025-10-09 20:44:51 -07:00
Enrico Ros 41c571caf5 Name as Open 2025-10-09 20:41:45 -07:00
Enrico Ros a21b049437 BackupRestore: remove version 2025-10-09 20:37:12 -07:00
Enrico Ros f06fbec8df webGeolocationUtils: correct package version 2025-10-09 20:35:56 -07:00
Enrico Ros 24b6b4e1a9 Update description 2025-10-09 20:10:16 -07:00
Enrico Ros df8f9b3e3a AIX: limit the echo request size 2025-10-09 19:55:24 -07:00
Enrico Ros 85a55bcc4c BlockEdit_TextFragment: fix escape key 2025-10-09 19:55:11 -07:00
Enrico Ros facb2e3f2b Update README.md 2025-10-09 19:44:12 -07:00
Enrico Ros f6e79510c9 Revise README for clarity and structure
Updated the README to improve clarity and formatting.
2025-10-09 19:43:11 -07:00
Enrico Ros 528055929a Legacy 2025-10-09 19:34:42 -07:00
Enrico Ros 7a1774a2ba Roll year 2025-10-09 19:32:15 -07:00
Enrico Ros 66749ded0a Update the README.md 2025-10-09 19:30:15 -07:00
Enrico Ros 6f74dc6c72 Misc small cleanups 2025-10-09 16:19:53 -07:00
Enrico Ros b8d27346e0 Update docs and refs (v2-dev -> main) 2025-10-09 16:19:53 -07:00
Enrico Ros e1e73cd260 Docker: update for main branch transition
Triggers on 'main'
2025-10-09 16:19:53 -07:00
Enrico Ros a1bf15c316 AixDebuggerDialog: fix scroll 2025-10-09 06:34:35 -07:00
Enrico Ros e69bf34ed6 GoodModal: uncontrolled maximization 2025-10-08 14:53:10 -07:00
Enrico Ros fa1a977870 AixDebuggerDialog: full screen on Mobile 2025-10-08 11:51:46 -07:00
Enrico Ros 7ed4ccb66c LoggerViewerDialog: full screen on Mobile 2025-10-08 11:50:10 -07:00
Enrico Ros 76a90ede24 ShortcutsModal: full screen on Mobile 2025-10-08 11:43:51 -07:00
Enrico Ros 89e8c24f46 Settings and Models: full screen on Mobile 2025-10-08 11:42:37 -07:00
Enrico Ros 430c7602d4 GoodModal: support Fullscreen and fix display:grid
Note that display:grid was fitting to contents, but we prefer display:flex (direction:column)
so we had to make the maxWidth property from 700 to adaptive to the screen size.
2025-10-08 11:42:23 -07:00
Enrico Ros 51b9fbac0f Roll packages 2025-10-08 08:08:22 -07:00
Enrico Ros 63eba761c5 Fix the Draw Provider selector 2025-10-07 09:25:01 -07:00
Enrico Ros e80fb7aa73 OpenAI gpt-image-1-mini: enable inpaint 2025-10-07 09:24:32 -07:00
Enrico Ros 8b2b98fc10 OpenAI gpt-image-1-mini support 2025-10-07 09:12:39 -07:00
Enrico Ros c9712c72a0 Fix Sonnet-4.5 display name 2025-10-07 08:52:35 -07:00
Enrico Ros d0ad4095c0 LLMs: Add OpenAI Gpt-5 Pro 2025-10-06 11:14:44 -07:00
Enrico Ros 1c00286a70 Roll AIX 2025-10-05 09:48:23 -07:00
Enrico Ros 8687c6b08b Merge pull request #839 from sam0jones0/add-claude-4.1-4.5-thinking-support
Add thinking support for Claude 4.1 Opus and 4.5 Sonnet via OpenRouter
2025-10-05 09:44:57 -07:00
Enrico Ros 7bdf467833 LLMs: disable staging log on openai 2025-10-03 20:53:14 -07:00
Enrico Ros 39736fbd27 idUtils: uuid underscore prefix stripping 2025-10-03 17:11:36 -07:00
Enrico Ros f5e34e8096 idUtils: uuid validation (fast, inaccurate) 2025-10-03 17:11:36 -07:00
Enrico Ros b2246ed922 Prevent wrong error matching 2025-10-03 13:05:05 -07:00
Enrico Ros a499e8463c LLMs: OpenAI: debug only in dev 2025-10-03 12:52:40 -07:00
Enrico Ros 708ae291cc LLMs: Gemini: debug only in dev 2025-10-03 12:52:36 -07:00
Enrico Ros 0d4db0322b tRPC: decode gemini 403 !ok 2025-10-03 12:40:28 -07:00
Enrico Ros 39ae2e47f9 LLMs: Gemini: enable Aspect Ratio 2025-10-02 20:10:42 -07:00
Enrico Ros 25159669df AIX/LLMs: Gemini: Nano Banana image aspect ratio 2025-10-02 20:10:37 -07:00
Enrico Ros 4e24281e18 LLMs: Gemini: update models 2025-10-02 18:48:13 -07:00
Enrico Ros d9bdeeb6b3 GoodModal: support darker bg 2025-10-02 18:48:13 -07:00
Enrico Ros b2847e7026 AIX: OpenRouter: let FCs through 2025-10-02 18:47:20 -07:00
Sam Jones 3f6bd90f64 Add thinking support for Claude 4.1 Opus and 4.5 Sonnet via OpenRouter 2025-10-02 11:30:39 +01:00
Enrico Ros 6b5984deac AIX: OpenAI: Variant Support 2025-10-01 17:48:39 -07:00
Enrico Ros 2dfaec9216 RenderImageURL: change the open/fullscreen icon 2025-10-01 06:21:08 -07:00
Enrico Ros ddbc5e65e8 AIX: OpenAI: fix Responses API breaking change 2025-09-30 18:55:59 -07:00
Enrico Ros 5dae51d2a1 AIX: extract CG impl for server-side usage, retry, etc. 2025-09-29 19:29:26 -07:00
Enrico Ros 75215955be AIX: export server Context type 2025-09-29 19:29:10 -07:00
Enrico Ros 79ee764a9f AIX: Inspector: limit to specific requests in production 2025-09-29 19:29:07 -07:00
Enrico Ros dce27e89a1 AIX: roll 2025-09-29 15:38:15 -07:00
Enrico Ros 448df4baf8 useLLMSelect: model options 2025-09-29 15:35:35 -07:00
Enrico Ros dafd09084a dMessageUtils: support Anthropic 4.5 and Gemini Robotics 2025-09-29 15:03:06 -07:00
Enrico Ros cae7d06256 LLMs: Gemini: add newer Flash/Flash-Lite and Robotics models 2025-09-29 15:03:06 -07:00
Enrico Ros a27eae46f6 LLMs: Gemini: remove vague symlinks 2025-09-29 15:03:06 -07:00
Enrico Ros 9f067c07f0 LLMs: Gemini: remove gen 1.5 models 2025-09-29 15:02:06 -07:00
Enrico Ros 1f0be73695 LLMs: OpenAI GPT-5 Codex 2025-09-29 13:28:26 -07:00
Enrico Ros ce6d42dcdd AIX: OpenAI Responses: allow re-entering the same summary block 2025-09-29 13:28:26 -07:00
Enrico Ros 439740adba ContentReassembler: newline before breakage messages 2025-09-29 13:28:26 -07:00
Enrico Ros cff36c0c31 AutoBlocksRenderer: re-enable ERC on completion 2025-09-29 13:28:26 -07:00
Enrico Ros 7c9edaf186 BlockPartModelAux: support Markdown (but off for now) 2025-09-29 13:28:26 -07:00
Enrico Ros bbc736d72a LLMs: Anthropic thinking models first 2025-09-29 12:00:14 -07:00
Enrico Ros 47439b9907 LLMs: Anthropic Claude Sonnet 4.5 2025-09-29 12:00:14 -07:00
Enrico Ros c3274d66c9 Move tf pointers 2025-09-29 07:49:16 -07:00
Enrico Ros d4836914dd [TOOLS] AIX: support Vnd.Ant 2024-10-22 2025-09-29 07:42:39 -07:00
Enrico Ros 4a44393878 AIX: Inspector: fixes 2025-09-29 07:42:39 -07:00
Enrico Ros 123ebc0f26 AIX: remove old debugging 2025-09-29 07:42:39 -07:00
Enrico Ros 0a133a764b DMessageMetadata: initial recipients 2025-09-29 07:42:39 -07:00
Enrico Ros c1d807a516 OptimaBarDropdown: support showFaded with 67% opacity 2025-09-29 07:39:15 -07:00
Enrico Ros aad715f7e1 Placeholder: mx 1.5 on op chip 2025-09-29 07:39:15 -07:00
Enrico Ros f1ec94111a FormInputKey: fix min width on mobile 2025-09-18 16:22:54 -07:00
Enrico Ros 07fcecc5b5 DebouncedInput: support startDecorator 2025-09-18 16:22:41 -07:00
Enrico Ros c56328009e Models-modal: Wizard uses the same autocomplete pattern 2025-09-17 12:39:27 -07:00
Enrico Ros f8cbb6faa2 Models-modal: shift + add -> wizard 2025-09-17 12:39:22 -07:00
Enrico Ros c07eb4014f FormInputKey: support for tooltips 2025-09-17 12:39:18 -07:00
Enrico Ros 94c1b35cee FormInputKey: bind to a username to help pass managers remember 2025-09-17 12:39:14 -07:00
Enrico Ros 2277fd0880 FormInputKey: smaller keys 2025-09-17 12:39:14 -07:00
Enrico Ros a2313186e4 FormInputKey: autocomplete on 2025-09-17 12:39:14 -07:00
Enrico Ros 3351d61ca7 LLMs: Gemini: update models 2025-09-17 01:11:31 -07:00
Enrico Ros 905d438075 Void fragments: render follow-up placeholders 2025-09-16 12:49:13 -07:00
Enrico Ros ba3290f4e1 animationSpinHalfPause 2025-09-16 12:48:51 -07:00
Enrico Ros a828ea45aa BlockPartPlaceholder: render model messages 2025-09-16 12:40:32 -07:00
Enrico Ros 7c484ea5d8 BlockPartPlaceholder: render model messages 2025-09-16 11:37:16 -07:00
Enrico Ros 5b68608d5b AIX: fragment-compatible support of model ops 2025-09-16 11:36:21 -07:00
Enrico Ros 08ef5396f3 Placeholder: don't draw the datastreamviz if we have a model placeholder 2025-09-16 10:46:52 -07:00
Enrico Ros 662ef5ae4f Fragments: support model placeholders 2025-09-16 10:45:59 -07:00
Enrico Ros 23a1e9b335 DLLM/AIX: Image generation options 2025-09-16 10:16:13 -07:00
Enrico Ros b79f8a1508 Roll packages 2025-09-16 00:23:19 -07:00
Enrico Ros a793fa041e LLMs: update scores from Sep 8, 2025 2025-09-16 00:20:53 -07:00
Enrico Ros c5ef92f1f7 Bump AIX 2025-09-15 23:36:45 -07:00
Enrico Ros 7ccf22c2f4 LLMS/AIX: OpenAI Image Generation Tool 2025-09-15 23:24:00 -07:00
Enrico Ros 67df71ab45 Doc: LLMS/AIX: parameters system 2025-09-15 22:50:43 -07:00
Enrico Ros 0636ca76ea AIX: OpenAI Responses: built-in image_generation_call config and output 2025-09-15 22:24:13 -07:00
Enrico Ros 2f2e4e36be AIX: OpenAI Responses: handle web_search_calls even for obscure types 2025-09-15 19:25:59 -07:00
Enrico Ros 913c821eae AIX: OpenAI Responses: fix different 'refusal' naming :/ 2025-09-15 19:25:59 -07:00
Enrico Ros 43f2bacf58 AIX: OpenAI Responses: response object stub definitions 2025-09-15 19:25:59 -07:00
Enrico Ros ae0cf1a89e AIX: OpenAI Responses: high quality citation output 2025-09-15 19:25:59 -07:00
Enrico Ros 0b2d037385 AIX: OpenAI Responses: web search results transmission 2025-09-15 19:25:59 -07:00
Enrico Ros cd5cef1c25 AIX: Placeholder supports (pre beginning of the real content, then done) 2025-09-15 19:25:59 -07:00
Enrico Ros 8c5f70a339 AIX: OpenAI Responses: Web Search: handle web search results too 2025-09-15 19:25:59 -07:00
Enrico Ros f5ecfd1d74 Annotations: copy citations as text or table 2025-09-15 19:25:59 -07:00
Enrico Ros e0de908741 LLMs: document search off 2025-09-15 16:31:25 -07:00
Enrico Ros cd2ccff0d7 Annotations: improve style for regular citations 2025-09-15 16:24:07 -07:00
Enrico Ros 3df6c62dce CloseablePopup: support custom boxShadow 2025-09-15 13:19:15 -07:00
Enrico Ros 463af67d17 Typo 2025-09-15 13:18:58 -07:00
Enrico Ros 80f345b179 LLM Attachment Drafts: has images 2025-09-13 12:17:50 -07:00
Enrico Ros bf212ca83f AIX: Client: Auto-Index of images excludes caption 2025-09-13 12:17:50 -07:00
Enrico Ros 0185712cbf AIX: Client: Auto-index for Images on System Messages 2025-09-13 01:28:28 -07:00
Enrico Ros a5199a23d9 AIX: Support for Images on System Messages 2025-09-13 00:52:54 -07:00
Enrico Ros 011c382360 AIX: Gemini: note on Image ordering 2025-09-13 00:52:54 -07:00
Enrico Ros 5c9ce84249 AIX: Content.SystemInstruction: support InlineImage parts 2025-09-13 00:52:54 -07:00
Enrico Ros 9e89ba9b10 ImageAttachmentFragments: support renderVariant, for rendering from the PersonasEditData 2025-09-13 00:52:54 -07:00
Enrico Ros cb8cefb0ea Attachment Menu: zIndex over modal for when used on the Persona Edit modal 2025-09-13 00:52:54 -07:00
Enrico Ros 7607b8fec5 ChatMessage: render system images below user images 2025-09-13 00:52:54 -07:00
Enrico Ros 05a96c5aca Fragment buckets: use correct classifier
# Conflicts:
#	src/common/stores/chat/hooks/useFragmentBuckets.ts
2025-09-13 00:49:27 -07:00
Enrico Ros 762b0c11ff AppChat: small comment 2025-09-12 17:49:42 -07:00
Enrico Ros c903f9bc5c LLMs: OpenAI: [DEV] models delta code 2025-09-12 17:49:28 -07:00
Enrico Ros c190ae89ce LLMs: OpenAI: remove older models 2025-09-12 17:49:28 -07:00
Enrico Ros 1b6b491eee LLMs: OpenAI: add new Audio and Realtime models descs 2025-09-12 16:45:17 -07:00
Enrico Ros 4e9c0ba489 LLMs: OpenAI: remove older 4o-realtime models 2025-09-12 16:45:16 -07:00
Enrico Ros 13fcb932d1 LLMs: OpenAI: prevent older realtime models from showing 2025-09-12 16:45:16 -07:00
Enrico Ros f9f2c3d2b2 AIX: LLM_IF_Outputs_Audio replaces needs_audio 2025-09-12 16:36:38 -07:00
Enrico Ros bdab75c336 LLMs: OpenAI: deprecated/shut down some models 2025-09-12 16:36:38 -07:00
Enrico Ros 5996934f60 AIX: OpenAI chatGenerate: enable audio modalities 2025-09-12 16:36:37 -07:00
Enrico Ros 2f8659fc38 AIX: OpenAI chatGenerate: ignore pure-obfuscation messages 2025-09-12 16:29:00 -07:00
Enrico Ros 1e1206ab7e Merge branch 'fork/powyncify/v2-dev-paulshort' into v2-dev
# Conflicts:
#	.claude/settings.local.json
#	src/modules/aix/server/dispatch/chatGenerate/adapters/openai.responsesCreate.ts
2025-09-12 14:27:31 -07:00
Enrico Ros 4682afc985 More permissions 2025-09-12 14:26:31 -07:00
Enrico Ros 8722e1be6c Azure: update docs 2025-09-12 14:26:31 -07:00
Enrico Ros fbd6fd3e7c LLMs: Azure: Disable the web search option from the model description (configurable by code) 2025-09-12 14:19:20 -07:00
Enrico Ros cfba3ce834 Azure: update env var docs 2025-09-12 14:19:20 -07:00
Enrico Ros a4ad1e8295 Azure: remove description of the fix for #828, now it's merged 2025-09-12 14:19:20 -07:00
Enrico Ros aa441b0656 LLMs: Azure OpenAI: cleaned up (and moved) azureOpenAIAccess - simpler and modularized code 2025-09-12 14:19:20 -07:00
Enrico Ros 39a7e30880 AIX: Azure OpenAI: renamed Env Vars 2025-09-12 13:58:23 -07:00
Enrico Ros 74b69f9ea4 AIX: Azure OpenAI: verified web_search_preview not present yet 2025-09-12 13:57:49 -07:00
Enrico Ros 3094540b93 LLMs: misc bits 2025-09-12 13:24:30 -07:00
Enrico Ros 513500b16e LLMs: extract access response type 2025-09-12 13:23:52 -07:00
Enrico Ros 51c41473a5 Azure: remove path from the example 2025-09-12 12:47:09 -07:00
Enrico Ros e79df4a347 AIX: Azure/Request API: improve web search tool use and exceptions 2025-09-12 11:21:51 -07:00
Enrico Ros 53a4a66e9e AIX: OpenAI Responses: support temperature in ChatGPT-5 2025-09-12 10:57:53 -07:00
Enrico Ros aaf2de278f OpenAI: move and rename ChatGPT-NR 2025-09-12 10:57:53 -07:00
Enrico Ros d2e8bad75f OpenAI: decorate LLMs that search 2025-09-12 10:57:53 -07:00
Enrico Ros 98bcbba7ca useLLMSelect: option to configure models at the bottom, and optimize styles 2025-09-12 10:57:02 -07:00
Enrico Ros 61258163e2 Optimize FormSliderControl by 1000x 2025-09-11 16:14:17 -07:00
Enrico Ros 80b393ca14 AIX: improve llm user config replacement/override 2025-09-11 14:02:58 -07:00
Enrico Ros b57c292581 LLMs: bits 2025-09-11 14:02:03 -07:00
Enrico Ros 044e2f9b57 ModelConfiguration: explicit 'undefined' modelParameters 2025-09-11 14:01:47 -07:00
Enrico Ros b14e9c91c6 Speech Recognition: add dispose (unmounts) as a one-way street 2025-09-11 14:01:40 -07:00
Enrico Ros 58fe41edc3 OpenAI Verbosity: llm params editor 2025-09-11 13:59:01 -07:00
Enrico Ros 73a089e177 OpenAI Verbosity: models 2025-09-11 13:58:57 -07:00
Enrico Ros ada9e07c2f OpenAI Verbosity: AIX & llms 2025-09-11 13:58:55 -07:00
Enrico Ros 3b9e42948e More permissions 2025-09-11 13:58:04 -07:00
Enrico Ros 2e822b1eeb Common: KeyStroke: color support 2025-09-04 15:11:22 -07:00
Enrico Ros 8f67c3e398 Common: KeyStroke: pass onClick 2025-09-04 15:11:21 -07:00
Enrico Ros 82289c0564 More permissions 2025-09-03 17:55:32 -07:00
Enrico Ros 16e5e08d21 Roll packages, but hold back Zustand 2025-09-01 17:45:49 -07:00
Enrico Ros 62671ae04f AIX: improve error messaging of captive portals and requests too large 2025-08-21 14:28:22 -07:00
Enrico Ros 266a5c6408 CloseablePopup: prevent keystrokes from being intercepted by MenuItems's accelerators 2025-08-17 14:12:44 -07:00
Enrico Ros e9264c782f ErrorBoundary: support link 2025-08-17 14:11:59 -07:00
Enrico Ros 37eb046c10 Optima: Page heading: ellipsize 2025-08-17 14:11:41 -07:00
Enrico Ros 6e75f7dbee OpenAI: abortable create/edit image requests 2025-08-17 14:10:49 -07:00
Enrico Ros e420fa9661 Server-side fetchers: note the abort path 2025-08-17 14:10:34 -07:00
Enrico Ros 505649e360 OptimaPanelGroupedList: any title 2025-08-17 14:09:51 -07:00
paulhshort 3d93c856ba Fix Azure OpenAI web_search_preview tool incompatibility
Azure OpenAI doesn't support the web_search_preview tool, which was causing
"Hosted tool 'web_search_preview' is not supported" errors with GPT-5 models.

## Changes:
- Pass dialect information to aixToOpenAIResponses function
- Skip web_search_preview tool addition when dialect is 'azure'
- Add logging when web search is skipped for Azure
- Document known Azure limitations in implementation guide

## Impact:
- Fixes web browsing errors with Azure GPT-5 models
- Maintains web search functionality for regular OpenAI models
- Provides clear logging for debugging

This is a critical fix for Azure OpenAI compatibility as web search is not
currently supported on Azure's Responses API implementation.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-08-11 20:07:19 -04:00
paulhshort 9fe5697fd4 Fix Azure OpenAI Resource Not Found errors for GPT-5 and o3 models (#828)
This commit addresses GitHub issue #828 by fixing URL construction for Azure OpenAI's Responses API
and preventing malformed URLs from client configuration issues.

## Problems Fixed:
1. Host normalization: Prevents malformed URLs when client config includes paths/queries
2. API paradigm support: Properly handles Azure's next-gen v1 Responses API
3. API version consistency: Centralizes version management with env overrides

## Key Changes:
- Normalize Azure host URLs to origin only (strip path/query)
- Prefer server environment variables over client-provided hosts
- Add special handling for Responses API (/openai/v1/responses)
- Support both traditional (deployment-based) and v1 API paradigms
- Add configurable API versions via environment variables
- Include debug logging for API paradigm selection

## New Environment Variables:
- AZURE_API_V1: Enable next-gen v1 API explicitly
- AZURE_RESPONSES_API_VERSION: Control Responses API version
- AZURE_CHAT_API_VERSION: Control Chat Completions API version
- AZURE_DEPLOYMENTS_API_VERSION: Control deployments listing API version

## Testing:
Validated with Azure OpenAI endpoint showing:
- List Deployments:  Works
- Chat Completions:  Works (with correct params for GPT-5)
- Responses API (v1):  Works with /openai/v1/responses?api-version=preview
- Responses API (traditional):  404 (Azure doesn't support this pattern)

The fix defaults to using Azure's recommended next-gen v1 API for Responses
while maintaining backward compatibility for existing deployments.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-08-11 19:37:41 -04:00
Enrico Ros 7fde3473ea BlobUtils: export direct 2025-08-09 14:12:52 -07:00
Enrico Ros 56a2d68c71 GPT-5: add Markdown Restore option, like the 'o' models 2025-08-08 16:48:11 -07:00
Enrico Ros 3d140604f8 Roll AIX 2025-08-08 16:48:11 -07:00
Enrico Ros 0a2167fa6a AIX: OpenAI: GPT-5 markdown restoration with the Responses API 2025-08-08 16:47:36 -07:00
Enrico Ros 79e95379ec AIX: OpenAI Chat Completions: GPT-5 as an 'o' model 2025-08-08 16:36:40 -07:00
Enrico Ros 3f740f3800 roll packages 2025-08-08 16:01:04 -07:00
Enrico Ros a4a0ecc0e5 LLMs: reset user params 2025-08-07 19:47:32 -07:00
Enrico Ros 686ad2ed7b FormControls size setting 2025-08-07 19:47:32 -07:00
Enrico Ros d8db79b4e5 AIX: OpenAI does not support Web Search with reasoning:minimal 2025-08-07 14:44:17 -07:00
Enrico Ros d33449f4af GPT-5: support 'minimal' reasoning 2025-08-07 14:37:32 -07:00
Enrico Ros 7e6a12bddf Roll AIX 2025-08-07 14:37:11 -07:00
Enrico Ros ec80413be0 AIX: vnd.oai.reasoning:minimal support 2025-08-07 14:34:54 -07:00
Enrico Ros c2af65facd GPT 5 naming 2025-08-07 13:38:10 -07:00
Enrico Ros 46266ac825 Rename search size to web search 2025-08-07 13:38:10 -07:00
Enrico Ros 91dc25e1c6 OpenAI: GPT5 models (configurable reasoning & search) 2025-08-07 13:38:10 -07:00
Enrico Ros 7f9dafd749 Roll AIX 2025-08-06 15:26:19 -07:00
Enrico Ros 989d0e5741 Anthropic: 4.1 > 4 2025-08-06 15:24:54 -07:00
Enrico Ros 3277c009fa Composer: change tips 2025-08-06 07:57:40 -07:00
Enrico Ros 85f1fe088d Fix models description 2025-08-05 21:37:03 -07:00
Enrico Ros 3c554c92d6 Anthropic: update models 2025-08-05 20:20:42 -07:00
Enrico Ros f95d071197 Anthropic: fix dMessageUtils 2025-08-05 20:20:42 -07:00
Enrico Ros da887d58db Incognito: improve chat drawer icon 2025-08-05 20:20:42 -07:00
Enrico Ros 5273a3c84f ClosablePopup: support size 2025-08-05 20:13:38 -07:00
Enrico Ros f51712867f Folders: feed items count (commented) 2025-08-05 20:13:38 -07:00
Enrico Ros ecac1dffec Folders: support for items count, if any 2025-08-05 20:13:37 -07:00
Enrico Ros 28817bee72 Folders: only drag on the icon 2025-08-05 20:04:04 -07:00
Enrico Ros 3fd41329ea ChatDrawer: improve stepping of the progressive limiter 2025-08-05 20:04:04 -07:00
Enrico Ros f734f0b5f7 ChatDrawer: limit items (ListItems are on the heavy side) 2025-08-04 23:55:37 -07:00
Enrico Ros 3b34a8b96d Fix blob conversion types with newer TS 2025-08-04 14:19:52 -07:00
Enrico Ros 74e6ee4b2d Posthog: improve config (if/when enabled) 2025-08-04 14:15:45 -07:00
Enrico Ros 61929527a3 Roll packages: tRPC, zod, zustand, typescript 2025-08-04 14:13:50 -07:00
Enrico Ros a3e216c956 Logger: max 200 entries 2025-08-02 19:20:37 -07:00
Enrico Ros d4203f728e Nicer replacement of ph when interrupted 2025-08-01 15:34:11 -07:00
Enrico Ros 616376f4ac BackupRestore: Flash: vendor prefix 2025-08-01 14:45:49 -07:00
Enrico Ros 1a309c9bdf BackupRestore: Flash: change schema/version 2025-08-01 14:36:07 -07:00
Enrico Ros 253fc3b213 Approximate Tokenization - optimized 2025-08-01 14:27:05 -07:00
Enrico Ros a79fd0a10c Approximate Tokenization - default on new Mobile installs 2025-08-01 14:26:26 -07:00
Enrico Ros 04df3dcba8 Butter-smooth decimator with deadlines 2025-07-31 11:33:13 -07:00
Enrico Ros 00fbf77dbd Raise update frequency due to optimizations 2025-07-31 09:07:47 -07:00
Enrico Ros 9a34c1e376 References (to Image Assets): restore places that assumed 'image_ref' 2025-07-30 18:06:40 -07:00
Enrico Ros e248104d4b Reference: embed a ref summary 2025-07-30 17:11:22 -07:00
Enrico Ros c10558f230 Reference: cleanup migrated fragments 2025-07-30 16:26:41 -07:00
Enrico Ros 5be41b8199 References: add support (migration, creation (attaching, aix.reassembler, t2i , ego-msg), gc dblobs collection, render image, render button icon, dereference, bucketing, token count) with exhaustive checks 2025-07-30 16:09:05 -07:00
Enrico Ros d6b6e30cf5 Fragments: References 2025-07-30 14:52:26 -07:00
Enrico Ros 825ca7ba87 Fragments: soft-track the originId for multi-origin fragments in messages 2025-07-30 10:26:40 -07:00
Enrico Ros 5c2a8a4996 Tools: fragment type discrimination 2025-07-30 06:49:18 -07:00
Enrico Ros 58aef2a97d Don't transmit ui-side unhandled exceptions (the handler shall be there already) 2025-07-29 16:10:13 -07:00
Enrico Ros e983f9d8a9 Domain for errors 2025-07-29 16:09:46 -07:00
Enrico Ros 7e95dcc1cb PDF: roll pdfjs 2025-07-29 16:03:03 -07:00
Enrico Ros 69a21a82ac PDF: prevent loading issues 2025-07-29 15:56:17 -07:00
Enrico Ros 95d2fee63d Paste.gg: fully removed, the website seems up no more 2025-07-29 13:08:01 -07:00
Enrico Ros 6f22a71555 PostHog: debug valid client-side errors (if Error is provided) 2025-07-29 12:51:56 -07:00
Enrico Ros a30409fcfc Stacked bar: improve 2025-07-29 08:11:24 -07:00
Enrico Ros 217346f572 BackupRestore: partial export (no/settings) 2025-07-29 07:20:38 -07:00
Enrico Ros 4472671470 Stacked bar: support node titles 2025-07-28 16:07:21 -07:00
Enrico Ros b1a026bdd1 Prodia: remove for now as the API keeps changing and is not a good default for our users - may put this back anytime (#786) 2025-07-26 11:06:34 -07:00
Enrico Ros 312fae5f6d OpenAI: T2I: Auto-select the latest model 2025-07-26 10:53:18 -07:00
Enrico Ros 46235aa28a T2I: Auto-select the provider 2025-07-26 10:53:07 -07:00
Enrico Ros 6fe0e297eb Remove Prodia image generation - does not seem to be up to the quality par 2025-07-26 08:46:49 -07:00
Enrico Ros 3b3214ef5e Roll packages 2025-07-26 06:21:20 -07:00
Enrico Ros ebc28ed8a4 Entangled: add DMessage metadata 2025-07-22 07:42:37 -07:00
Enrico Ros 54c23a9907 Update packages 2025-07-21 14:14:55 -07:00
Enrico Ros 8fce40be80 PostHog: revert changes to the Edge route (details inside)
Build fails with:
./node_modules/posthog-node/lib/edge/index.mjs
Module not found: Can't resolve 'crypto' in '/vercel/path0/node_modules/posthog-node/lib/edge'
2025-07-21 14:08:13 -07:00
Enrico Ros 5c5a213c4c PostHog errors: cleanups 2025-07-21 13:28:20 -07:00
Enrico Ros 3b730680cb PostHog errors: cloud/edge routes 2025-07-21 13:28:20 -07:00
Enrico Ros d7765ae578 PostHog errors: add node/edge utility functions 2025-07-21 13:28:19 -07:00
Enrico Ros ab21d5c308 PostHog errors: add node/edge package 2025-07-21 13:28:18 -07:00
Enrico Ros 392319a300 PostHog errors: remove error capture 2025-07-21 13:28:17 -07:00
Enrico Ros bee5f950b9 PostHog: client-side on ErrorBoundary (escape the auto capture) 2025-07-21 12:42:42 -07:00
Enrico Ros 3fc1e3f643 Show the load progress past 500ms 2025-07-21 09:47:57 -07:00
Enrico Ros ee10b39866 PostHog: reduce config 2025-07-21 08:47:37 -07:00
Enrico Ros 867265fd31 PostHog: verbosity off 2025-07-21 08:19:20 -07:00
Enrico Ros 68109a4a37 PostHog: integrate in build 2025-07-21 08:08:57 -07:00
Enrico Ros 874401ef8c PostHog: add cli/nextjs conf 2025-07-21 07:59:11 -07:00
Enrico Ros 303a3f2c7d New react-player 3 2025-07-19 13:15:01 -07:00
Enrico Ros 915f338378 Deep roll 2025-07-19 12:24:47 -07:00
Enrico Ros fd2e1fe34b Roll packages 2025-07-19 12:23:18 -07:00
Enrico Ros e2e7453431 Roll zod@4 2025-07-19 12:22:17 -07:00
Enrico Ros b07573ec4f chat.message: nit 2025-07-17 11:13:53 -07:00
Enrico Ros 66c279e895 Telephone: fix for cancel 2025-07-17 11:06:36 -07:00
Enrico Ros 06e879b884 storageUtils: don't keep requesting permission once granted 2025-07-16 12:19:22 -07:00
Enrico Ros f205dafe4d AppChat: lazy-load modals 2025-07-16 10:48:19 -07:00
Enrico Ros fec18d7039 ModelsModals: extract auto-open 2025-07-16 10:06:30 -07:00
Enrico Ros 5ef09455da ModelsModals: lazy load 2025-07-16 10:06:12 -07:00
Enrico Ros c799869e3b ModelsModals: extract the configurator modal 2025-07-16 10:01:02 -07:00
Enrico Ros 448f5a85d0 LLMs: centralize icons 2025-07-16 09:35:46 -07:00
Enrico Ros 9909a537c2 Modals: lazy load Settings 2025-07-16 09:16:45 -07:00
Enrico Ros 9772a18bf4 LLMs: centralize config UI into 1 function 2025-07-16 08:59:30 -07:00
Enrico Ros 0ac80b26bd Gemini: remove unused options 2025-07-16 08:59:30 -07:00
Enrico Ros 1f5e25a57b Deep roll packages 2025-07-16 08:36:14 -07:00
Enrico Ros 8e5f7ef977 Roll safe packages 2025-07-16 08:24:19 -07:00
Enrico Ros ed21c8affd Logger: improve console output snr 2025-07-15 16:37:19 -07:00
Enrico Ros 023228c2c5 AnimUtils: add opacity pulse 2025-07-14 15:16:56 -07:00
Enrico Ros 68f4118bde AIX: needs a roll for the xAI changes 2025-07-11 17:20:22 -07:00
Enrico Ros 0edc839857 xAI: final touches, it's good now 2025-07-11 17:04:04 -07:00
Enrico Ros ee6f560388 xAI: perfect search 2025-07-11 16:48:42 -07:00
Enrico Ros c100355b7b xAI: models: define search support 2025-07-11 16:47:46 -07:00
Enrico Ros 4f7402c343 xAI: support Live Search on X, Web, etc. 2025-07-11 16:13:21 -07:00
Enrico Ros 5ac73e9599 xAI: update Grok models 2025-07-11 15:30:45 -07:00
Enrico Ros c1e46e00d9 Improve looks of pure-markdown pre code blocks 2025-07-10 23:18:01 -07:00
Enrico Ros 7a05f0f9ab Exclude unused 2025-07-10 20:47:47 -07:00
Enrico Ros afcd511893 More permissions 2025-07-09 17:12:31 -07:00
Enrico Ros 8f42900e8e PerfUtils: interval annotation function 2025-07-09 07:21:00 -07:00
Enrico Ros bcc12876d7 Optima Pane Peek: 25% faster enter 2025-07-08 21:32:49 -07:00
Enrico Ros e1c2f85bda Beam: change count looks 2025-07-07 09:43:45 -07:00
Enrico Ros 6989a807d6 InlineTextArea: stop event propagation on 'esc' 2025-07-06 18:57:56 -07:00
Enrico Ros d92739c793 Remove warning 2025-07-03 19:07:54 -07:00
Enrico Ros 2fcb80b932 Logger: prevent localStorage overflow 2025-07-02 15:14:57 -07:00
Enrico Ros 03b0e88ef7 ChatMessageList: fragment replace as done 2025-07-01 19:20:24 -07:00
Enrico Ros a5a73ddbef Release: const dev build 2025-07-01 14:29:52 -07:00
Enrico Ros eb57147ed3 BackupRestore: improve DB schema restore 2025-07-01 09:08:50 -07:00
Enrico Ros 0cf12d2a8f Rename Chat Bars 2025-07-01 08:49:52 -07:00
Enrico Ros 06d332e785 Restore: selective restore 2025-07-01 08:37:49 -07:00
Enrico Ros a75eaaec69 Bubble: cut tool 2025-06-29 06:58:59 -07:00
Enrico Ros 513ee36027 LLMs: Gemini: update visibilities 2025-06-27 11:09:52 -07:00
Enrico Ros 975f425ae4 LLMs: Gemini: update models 2025-06-27 11:05:45 -07:00
Enrico Ros c310ca9c5c LLMs: Gemini: remove obsolete 2025-06-27 10:41:39 -07:00
Enrico Ros 21a6f0aa50 LLMs: Gemini: update pricing 2025-06-27 10:41:31 -07:00
Enrico Ros c2c3fdf7d4 AIX: Gemini: update parser (thinking) 2025-06-27 10:41:00 -07:00
Enrico Ros ce0880bf5b AIX: OpenAI Responses: support for item done (web_search_call)
Still missing: annotation support, web search step reporting, sequence machine on the 3 new added events
2025-06-27 00:23:54 -07:00
Enrico Ros eed099bfed AIX: OpenAI Responses: text annotations (not implemented yet) 2025-06-27 00:19:07 -07:00
Enrico Ros 08b37efb55 AIX: OpenAI Responses: support web_search_call events 2025-06-27 00:07:25 -07:00
Enrico Ros 8443445ed0 AIX: OpenAI Responses: support the web_search_preview Tool 2025-06-27 00:07:24 -07:00
Enrico Ros d011599060 OpenAI: add Deep-Research o3 and o4-mini 2025-06-26 23:13:55 -07:00
Enrico Ros 0dd043cb6a Zod: tree-shakeable 2025-06-26 15:51:48 -07:00
Enrico Ros 1ebd1d9e15 Zod-4: reduce deprecated 2025-06-26 12:47:33 -07:00
Enrico Ros 202aef8916 tRPC use stable httpBatchStreamLink 2025-06-26 12:46:38 -07:00
Enrico Ros 30acf51410 Lints 2025-06-26 12:46:25 -07:00
Enrico Ros d4b01398c7 Remove zod-to-json-schema 2025-06-26 12:01:40 -07:00
Enrico Ros 4dde3d0fe7 Zod-4: Migrate 2025-06-26 12:01:40 -07:00
Enrico Ros 8aa6fd7c8e Zod-4: for JSON schema 2025-06-26 12:01:26 -07:00
Enrico Ros e2e6e6d641 Zod: qualify records 2025-06-26 12:00:59 -07:00
Enrico Ros 20aa91b9a6 Link: change the dataObject to any, from passthrough 2025-06-26 12:00:59 -07:00
Enrico Ros 7bfd82ae4f AIX: OpenAI: remove unnecessary default 2025-06-26 12:00:59 -07:00
Enrico Ros c5101ee4cf LLMs: remove unnecessary validation 2025-06-26 11:56:31 -07:00
Enrico Ros 378f390941 AIX: FC: convert to z.json() where not string (DEF, gemini Call/Res).
Annotate FC-DEF/FC/FC-R
2025-06-26 11:56:31 -07:00
Enrico Ros 3bc8360959 AIX: Anthropic fix 2025-06-26 11:56:31 -07:00
Enrico Ros af124e7cd9 Roll packages deep 2025-06-26 09:27:27 -07:00
Enrico Ros 71633ff441 Roll packages 2025-06-26 09:13:20 -07:00
Enrico Ros daf2e58c99 Mistral: turn off gaps debug 2025-06-25 15:38:41 -07:00
Enrico Ros 3818af2156 Mistral: full auto-spec of models 2025-06-25 15:32:43 -07:00
Enrico Ros dd0fd2edcf AIX: fw-compatible check deprio 2025-06-25 13:14:57 -07:00
Enrico Ros 07304c6d0e AIX: OpenAI: Responses: Tools check 2025-06-25 13:02:50 -07:00
Enrico Ros 4db1708fae LLMs: OpenAI: enable o3-pro, o1-pro, codex-mini 2025-06-25 12:54:42 -07:00
Enrico Ros 0952926265 AIX: OpenAI: Responses: parser NS/S complete
NOTE: check the console for ANY log. We don't throw to complete requests, but we make
large assumptions on the ordering/sequencing of events.
2025-06-25 12:53:19 -07:00
Enrico Ros a695484921 AIX: OpenAI: Responses: wires 2025-06-25 12:52:12 -07:00
Enrico Ros 55c3eb4cf0 AIX: OpenAI: Responses: dispatch fixes 2025-06-25 09:19:02 -07:00
Enrico Ros 8e42356956 Metrics: include dtAll 2025-06-25 09:19:02 -07:00
Enrico Ros 255ef64b37 AIX: roll AIX 2025-06-24 22:26:25 -07:00
Enrico Ros e3f1307b30 LLMs: OpenAI: change pSpecs for restore markdown 2025-06-24 22:25:57 -07:00
Enrico Ros 93beda7fff AIX: OpenAI: Responses NS parser 2025-06-24 22:21:21 -07:00
Enrico Ros 91251985db AIX: OpenAI: responses interfaces 2025-06-24 19:29:21 -07:00
Enrico Ros b41cb74f45 AIX: response API dispatch switch 2025-06-24 19:23:31 -07:00
Enrico Ros 303b90d1ee AIX: response API dispatcher 2025-06-24 19:20:57 -07:00
Enrico Ros 86f80a320d AIX: response API model annotations 2025-06-24 19:20:44 -07:00
Enrico Ros d4e158a8b6 OpenAI Responses: Wires 2025-06-24 19:18:01 -07:00
Enrico Ros f58eae623a ERC: fix overflow 2025-06-24 13:38:02 -07:00
Enrico Ros bc5493ed50 Zero State models 2025-06-24 10:42:02 -07:00
Enrico Ros 4e51f26ef2 RenderCode: fixed line numbers 2025-06-24 07:44:55 -07:00
Enrico Ros 04226eb686 Attract to Model Config 2025-06-23 12:43:00 -07:00
Enrico Ros f9743fd04b GoodModal: options to not react on Backdrop or Escape closes 2025-06-23 11:38:23 -07:00
Enrico Ros b9746ef100 GoodModal: fix the drag-closes issue
When clicking inside the dialog and dragging on the backdrop, the dialog would
close.

Now we only close if initiated within the dialog.
2025-06-23 11:33:43 -07:00
Enrico Ros 92e56c3c84 Lint 2025-06-23 09:38:42 -07:00
Enrico Ros aa134d7f21 Fix build 2025-06-23 09:38:21 -07:00
Enrico Ros f2bea1867c Perplexity: early variants support (disabled) 2025-06-23 09:18:37 -07:00
Enrico Ros a55acf5146 Perplexity: fix first response 2025-06-23 09:10:22 -07:00
Enrico Ros 869b9b994d Perplexity: enable search context size 2025-06-23 08:27:15 -07:00
Enrico Ros 93fca32e9a Move Date Range config 2025-06-23 08:26:58 -07:00
Enrico Ros 1d7dfd53f4 Improve Search Context config 2025-06-23 08:26:48 -07:00
Enrico Ros a68f35d909 Citations: render date 2025-06-23 07:59:13 -07:00
Enrico Ros f800639e1a Citations: add date to Fragments 2025-06-23 07:54:59 -07:00
Enrico Ros ed45a01267 Perplexity: fix system-only message, or assistant-before-user 2025-06-23 07:39:50 -07:00
Enrico Ros b0634e272d Perplexity: improved search results support (with title) 2025-06-23 07:25:12 -07:00
Enrico Ros d90f012140 Perplexity: reorder citations to not interrupt the first reasoning block 2025-06-23 07:14:26 -07:00
Enrico Ros 41363a534f Perplexity: update models 2025-06-23 07:00:53 -07:00
Enrico Ros 44d53e581b Perplexity: add Academic research and range 2025-06-23 07:00:44 -07:00
Enrico Ros 5aeb034945 OAI: fix spell 2025-06-23 06:33:06 -07:00
Enrico Ros 13a95db7a4 ElevenLabs: return the buffer, optionally 2025-06-22 13:00:05 -07:00
Enrico Ros 1705461e80 Logger: deduplicate 2025-06-21 18:59:34 -07:00
Enrico Ros 3fa7d61c7e ElevenLabs: return play status 2025-06-21 09:50:07 -07:00
Enrico Ros 0b8268fea3 ChatMessage: imperative handle 2025-06-20 16:41:57 -07:00
Enrico Ros 22ffc74371 Drawer: option to pin it 2025-06-20 15:49:37 -07:00
Enrico Ros 31edb6a881 RenderCode: nowrap 2025-06-18 07:52:15 -07:00
Enrico Ros b8245095c9 CleanerMessage: improve layout 2025-06-17 16:03:46 -07:00
Enrico Ros ed26e57352 CleanerMessage: display the presence of attachments, images, etc. 2025-06-17 15:40:28 -07:00
Enrico Ros ea8a757b19 Fix drawers size 2025-06-17 07:52:05 -07:00
Enrico Ros b5d1e5f6c9 Revert "CSS: round() ... [WARNING]" - Not risking it in v2-dev.
This reverts commit 142a4495a6.
2025-06-16 13:08:40 -07:00
Enrico Ros 142a4495a6 CSS: round() to fix potential blurs [WARNING]
Warning: older browsers will ignore the entire CSS lines containing round() calls.

However we already introduced top-level layout rounds in 85e4946f (Fix fractional sizes of drawer and pane).

To restore support of old browsers, calls to 'round()' need to be stripped of the round part.
2025-06-16 13:04:34 -07:00
Enrico Ros 7a9a21c02e Drawer/Panel: fix fractional sizes and shade 2025-06-16 12:24:55 -07:00
Enrico Ros a60c84987d Mobile Drawer/Panel: suppress fractional sizes 2025-06-16 09:27:53 -07:00
Enrico Ros 3150900e13 Drawer: remove the thin border line, in case 2025-06-16 09:26:32 -07:00
Enrico Ros 85e4946ff5 Fix fractional sizes of drawer and pane 2025-06-16 09:26:05 -07:00
Enrico Ros dbf6ad70f5 Mark optima input wrappers 2025-06-16 08:54:09 -07:00
Enrico Ros bf7a16559b Panel: restore gaps 2025-06-16 08:47:04 -07:00
Enrico Ros fa4c78c9c2 Remove duplicate models configuration 2025-06-16 08:06:35 -07:00
Enrico Ros 9d99f46f3c Patch to show a 22px first icon rather than 20 2025-06-16 08:00:08 -07:00
Enrico Ros 5dc86c5649 Remove extra Textsms/Outlined icons 2025-06-16 07:59:54 -07:00
Enrico Ros fa82083670 AppChat: nav: improve icon 2025-06-16 07:59:38 -07:00
Enrico Ros fa3bff3e6d Restore active icons 2025-06-16 07:58:22 -07:00
Enrico Ros 9d68b26868 Chats icon 2025-06-16 07:58:11 -07:00
Enrico Ros 47a0214105 Panel: peek after 1 second
Note that we need to remove the leave handler from the hovered button as it's
covered by the panel itself, and that would trigger a loop
2025-06-16 07:30:17 -07:00
Enrico Ros 82ea6fef3d Fix z-index of Selection header 2025-06-15 17:22:28 -07:00
Enrico Ros eec61adad1 Drawer/Pane: fix data-closed 2025-06-15 17:14:32 -07:00
Enrico Ros ada9fb10e8 Drawer/Pane: switch to aria-expanded from aria-hidden (inverted) 2025-06-15 15:29:00 -07:00
Enrico Ros c2bd9c3310 Unnecessary mouse tracking 2025-06-15 12:50:21 -07:00
Enrico Ros ba93062638 Peeking: discard dedicated action getter 2025-06-15 11:44:34 -07:00
Enrico Ros 61366b7096 Panel: add peeking support to the store 2025-06-15 11:44:34 -07:00
Enrico Ros e1dd9c0117 Drawer: remove unused soft unmount 2025-06-15 11:12:02 -07:00
Enrico Ros 407d3d8db4 Panel: optimize with CSS 2025-06-15 11:12:02 -07:00
Enrico Ros 5a2fa26dad Drawer: optimize opening with css 2025-06-15 10:57:21 -07:00
Enrico Ros fd22faeef8 Drawer: adjust 'peek' timings 2025-06-15 10:57:21 -07:00
Enrico Ros 76c5ef46d0 Drawer: 'peek' functionality, for faster chat switch/etc when the drawer is closed (testing) 2025-06-14 20:46:47 -07:00
Enrico Ros 1e725984cd ScratchClip: support for adding the current clipboard content 2025-06-14 17:26:22 -07:00
Enrico Ros 12c6b6f59b Compact: rename from compress/minify 2025-06-13 11:51:01 -07:00
Enrico Ros 4e1d7f0b82 InlineTextArea: auto-select all on edit
Except for the Fusion instructions, and the Compact
2025-06-13 11:11:36 -07:00
Enrico Ros 0635edbfff FormSelectControl: don't overflow (but no wrap for now) 2025-06-13 07:33:21 -07:00
Enrico Ros 07e2ab07ab Improve Reasoning Traces ordering 2025-06-13 07:33:00 -07:00
Enrico Ros 134d82c673 Improve Reasoning Traces messaging 2025-06-13 07:28:00 -07:00
Enrico Ros 947f9c8355 InlineError: fix break on mobile 2025-06-13 07:05:15 -07:00
Enrico Ros 5e6575a63d Tryfix Sharp differently. 2025-06-12 14:16:05 -07:00
Enrico Ros bef61a8547 Revert "Sharp: fix windows build with a win32 dev dependency"
This reverts commit 7eb8c08e6e.
2025-06-12 14:10:05 -07:00
Enrico Ros 7eb8c08e6e Sharp: fix windows build with a win32 dev dependency
Background: all of a sudden Sharp started not building anymore with the following error message:

```
./public/images/covers/release-cover-v1.12.0.png
Error: Could not load the "sharp" module using the win32-x64 runtime
Possible solutions:
- Ensure optional dependencies can be installed:
    npm install --include=optional sharp
- Ensure your package manager supports multi-platform installation:
    See https://sharp.pixelplumbing.com/install#cross-platform
- Add platform-specific dependencies:
    npm install --os=win32 --cpu=x64 sharp
- Consult the installation documentation:
    See https://sharp.pixelplumbing.com/install
    at Object.<anonymous> (PATH\node_modules\sharp\lib\sharp.js:113:9)
    at Module._compile (node:internal/modules/cjs/loader:1730:14)
    at Object..js (node:internal/modules/cjs/loader:1895:10)
    at Module.load (node:internal/modules/cjs/loader:1465:32)
    at Function._load (node:internal/modules/cjs/loader:1282:12)
    at TracingChannel.traceSync (node:diagnostics_channel:322:14)
    at wrapModuleLoad (node:internal/modules/cjs/loader:235:24)
    at Module.<anonymous> (node:internal/modules/cjs/loader:1487:12)
    at mod.require (PATH\node_modules\next\dist\server\require-hook.js:65:28)
    at require (node:internal/modules/helpers:135:16)
```

This is without changing anything in the system nor in the build. May be a faulty env detection, and happens across all branches.

Deploying this and trying it out.
2025-06-12 14:01:16 -07:00
Enrico Ros aed5272b6c Roll packages 2025-06-12 13:32:45 -07:00
Enrico Ros 13e0779ced OpenAI: update models 2025-06-11 14:11:14 -07:00
Enrico Ros 702006f6ea Remove the @mui/material dependency (brought in as peer by @mui/icons-material, stuck to ^5 for Joy) 2025-06-11 13:23:13 -07:00
Enrico Ros b4fad03c46 Roll packages deeper 2025-06-11 13:13:07 -07:00
Enrico Ros 77e43a4a7e Roll packages 2025-06-11 13:08:37 -07:00
Enrico Ros cfd21e7abb Roll tRPC 2025-06-11 13:05:31 -07:00
Enrico Ros db490bf4fb Sticky Code: fix on Beam 2025-06-10 22:26:18 -07:00
Enrico Ros bc6f3401f8 Pane: proportional header font 2025-06-10 13:44:19 -07:00
Enrico Ros e5c0079f0e Pane: smaller font 2025-06-10 13:16:54 -07:00
Enrico Ros a68d80f7aa Mobile/Pane: use a +1 scaling bump to still allow for xs 2025-06-10 11:43:43 -07:00
Enrico Ros 872c9e9e3b System -> Show Instruction 2025-06-10 11:25:42 -07:00
Enrico Ros 0e51924e5e OptimaPanelGroupedList: remove margin 2025-06-10 11:21:53 -07:00
Enrico Ros c9460a07ef Preferences: rename 2025-06-10 11:01:42 -07:00
Enrico Ros f8d80730fe Mobile: Panes: larger items 2025-06-10 11:00:14 -07:00
Enrico Ros c2e0cd844b Auto Hide on Mobile too 2025-06-10 10:37:41 -07:00
Enrico Ros 5493896392 BTL: style for mobile nav 2025-06-10 09:03:49 -07:00
Enrico Ros 1ad3cb460e Mobile Nav Menu: style 2025-06-10 09:03:49 -07:00
Enrico Ros 721e23de68 Mobile Drawer: unfilter backdrop 2025-06-10 08:47:58 -07:00
Enrico Ros 97b9f5a232 Mobile Drawer: correctly compress inserted content 2025-06-10 08:47:58 -07:00
Enrico Ros 1a9f5a4fda Mobile: transfer App menu to the Drawer 2025-06-10 08:47:58 -07:00
Enrico Ros b2153a14d8 ChatPane: smaller switch 2025-06-10 06:33:45 -07:00
Enrico Ros 8d6499a91c ChatPane: 2-col actions 2025-06-10 06:33:37 -07:00
Enrico Ros 6d6fbac01f Vector Clock Device IDs: SSR fix 2025-06-09 20:13:08 -07:00
Enrico Ros d576e2387e Improve Vector Clock Device IDs 2025-06-09 20:08:51 -07:00
Enrico Ros 4e255a355f Auto Hide: fix mobile compression with an outer div 2025-06-09 12:17:50 -07:00
Enrico Ros 94401f95d7 Auto Hide on Mobile too 2025-06-09 11:37:32 -07:00
Enrico Ros 739f613881 Auto Hide The Message Box
Added option in "Settings > Labs" to auto-hide the Composer.
Fixes #812
2025-06-08 17:57:56 -07:00
Enrico Ros 5dc24557e6 Chat Pane Focus: Fix 'close other panes' 2025-06-08 12:09:10 -07:00
Enrico Ros 65842a976e Chat Pane Focus: AppChat bits 2025-06-08 12:00:44 -07:00
Enrico Ros c6dfc66a14 RenderCode: sticky overlay 2025-06-07 11:34:08 -07:00
Enrico Ros bc54967720 Metrics: fix tier tokens calculations 2025-06-06 13:01:39 -07:00
Enrico Ros 1112aa292f Gemini: support tiered cache pricing 2025-06-06 11:17:40 -07:00
Enrico Ros 31bb06293d Gemini: auto-caching and reasoning tokens parsing 2025-06-06 11:16:29 -07:00
Enrico Ros 0139f0421b Merge branch 'sam0jones0-v2-dev' into v2-dev 2025-06-06 11:12:14 -07:00
Enrico Ros 4f63e98e7f OpenRouter: support for Anthropic thinking variants via the OpenAI protocol. #811 2025-06-06 11:11:13 -07:00
Enrico Ros c04e147ca7 Gemini: full thinking budget support, including showing the reasoning traces! 2025-06-06 09:36:00 -07:00
Enrico Ros b88feeac2c Gemini: caching pricing 2025-06-06 09:20:36 -07:00
Enrico Ros 0902c35e13 Gemini: support today's 2.5 Pro 2025-06-06 09:20:02 -07:00
Enrico Ros e02ee99d26 Typo 2025-06-05 15:57:51 -07:00
Enrico Ros 313313db1f Add TODO 2025-06-05 12:10:52 -07:00
Sam Jones b7bdae00f8 tidy up 2025-06-05 17:35:39 +01:00
Sam Jones b699a665a1 Use reasoning instead of thinking for openrouter 2025-06-05 17:29:43 +01:00
Sam Jones b28a282aba Remove extraprops 2025-06-05 16:50:41 +01:00
Sam Jones a30d2ca025 fix indent again 2025-06-05 16:48:21 +01:00
Sam Jones f7f3929342 fix indent 2025-06-05 16:47:59 +01:00
Sam Jones 35abb6e69d remove comment 2025-06-05 16:47:23 +01:00
Sam Jones b759be62ea Add Claude 4 via openrouter thinking support 2025-06-05 16:32:15 +01:00
Enrico Ros 9a2db4a6e9 Typo fix 2025-06-04 19:30:51 -07:00
Enrico Ros 5bff478d06 Rationalize Single Desktop Overflow Menu 2025-06-04 16:55:40 -07:00
Enrico Ros 3a7402b03d Nav: hide Tokenizer (dev only and hidden) 2025-06-04 16:55:40 -07:00
Enrico Ros d076e73de6 Add PhSquaresFour icon 2025-06-04 16:55:40 -07:00
Enrico Ros 1d98a994d0 AppNews: dev build info only on mobile 2025-06-04 16:55:40 -07:00
Enrico Ros 3957fae782 ScratchClip: unfilter background 2025-06-04 16:55:40 -07:00
Enrico Ros 72c07faedf App: rc3 2025-06-04 16:55:40 -07:00
Enrico Ros be3b6ee394 App: add BaseProduct 2025-06-04 16:55:40 -07:00
Enrico Ros 61910827e6 News: fix keys 2025-06-04 16:55:40 -07:00
Enrico Ros 6582beaf2a Mobile: nav: hide a few apps 2025-06-04 16:55:40 -07:00
Enrico Ros 840223af6f More permissions for Claude Code 2025-06-04 16:55:40 -07:00
Enrico Ros a084b71682 DBlobs: selective types export 2025-06-04 12:56:09 -07:00
Enrico Ros 1dbe30af3d DBlobs: collapse the portability layer 2025-06-04 12:34:13 -07:00
Enrico Ros e57fbb88bf DBlobs: add a portability layer 2025-06-04 12:21:42 -07:00
Enrico Ros a5002b4c12 Remove asyncCanvasToBlob 2025-06-03 17:28:58 -07:00
Enrico Ros c139884671 Image Viewer: flush DBlob actions 2025-06-03 17:23:40 -07:00
Enrico Ros 2b97b0e0cf Image Viewer: bytes size 2025-06-03 17:23:09 -07:00
Enrico Ros 2e4176d41c Image Viewer: download 2025-06-03 16:54:11 -07:00
Enrico Ros 40d62b6f2d Image viewer extended: Attachment Fragments (user-top), Content 'image_ref' Fragments (assistant t2i / draw t2i). Remove openObjectRLInNewTab 2025-06-03 16:41:05 -07:00
Enrico Ros 43d7e19dfb imageUtils: operate only on Blob, free of base64 converters 2025-06-03 16:04:09 -07:00
Enrico Ros ef06071ab1 addDBImage: more Blob usage with removal of resizeBase64ImageIfNeeded 2025-06-03 15:54:39 -07:00
Enrico Ros 18578a63ec Rationalize openObjectRLInNewTab 2025-06-03 15:54:01 -07:00
Enrico Ros aab0beba93 Remove showing image URL in new tab 2025-06-03 15:40:51 -07:00
Enrico Ros 7d32de50a6 DBlobs: partially remove ContextId 2025-06-03 15:12:56 -07:00
Enrico Ros 57d91e330e Images: push Blob usage deeper 2025-06-03 15:08:55 -07:00
Enrico Ros a81da26452 Images: convert Attachments/Reassembler to use Blobs more than base64/base64dataUrls 2025-06-03 13:13:44 -07:00
Enrico Ros 803f6bbdea Canvas/Video: improve Blobs support 2025-06-02 16:40:53 -07:00
Enrico Ros 10a3669551 ImageUtils: support SVG to PNG conversion 2025-06-02 16:16:45 -07:00
Enrico Ros d910fbcae1 Roll pdfjs (4 -> 5) 2025-06-02 15:26:48 -07:00
Enrico Ros e2a6ee94b0 Update Claude Code settings 2025-06-02 15:21:56 -07:00
Enrico Ros 055a2134e0 Remove punycode overrides as we don't get the npm warning anymore
@ref https://github.com/nodejs/node/pull/56632
2025-06-02 15:05:29 -07:00
Enrico Ros 30310a51ff Roll packages 2025-06-02 14:56:54 -07:00
Enrico Ros be648017f5 Roll tRPC 2025-06-02 14:51:53 -07:00
Enrico Ros e737272a39 Enable Node 24 2025-06-02 14:50:55 -07:00
Enrico Ros d7a5c50ce3 Beam: change models on Merges 2025-06-02 14:16:19 -07:00
Enrico Ros a51d5c315f DeepSeek: update models 2025-06-02 10:17:11 -07:00
Enrico Ros 8c1af95b0e Add claude code permissions 2025-06-02 10:00:37 -07:00
Enrico Ros c4d61fdd21 ChutesAI: add support
This includes prettifying the model name, assuming the interfaces
(images are a force) and auto-sizing the context window based on the
response.
2025-06-02 09:53:05 -07:00
Enrico Ros 6301f1f6b5 BlockPartModelAux: parametrize reasoning colorpalette 2025-06-02 09:40:15 -07:00
Enrico Ros edbe2e55bc Accommodate Chutes.ai / sglang parsing 2025-06-02 09:32:59 -07:00
Enrico Ros 604cf43627 No persona selected: finite duration 2025-06-02 09:28:46 -07:00
Enrico Ros e124669545 Attachments: use Blobs, not ArrayBuffer, unless required 2025-06-01 13:52:20 -07:00
Enrico Ros 9ee7c6dddd Attachments: do not take image attachments to not require domain transfer 2025-06-01 09:46:39 -07:00
Enrico Ros 5136261c8e Attachments: open up to incoming web Blobs support, but still perform the whole chain in base64. 2025-05-30 12:08:53 -07:00
Enrico Ros c9ebb44442 Wire all up to BlobUtils 2025-05-30 11:25:30 -07:00
Enrico Ros 95d9976a2c BlobUtils: minor fix 2025-05-30 11:03:56 -07:00
Enrico Ros 1d177c960f Beam: shift to re-run active Beams 2025-05-30 11:02:27 -07:00
Enrico Ros 81a34ca96c BlobUtils: improve Exceptions 2025-05-30 10:06:19 -07:00
Enrico Ros 9749b44dbb BlobUtils: Add WithMetadata 2025-05-30 09:57:45 -07:00
Enrico Ros 6dfe2a92a1 BlobUtils: reduce zero-length checks 2025-05-30 09:25:06 -07:00
Enrico Ros 44646001c1 BlobUtils: add Blob <> X functions 2025-05-29 17:51:56 -07:00
Enrico Ros 088e67c235 Move server-side functions around 2025-05-29 16:22:20 -07:00
Enrico Ros 0d41c92c01 Revert "DMessageFragment: future: Audio Ref fragments"
This reverts commit ce7699c06b.
2025-05-28 17:29:57 -07:00
Enrico Ros e966674d39 Revert "AudioRef: placeholder render"
This reverts commit c6d3bbd7b9.
2025-05-28 17:29:57 -07:00
Enrico Ros ff74a8ed9c Revert "AudioRef: placeholder tokens"
This reverts commit ab217596d8.
2025-05-28 17:29:57 -07:00
Enrico Ros 64fd32de9a Revert "AudioRef: placeholder CGR"
This reverts commit 61b2bedf5e.
2025-05-28 17:29:56 -07:00
Enrico Ros 6584bb4cd1 Doc edit pane: fix controlled looks 2025-05-28 12:45:17 -07:00
Enrico Ros a9065d1a1e Doc edit pane: fix controlled editing 'ok' 2025-05-28 12:43:47 -07:00
Enrico Ros a22832f741 DocAttachments: show delete on edit empty 2025-05-28 12:42:06 -07:00
Enrico Ros 663a33a895 DocAttachmentFragmentPane: improve looks 2025-05-28 12:37:43 -07:00
Enrico Ros 5f7508633b Doc edit pane: fix formatting 2025-05-28 12:34:12 -07:00
Enrico Ros 6a99f65979 Beam: Fix Edit/Delete in (propagate undefined handlers) 2025-05-28 12:28:43 -07:00
Enrico Ros a983f25fb9 Doc Attachment: fix replacement optionality 2025-05-28 12:17:02 -07:00
Enrico Ros 7119d92321 Doc Attachment: edit titles, move the switch button to inside the tooltip 2025-05-28 12:15:22 -07:00
Enrico Ros 5f1a52d620 GoodModal: support autoOverflow 2025-05-28 12:05:52 -07:00
Enrico Ros 42d58ed202 Allow for empty edited fragments, unless the caller branches otherwise 2025-05-28 11:25:26 -07:00
Enrico Ros 20f0dd5b80 DocAttachmentFragments: support controlled editing (ignore the overlay state) 2025-05-28 11:07:45 -07:00
Enrico Ros d95e8b70b9 BlockEdit_TextFragment: support controlled passive editing (don't run onSubmit, don't overlay the edited text) 2025-05-28 10:45:57 -07:00
Enrico Ros 69d7f3f195 updateFragmentWithEditedText: DX annotations 2025-05-28 10:16:52 -07:00
Enrico Ros 61b2bedf5e AudioRef: placeholder CGR 2025-05-27 17:49:12 -07:00
Enrico Ros ab217596d8 AudioRef: placeholder tokens 2025-05-27 17:49:08 -07:00
Enrico Ros c6d3bbd7b9 AudioRef: placeholder render 2025-05-27 17:49:01 -07:00
Enrico Ros ce7699c06b DMessageFragment: future: Audio Ref fragments 2025-05-27 17:34:12 -07:00
Enrico Ros ca3df18d99 AIX: Gemini: also strip the system prompt - TTS only takes 1 message 2025-05-27 17:28:38 -07:00
Enrico Ros 0f96c9f825 AIX: Gemini: enable TTS models 2025-05-27 17:20:11 -07:00
Enrico Ros d6e41c1026 AIX: Gemini: parse TTS outputs 2025-05-27 17:20:03 -07:00
Enrico Ros bc1d0ef6e9 AIX: Audio: particle reassembly to speech (no Blob save for now) 2025-05-27 17:19:09 -07:00
Enrico Ros 352d1425ca AIX: Audio: model.part for chatGenerateContent, and 2 impls 2025-05-27 17:09:14 -07:00
Enrico Ros f92941f4a2 AIX: Audio: particle & transmit 2025-05-27 17:07:35 -07:00
Enrico Ros 4b6f6728fa AIX: Gemini: improve TTS support 2025-05-27 15:56:17 -07:00
Enrico Ros d12771d408 Beam: quick button beams existing 2025-05-27 15:34:53 -07:00
Enrico Ros 7a679dd7d8 LLMs: Gemini: update models 2025-05-27 15:14:53 -07:00
Enrico Ros 72ae27e419 AIX: Gemini: configure voice 2025-05-27 15:14:38 -07:00
Enrico Ros b5722ac9f5 LLMs: Gemini: disable tts-only, although likely supported 2025-05-27 15:14:22 -07:00
Enrico Ros 60b7a20b71 LLMs: Visual Audio interface annotations 2025-05-27 14:53:34 -07:00
Enrico Ros 33ea55ec9d LLMs: Anthropic: prettier name 2025-05-27 14:53:18 -07:00
Enrico Ros 294b1c1ea3 AIX: Gemini: output modality 2025-05-27 14:50:53 -07:00
Enrico Ros 75e19914cc AIX: basic output modality pattern 2025-05-27 14:50:10 -07:00
Enrico Ros e24bd418b5 AIX: Gemini: parser checker 2025-05-27 14:24:21 -07:00
Enrico Ros 66c1af8333 AIX: Gemini: add maxTemperature 2025-05-27 14:23:30 -07:00
Enrico Ros a0917b4533 llmSelect: fix insufficient width 2025-05-27 14:11:11 -07:00
Enrico Ros 74731d512f Browsing: improve debug (server-side) 2025-05-27 13:58:49 -07:00
Enrico Ros e0e8a94031 Browsing: improve debug 2025-05-27 13:52:23 -07:00
Enrico Ros 67306ec0f7 Remove usersnap 2025-05-27 11:27:38 -07:00
Enrico Ros a42cfe26e7 ProviderSingleTab: add bypass 2025-05-27 09:22:06 -07:00
Enrico Ros 9c63614367 Roll packages 2025-05-26 16:41:35 -07:00
Enrico Ros ccfc129e44 Partially revert 2894c070 - fixed by the tRPC upgrade 2025-05-26 16:39:37 -07:00
Enrico Ros ad3b500781 tRPC: upgrade to .canary.32 to fix the Vercel cloud infra shift. #805 2025-05-26 16:38:53 -07:00
Enrico Ros 2894c07049 Next Edge/tRPC: server-side delay (improved workaround) for the 'Stream closed' issue only. #805 2025-05-26 14:11:02 -07:00
Enrico Ros e189d3e174 Next: 15.1.8 2025-05-26 11:38:49 -07:00
Enrico Ros b9ead56ec4 Anthropic: naming of Claude models 2025-05-26 10:52:30 -07:00
Enrico Ros 48c4ac18ab AIX: emergency fix for a sudden Vercel/tRPC streaming issue
Suddenly all Vercel builds experienced exceptions and connection terminations.

On 2025-05-22 around 8PM CET, Vercel servers started to log errors on tRPC calls.

This fix waits 1 extra event loop tick. Shall work around the issue until a proper fix is found.
2025-05-22 13:22:38 -07:00
Enrico Ros 48d1bc7635 Anthropic: add Opus4 nd Sonnet4, w, w/o extended thinking 2025-05-22 11:25:31 -07:00
Enrico Ros 9112cef5f3 Gemini: added Flash Preview 05-20, also in the non-thinking variant 2025-05-20 17:32:03 -07:00
Enrico Ros ff0183b7e6 animate-enter: modify to a fade 2025-05-20 17:04:47 -07:00
Enrico Ros 14ef63b4d2 uiCounters: make space for the byok notice 2025-05-20 17:04:32 -07:00
Enrico Ros eac6228dde Stacked bar component 2025-05-20 17:04:16 -07:00
Enrico Ros 0d28934f37 FormInputKey: disable autofocus on mobile 2025-05-20 17:02:37 -07:00
Enrico Ros 57b694a93d Roll packages 2025-05-19 15:30:26 -07:00
Enrico Ros c2a1fcc942 Allow node 23 2025-05-19 15:12:31 -07:00
Enrico Ros 3fdd2fb04d Optima: Clipboard history 2025-05-19 12:34:36 -07:00
Enrico Ros 8d9c8f681e GoodModal: fix wrong promise state 2025-05-19 11:22:39 -07:00
Enrico Ros bdfd8fb526 GoodModal: propagate reason, even for the button 2025-05-19 10:46:35 -07:00
Enrico Ros 0d88217a78 Keystroke: size 2025-05-19 10:32:00 -07:00
Enrico Ros b15e27e1d3 Prod: models arrow 2025-05-19 10:31:54 -07:00
Enrico Ros 7db5d84e4d Prod: users icon 2025-05-19 10:28:37 -07:00
Enrico Ros c0a37d618a Prod: make tips less intrusive 2025-05-19 10:27:34 -07:00
Enrico Ros b20db5ff50 ScrollToBottom: Fix ResizeObserver warning 2025-05-19 08:30:23 -07:00
Enrico Ros 43e8d5639c Mobile: Default to Beam quick button 2025-05-17 11:32:31 -07:00
Enrico Ros 92ce0af012 RenderCode: potentially fix the initial syntax highlight (skipped 2 dom levels) 2025-05-17 11:31:45 -07:00
Enrico Ros fe76cfdd8b InlineTextarea: optional external syncing with the initial text, on change 2025-05-14 16:18:54 -07:00
Enrico Ros 738ff07e6a Optima: Heading: clickable text 2025-05-14 12:34:58 -07:00
Enrico Ros e11d3d7407 Optima: Heading: no bottom margin 2025-05-14 12:34:57 -07:00
Enrico Ros 70dd92f54d Flash: adapt for the Dexie x10 2025-05-14 09:59:32 -07:00
Enrico Ros 75381a2798 Flash: recreate v2-dev DBs with the last known stores/indices 2025-05-14 09:10:26 -07:00
Enrico Ros 29bddbc6ed ChatMessage: tint avatar button 2025-05-13 12:50:44 -07:00
Enrico Ros 2ca9baf6ba ChatMessageList: tint messages 2025-05-13 12:50:32 -07:00
Enrico Ros a796a98cd4 Composer: tint composer 2025-05-13 12:29:57 -07:00
Enrico Ros 02749c290c Composer: tint buttons 2025-05-13 12:29:06 -07:00
Enrico Ros ec13a9664c Tintable buttons 2025-05-13 12:29:06 -07:00
Enrico Ros a6d6f69d4e Composer: Mobile: change Quick button 2025-05-13 12:00:16 -07:00
Enrico Ros 6d4fb2b444 UI options: extract PageSize 2025-05-13 11:36:23 -07:00
Enrico Ros a9e3da8b21 llmSelect: Add Models 2025-05-13 11:26:28 -07:00
Enrico Ros 56adb0aa88 llmSelect: show features 2025-05-13 04:41:35 -07:00
Enrico Ros aa9dc1a06f Optima: Nav: mobile App name support 2025-05-13 03:28:09 -07:00
Enrico Ros e503335026 Optima: Dropdowns: 'compact' icons too 2025-05-13 03:21:55 -07:00
Enrico Ros 350aac79b1 Optima: Dropdowns: 'compact' symbols 2025-05-13 03:14:20 -07:00
Enrico Ros 56a36987c6 Prod: restore mobile composer Bg 2025-05-13 02:51:32 -07:00
Enrico Ros 5ef00eb42a Prod: pad the mobile composer 2025-05-13 02:43:43 -07:00
Enrico Ros 6aa52cf5e6 Prod: don't focus the mobile keyboard on new chat 2025-05-13 02:39:53 -07:00
Enrico Ros 6118d0f940 Prod: drop the border of the last message instead of using a mt:-1 on the composer 2025-05-13 02:03:14 -07:00
Enrico Ros bab7afdfba Chat cleanup: readonly array 2025-05-13 01:35:55 -07:00
Enrico Ros 865cf0652b jsonUtils: relax validation to undefined (off by default) 2025-05-13 01:26:09 -07:00
Enrico Ros 7126a952b9 Gemini: Update models 2025-05-12 14:48:19 -07:00
Enrico Ros 66eb325779 OpenAI: update scores 2025-05-12 14:24:04 -07:00
Enrico Ros a55411c150 Auto-launch debugger on error/crit/DEV if the env var is set. 2025-05-12 14:09:25 -07:00
Enrico Ros 77eb6fa97c Save one icon. 2025-05-12 01:45:07 -07:00
Enrico Ros 440b13fa48 Save one icon. 2025-05-12 01:45:07 -07:00
Enrico Ros 02f30524a3 AutoBlocks: mark blocks as partial if collapsed 2025-05-12 01:43:04 -07:00
Enrico Ros 26ad736aa1 Composer: aria: new message 2025-05-12 01:36:13 -07:00
Enrico Ros 040588d708 Rename 'Write' to 'Add' 2025-05-08 13:13:33 -07:00
Enrico Ros 5a635bb532 PageCore: optimize and darken on 'full' width 2025-05-04 16:19:12 -07:00
Enrico Ros e8014fccb3 PageCore: default to 'full' width 2025-05-04 16:16:24 -07:00
Enrico Ros 85586fdf58 PageCore: support brighter 2025-05-04 16:15:43 -07:00
Enrico Ros d819e03c79 RenderCode: optimize 2025-05-03 18:55:52 -07:00
Enrico Ros 0f2def82c1 Default to full width 2025-05-03 18:55:50 -07:00
Enrico Ros 634ae0e213 AltBarBeam: test hiding the maximize button 2025-05-03 14:03:43 -07:00
Enrico Ros d0d2051edf Chat: title for beam 2025-05-03 13:56:41 -07:00
Enrico Ros a2e0ddcf81 AltBarBeam: show chat title 2025-05-03 13:56:28 -07:00
Enrico Ros 82be58b54a Optima: cut bar overflow 2025-05-03 13:55:37 -07:00
Enrico Ros ba18891696 Optima: show out name 2025-05-03 13:55:23 -07:00
Enrico Ros f2df042c0a Feature Badge: fix 2025-05-03 13:54:21 -07:00
Enrico Ros 3547119577 jsonUtils: validate objects to be pure JSON with zod 2025-05-02 13:00:11 -07:00
Enrico Ros 66519ac33e Extract dontblurtextarea 2025-05-01 22:56:48 -07:00
Enrico Ros a8ae3aa124 Cherry-pixel 2025-05-01 18:36:06 -07:00
Enrico Ros 28a00bfb29 Composer: undim 2025-05-01 18:19:04 -07:00
Enrico Ros 8c46abbac3 ChipExpander: fix build 2025-05-01 18:02:32 -07:00
Enrico Ros ae7376a708 Shortcuts: unite rendering 2025-05-01 17:56:27 -07:00
Enrico Ros b2d0844959 ChipToggleButton: optional collapse caret 2025-05-01 17:22:40 -07:00
Enrico Ros a88ca25708 ChipToggleButton: support disabled 2025-05-01 17:22:29 -07:00
Enrico Ros 61acf9e56b ChipToggleButton: support variant 2025-05-01 17:22:01 -07:00
Enrico Ros 8b6ffca2cb ChipExpander: publish styles 2025-05-01 17:21:40 -07:00
Enrico Ros 56e3aa3835 Fix beam wrap post 5b562c66 2025-05-01 14:02:18 -07:00
Enrico Ros 11bbe22d80 OptimaBarDropdown: enlarge to 200px 2025-05-01 13:57:30 -07:00
Enrico Ros 777a6bb29d Pmix: elide on missing llms 2025-05-01 13:55:56 -07:00
Enrico Ros 16b91ba63a useLLMSelect: model dialog accelerator 2025-05-01 03:05:59 -07:00
Enrico Ros 0e0ed3d657 Logger: serialize errors 2025-05-01 01:47:49 -07:00
Enrico Ros 910cbb542e Logger: source forwarding 2025-04-30 22:10:17 -07:00
Enrico Ros 17cd63d445 Logger: prefix function for module adaptation 2025-04-30 22:10:17 -07:00
Enrico Ros 89a4283868 Built with Big-AGI 2025-04-30 22:10:17 -07:00
Enrico Ros 195e167414 @Events: disable the HMR dev warning 2025-04-29 00:27:56 -07:00
Enrico Ros d5a4fadebd LLMs: keep user edits 2025-04-28 20:53:08 -07:00
Enrico Ros 21178f4974 @Events: HMR message 2025-04-28 16:00:15 -07:00
Enrico Ros 80b22e6c2d Continue CGRs ignoring missing image dereferences. #733
This is an emergency workaround where formal correctness yields to an
unbroken chat experience.
2025-04-28 11:43:27 -07:00
Enrico Ros 9e02e0aabd Optimize selector performance for selection 2025-04-27 02:32:34 -07:00
Enrico Ros 3572b94e8f Emotion: improve style performance by removing the Checkbox 2025-04-27 02:11:53 -07:00
Enrico Ros c53fbe8c73 Smallfix. 2025-04-25 19:24:49 -07:00
Enrico Ros 16450a347e Prodia: update to v2 2025-04-25 19:03:12 -07:00
Enrico Ros 9a12164082 Image Generation: with ❤ (heartbeats) 2025-04-25 17:05:34 -07:00
Enrico Ros 3ba3ab41d2 Denooise 2025-04-25 16:13:04 -07:00
Enrico Ros 1ed31199ae OpenAI gpt-image-1: support image transforms/edits 2025-04-25 15:38:44 -07:00
Enrico Ros fc9caa79f8 OpenAI gpt-image-1: bring the images to the server as AIX-alike parts (for future proofing) 2025-04-25 15:38:38 -07:00
Enrico Ros 0c19d011cb OpenAI gpt-image-1: edit wiretypes 2025-04-25 15:38:38 -07:00
Enrico Ros b4eddbbc30 Image Edit: wire image inputs to the generator 2025-04-25 15:38:38 -07:00
Enrico Ros f522f5bbc6 AIX: export image dereferencing & inlining 2025-04-25 15:38:38 -07:00
Enrico Ros fde08e6793 Attachments: support image-only 2025-04-25 15:38:38 -07:00
Enrico Ros 4e8e7fa6cf tRPC fetchers: debug wire curl 2025-04-25 15:38:38 -07:00
Enrico Ros a79806e86c tRPC fetchers: support FormData automatically 2025-04-25 15:38:38 -07:00
Enrico Ros 8c0868418c urlUtils: base64 to Blob 2025-04-25 15:38:38 -07:00
Enrico Ros b90919a4df Composer: attach if can edit image 2025-04-25 11:54:21 -07:00
Enrico Ros 7f2842f9ba T2I: capabilities: can edit 2025-04-25 11:52:38 -07:00
Enrico Ros a7f0771ca9 Attachments: support image-only 2025-04-25 11:51:15 -07:00
Enrico Ros 47315ed4a5 Drawing: improve provider names & icon 2025-04-25 01:06:22 -07:00
Enrico Ros d4df2f989b OpenAI gpt-image-1: improve settings 2025-04-25 01:05:13 -07:00
Enrico Ros 3c369e11ae OpenAI gpt-image-1: settings 2025-04-24 23:56:31 -07:00
Enrico Ros 16ba957f3a OpenAI gpt-image-1: settings 2025-04-24 23:13:41 -07:00
Enrico Ros 88a8b10b95 OpenAI gpt-image-1: createImage server/client 2025-04-24 23:13:41 -07:00
Enrico Ros 3063c9950c OpenAI gpt-image-1: exclude from text models 2025-04-24 23:13:41 -07:00
Enrico Ros fdc5845d90 OpenAI gpt-image-1: wiretypes/parser 2025-04-24 23:13:41 -07:00
Enrico Ros 6f66e2a2bb OpenAI gpt-image-1: bits 2025-04-24 23:13:41 -07:00
Enrico Ros a57a41e676 Form Options: immutable 2025-04-24 23:13:41 -07:00
Enrico Ros 5209d2c416 Wire custom error boundary to the inside 2025-04-24 16:46:26 -07:00
Enrico Ros 64138cdcd2 Wire custom error boundary to the outside 2025-04-24 16:46:20 -07:00
Enrico Ros 80a5db3e91 Error resiliency with custom Error boundary 2025-04-24 16:46:06 -07:00
Enrico Ros fb7dfdf341 Error resiliency on duplicating parts that don't exist anymore 2025-04-24 15:32:51 -07:00
Enrico Ros b0f0e35170 Error resiliency on malformed Fragments data 2025-04-24 15:19:03 -07:00
Enrico Ros cb0cc8b370 AIX: FastAPI: relax parser for missing .object and .created 2025-04-24 12:55:31 -07:00
Enrico Ros 9e8a8cb7db LLMs: FastAPI: list heuristics 2025-04-24 12:48:41 -07:00
Enrico Ros c4959776dc textUtils: add humanReadableBytes 2025-04-23 20:34:35 -07:00
Enrico Ros d50d489de7 misc 2025-04-21 17:28:38 -07:00
Enrico Ros 9472de0246 useLLMSelect: style 2025-04-21 11:30:39 -07:00
Enrico Ros 5d2fff8e53 Beam: disable enter animation 2025-04-21 01:21:45 -07:00
Enrico Ros d68ca9df1e PaneTitle: improve ux 2025-04-21 01:21:24 -07:00
Enrico Ros 04a437e9a6 Beam: optimize reactive open hooks (save 1 cycle/desyncs) 2025-04-21 01:03:22 -07:00
Enrico Ros f7fb8c780b RenderMarkdown: disable preprocessing in-progress messages 2025-04-20 18:02:43 -07:00
Enrico Ros a7ebf8a014 Markdown parser: improved safety 2025-04-20 14:40:07 -07:00
Enrico Ros e950ced1a1 Markdown parser: recursion fix for inline latex math 2025-04-20 14:32:57 -07:00
Enrico Ros 15d5a9cb58 CleanerMessage: optimize entrance 2025-04-20 12:32:55 -07:00
Enrico Ros 5c9747d8eb Remove @t3-oss/env-nextjs 2025-04-18 19:14:20 -07:00
Enrico Ros d308739643 TS-based next.config & ~/server/env build 2025-04-18 19:12:14 -07:00
Enrico Ros 6c5db40bd0 Import t3-oss/t3-env: glue
Adapt the code to work for us.
2025-04-18 19:12:14 -07:00
Enrico Ros f3212291dd Import t3-oss/t3-env
We need to change this locally to migrate to Zod4 without third party dependencies.
2025-04-18 18:21:12 -07:00
Enrico Ros 140a829291 LLMs: Gemini: update sorting 2025-04-17 19:02:25 -07:00
Enrico Ros e30d938425 dMessageUtils: Gemini namings 2025-04-17 18:56:48 -07:00
Enrico Ros 521b6a414f Gemini: (geminate :) add the non-thinking variant 2025-04-17 18:39:58 -07:00
Enrico Ros a20d0f970e LLMs: update benchmark scores 2025-04-17 18:36:49 -07:00
Enrico Ros 66c1307112 LLMs: Parameters: support hidden params 2025-04-17 18:31:37 -07:00
Enrico Ros 241a25599f LLMs: Parameters: support initial spec value 2025-04-17 18:31:24 -07:00
Enrico Ros 3f610bf122 AIX: Gemini: full thinking support
Enables the v1alpha API (seems to be more reliable)
2025-04-17 17:25:26 -07:00
Enrico Ros 858cc41a89 Gemini: update models 2025-04-17 17:24:02 -07:00
Enrico Ros 244917faf9 LLMs: support Gemini Thinking Budget 2025-04-17 17:24:02 -07:00
Enrico Ros 08964188ea LLMs: cleanup parameter editor 2025-04-17 17:24:02 -07:00
Enrico Ros 07c96661e7 AIX: Gemini: support thinking budget 2025-04-17 17:24:02 -07:00
Enrico Ros 048940d383 AIX: Gemini: measure reasoning tokens 2025-04-17 17:24:02 -07:00
Enrico Ros 85ad0e1e86 FormSliderControl: start adornment and variant 2025-04-17 17:24:02 -07:00
Enrico Ros f2f06f5d44 Panes: improve title edit 2025-04-17 13:55:37 -07:00
Enrico Ros b8e0ef5340 Feature Badge 2025-04-17 13:37:00 -07:00
Enrico Ros 07608b3fe3 Mic: default timeout 2 -> 5 seconds - too quick (@dc) 2025-04-17 13:36:59 -07:00
Enrico Ros e808509331 Cleanup: allow to un-skip messages 2025-04-16 20:32:05 -07:00
Enrico Ros 2d9a4fccfa ChatDrawer: improve show archived 2025-04-16 18:47:15 -07:00
Enrico Ros 49cf263408 Logger: maximized 2025-04-16 17:45:58 -07:00
Enrico Ros 69e7dc7481 AIX: o3: enable the non-streaming option 2025-04-16 15:36:42 -07:00
Enrico Ros a76fe34a64 LLMs: Groq updates 2025-04-16 15:06:37 -07:00
Enrico Ros 7a3882fe28 LLMs: update CBA ratings 2025-04-16 14:53:48 -07:00
Enrico Ros 6954b79178 AIX: fix o1/3/4 formatting re-enabled 2025-04-16 13:40:12 -07:00
Enrico Ros 2032d045ca More precise text 2025-04-16 13:24:22 -07:00
Enrico Ros e4ee0c3ab6 Flush future utility fn 2025-04-16 13:23:48 -07:00
Enrico Ros 3218d00850 LLMs: OpenAI declutter 2025-04-16 13:13:30 -07:00
Enrico Ros 7f0b4f79ff AIX: o4 support 2025-04-16 13:13:22 -07:00
Enrico Ros 945ba0a34c Beam: debug print 2025-04-16 13:13:10 -07:00
Enrico Ros 676797f0ac LLMs: OpenAI declutter 2025-04-16 12:54:06 -07:00
Enrico Ros 8e89d5dbfc LLMs: OpenAI o3/o4 & full cleanup 2025-04-16 12:52:20 -07:00
Enrico Ros 150eb4e9e2 LLMs: moved some old Azure model defs 2025-04-16 12:45:10 -07:00
Enrico Ros 14766629a2 LLMs: naming: fw-compatible OpenAI ox/gpt-x 2025-04-16 12:44:05 -07:00
Enrico Ros f475c8ae6c ChatDrawer: clear archived if inadvertently set 2025-04-15 23:36:53 -07:00
Enrico Ros 11badbf22c @Events: improve domain helper 2025-04-15 23:30:15 -07:00
Enrico Ros aedc051523 WebSpeech: preserve partial (interim) dictation when truncated ahead of stabilization (e.g. mic transter/timeout) 2025-04-15 17:24:17 -07:00
Enrico Ros b5336eb63c Simple chat archival support 2025-04-14 18:31:30 -07:00
Enrico Ros 0c85ddd82d LLMs: display image generation and search capabilities 2025-04-14 13:50:17 -07:00
Enrico Ros f0386a21c6 LLMs: Perplexity: update desc 2025-04-14 13:35:45 -07:00
Enrico Ros a7518937f0 LLMs: Perplexity: add Deep Research 2025-04-14 13:34:09 -07:00
Enrico Ros 87b012f0be LLMs: OpenAI: 4.1 models. Fixes #792 2025-04-14 11:30:03 -07:00
Enrico Ros 6a7a34c0b0 LLMs: Anthropic: mark as reasoning 2025-04-11 07:58:31 -07:00
Enrico Ros 9ce29138d2 DNT-DT: safety clears (15s timeout + window blur (light)) 2025-04-11 07:41:38 -07:00
Enrico Ros 95a6e09158 Monotonics: ++Aix 2025-04-10 19:57:40 -07:00
Enrico Ros 0962b79149 LLMs: Gemini: update models
Includes AIX: Gemini: predictLongRunning
2025-04-10 19:56:44 -07:00
Enrico Ros 51ba5304a6 Models: double-check OpenRouter tokenizers (disabled) 2025-04-10 19:32:58 -07:00
Enrico Ros d7137d1311 Models: show reasoning models in list 2025-04-10 19:29:02 -07:00
Enrico Ros d8babc91d5 LLMs: Gemini: sorting 2025-04-10 19:24:15 -07:00
Enrico Ros 3649a79f07 LLMs: XAI chat message namings 2025-04-10 19:24:00 -07:00
Enrico Ros 3992acd9d4 AIX: XAI: update models list + enable reasoning tuning 2025-04-10 19:21:16 -07:00
Enrico Ros b6f130e00b AIX: XAI: models list parser update 2025-04-10 18:31:33 -07:00
Enrico Ros 63c475e24f useGlobalShortcuts: support Backspace 2025-04-10 15:24:07 -07:00
Enrico Ros d8d4f4e8f3 useLLMSelect: 10-100x faster on many models 2025-04-10 14:37:35 -07:00
Enrico Ros e4a2bf8b71 Beam: improve un-max button 2025-04-10 13:09:29 -07:00
Enrico Ros 19a1110bcf Beam: move max icon 2025-04-10 13:09:25 -07:00
Enrico Ros 1997599b33 Logger: reduce icon usage 2025-04-10 13:09:03 -07:00
Enrico Ros 467f24022b Logger: support for DEV 2025-04-10 11:51:38 -07:00
Enrico Ros 3147f9b087 Bubble: tryfix on mobile 2025-04-10 09:17:51 -07:00
Enrico Ros 79e5931a45 Fix extension of text selection. Fixes #788 2025-04-10 09:03:01 -07:00
Enrico Ros 1c9cefb61b AIX: OpenAI-compatible: fix missing reasoning tokens 2025-04-10 08:33:09 -07:00
Enrico Ros 318bf80ad6 Flash: enable saving images too 2025-04-06 17:29:02 -07:00
Enrico Ros bc3a757764 Flash: disable auto-backup on restore (confusing/danger of overwrite) 2025-04-06 16:57:10 -07:00
Enrico Ros 31459c0121 Flash: download works on mobile now 2025-04-06 16:46:43 -07:00
Enrico Ros 87bd9ff08b Flash: improve mobile reliability with streaming of the Flash object 2025-04-06 16:00:14 -07:00
Enrico Ros 972284ec20 Settings: show AIX & Logger on Tools 2025-04-06 14:58:07 -07:00
Enrico Ros 8aaec8e13b Optima: make the last dialogs mut-ex 2025-04-06 14:57:52 -07:00
Enrico Ros f668fb85b2 Flash: tryfix for Mobile JSON trunctation 2025-04-06 14:57:12 -07:00
Enrico Ros a5e4ab8f9e AIX Debugger: explain 2025-04-06 14:57:12 -07:00
Enrico Ros a58db6c2bf Optima: parent the AIX Debugger Modal 2025-04-06 14:57:12 -07:00
Enrico Ros cc8c5a4b7c LogViewer: more mobile friendly 2025-04-06 14:56:54 -07:00
Enrico Ros 7027474942 Logger: per-module factory 2025-04-06 13:44:30 -07:00
Enrico Ros 41dd1e4b81 Flash/Backup Data
A new setting to export all the settings in localstorage and IndexedDB into
single 'flash' files for Big-AGI to reload.

This allows to quickly and easily migrate a full installation, including images,
from a v2-dev open installation to another.

This won't likely work across other branches, but it's meant to be forward-compatible.
2025-04-06 00:58:25 -07:00
Enrico Ros dd24b33cce Models list: display free models 2025-04-05 12:01:30 -07:00
Enrico Ros a703edab58 Gemini: update models 2025-04-05 11:10:06 -07:00
Enrico Ros 57346617a5 tools/ai/repo-structure: fix on mac/zsh 2025-04-05 10:49:33 -07:00
Enrico Ros 3a8bfb0bb1 @Events - export the app-wide bus type 2025-04-05 10:49:21 -07:00
Enrico Ros cd4e6f0f5e Roll packages 2025-04-05 10:39:52 -07:00
Enrico Ros 1a3037b756 Roll mono: +Aix. 2025-04-05 10:31:11 -07:00
Enrico Ros ef32834e10 OpenRouter: models list: prevent schema changes from breaking working models. Fixes #787 2025-04-05 10:25:12 -07:00
Enrico Ros a684a0fd3b OpenRouter: models list: ignore missing fields on 'openrouter/auto'. Fixes #787 2025-04-05 10:24:40 -07:00
Enrico Ros 44505d0e44 @Events
Introducing the Events module with per-Domain extensibility.

Depends on @Logger.
Required eventemitter3.

A pleasure to extend, and start using both for Subsystems and AGI events.
2025-04-04 14:19:10 -07:00
Enrico Ros d1589cf665 We need one space here for md presentation 2025-04-03 11:52:06 -07:00
Enrico Ros 4a7b4fbabf Document how to launch tools/ai/repo-structure.sh from anywhere 2025-04-03 11:49:43 -07:00
Enrico Ros ac1b3d7938 Simple repo ls for AI context 2025-04-03 11:41:25 -07:00
Enrico Ros 1686e662b4 DBlob: remove unused 2025-04-02 13:14:27 -07:00
Enrico Ros 67c97e7bd2 Vector Clock removal. 2025-04-02 11:09:34 -07:00
Enrico Ros 805c925e0d LLMs: Disable emojis on isLatest 2025-04-01 00:14:55 -07:00
Enrico Ros 8ffba9cdb5 Star your favorite models 2025-03-30 11:16:47 -07:00
Enrico Ros 5d5290f69d Replace the plantuml-encoder package with our custom implementation 2025-03-29 18:27:31 -07:00
Enrico Ros 563403a7f8 Logger: up debug console 2025-03-29 16:02:47 -07:00
Enrico Ros 5cbf013a8e Conversations: count hook 2025-03-28 15:19:49 -07:00
Enrico Ros 8bee761bb4 Quick edit: update text. #782 2025-03-28 10:47:17 -07:00
Enrico Ros 8bc482abe9 Attachments: rename to Note 2025-03-28 00:19:40 -07:00
Enrico Ros 51fd83cd7f Logger: in Optima modals (shift+ctrl+g) 2025-03-27 13:29:28 -07:00
Enrico Ros a0811c6d25 Logger: auto-capture unhandled 2025-03-27 13:29:28 -07:00
Enrico Ros 77e8497100 Logger: core framework 2025-03-27 13:29:27 -07:00
Enrico Ros b46aaa388b Logger: uuid id 2025-03-27 13:29:27 -07:00
Enrico Ros eeeba2febe ReAct mode: fix /draw 2025-03-26 09:27:21 -07:00
Enrico Ros 75921d08d1 OpenRouter: show reasoning 2025-03-26 09:25:25 -07:00
Enrico Ros 7764d18a8b Gemini: update models 2025-03-26 08:51:41 -07:00
Enrico Ros 797293ad8d Drawer: show open beams 2025-03-24 14:18:26 -07:00
Enrico Ros 7c7f1bcd5f Fix SQL auto-mime detection 2025-03-21 10:06:06 -07:00
Enrico Ros 50a430b353 Diffs: increase contrast 2025-03-20 10:40:50 -07:00
Enrico Ros 5b562c6671 Panels: remove minSize={20} for a reflow issue
When this property is set, a re-layout (force reflow) is performed by the browser even with a simple hovering of the separator.

Since we may have very large walls of text/images, we need to minimize relayouts, so for now, we set a min size on the contained scrollable container instead of preventing the resize.

See also this upstream issues: https://github.com/bvaughn/react-resizable-panels/issues/337
2025-03-20 07:59:22 -07:00
Enrico Ros cb0bf2d2e7 Incognito: better pane titles 2025-03-20 07:57:31 -07:00
Enrico Ros 0b042bb2b5 Incognito: better chat background 2025-03-20 07:57:28 -07:00
Enrico Ros b91fbeb978 Dark: better beam 2025-03-20 07:57:20 -07:00
Enrico Ros d0b84e7ca3 Roll packages deep 2025-03-19 21:35:34 -07:00
Enrico Ros 0edeeb54b4 Roll packages 2025-03-19 21:32:35 -07:00
Enrico Ros e1b2a28f7d Remove unneeded package 2025-03-19 21:31:03 -07:00
Enrico Ros 347c7be899 LLMs: o1-pro 2025-03-19 17:23:08 -07:00
Enrico Ros c71d88d3bf Attachments: change order/text 2025-03-19 16:47:58 -07:00
Enrico Ros 0d4cbe462f Change height for stacked desktop openings. 2025-03-19 16:36:25 -07:00
Enrico Ros a05110cd93 Update browse.router.ts 2025-03-19 09:33:33 -07:00
Enrico Ros 8f6ebe8301 Clean: improve icon 2025-03-18 07:29:59 -07:00
Enrico Ros 818775a12b Dark mode: increase code contrast 2025-03-18 06:45:29 -07:00
Enrico Ros 80b60cdaa8 roll packages 2025-03-17 06:11:54 -07:00
Enrico Ros 69118df912 Remove eventsource-parser.
The code is still laying around in AIX for a second.
2025-03-17 06:09:24 -07:00
Enrico Ros ff65382e06 AIX: 146x faster SSE Demuxer - hand-rolled optimized
The 14,600% speedup compared to eventsource-parser comes from Gemini Image Generation use cases.
2025-03-17 06:07:25 -07:00
Enrico Ros 420b8c49c6 AIX: profiler: force fallback which is class scoped
The 'performance' API is global and conflicting between calls.
2025-03-17 06:07:24 -07:00
Enrico Ros 0f9c02e249 AIX: demuxers: extract eventsource-parser 2025-03-17 02:35:43 -07:00
Enrico Ros 4890a90641 AIX: cleanup/centralize security on Production builds 2025-03-17 00:16:55 -07:00
Enrico Ros 653f0991e0 AIX: really disable Profiling on production builds 2025-03-16 23:51:30 -07:00
Enrico Ros a40efb4780 AIX: Debugger: add Profiler 2025-03-16 23:49:28 -07:00
Enrico Ros feea74268d AIX: Profiler: edge-runtime fallback for missing performance classes
Note: could become the default, for
compatibility, or when in non-dev mode.
2025-03-16 23:16:33 -07:00
Enrico Ros 631582ccbb UX: swap two chat actions 2025-03-16 22:23:14 -07:00
Enrico Ros 4f048a9907 AIX: profiler is inactive by default on both Client and Server
To turn it on, either|or:
- server side: aix.router.ts: DEBUG_LOG_PROFILER_ON_SERVER=true
- client side: DEV BUILD + "debug mode" + DEBUG_LOG_PROFILER_ON_CLIENT=true to show on the console
2025-03-16 22:15:52 -07:00
Enrico Ros a8752ccde0 AIX: dynamic Profiler
This requires EITHER:
- on the server-side, in aix.router.ts, set DEBUG_LOG_PROFILER=true;
- on the client side, and only for Development builds, this is automatic in "Debug Mode"
2025-03-16 22:11:37 -07:00
Enrico Ros feafad0d77 AIX: yield ❤|while awaiting 2025-03-16 18:54:34 -07:00
Enrico Ros 6faa468ed3 AIX: ❤|awaited ops 2025-03-16 17:43:07 -07:00
Enrico Ros ab55804039 AIX: images in the last assistant fragment (e.g. generated by AI) get sent at storage quality (e.g. 98% WebP) 2025-03-16 07:26:22 -07:00
Enrico Ros 05d9bb3bab Gemini: store compressed images. Save 80% at 98% quality (png -> webp) 2025-03-16 07:07:04 -07:00
Enrico Ros 39ae8cd250 Gemini: Image Generation additional details 2025-03-16 06:43:45 -07:00
Enrico Ros 5d34e3eb88 Gemini: Image Generation does not support the system prompt 2025-03-16 06:26:33 -07:00
Enrico Ros ee20441307 Gemini: render generated images 2025-03-16 06:04:19 -07:00
Enrico Ros b12920ae67 AIX: max dev messages size: 4096 2025-03-16 05:54:45 -07:00
Enrico Ros f9ab682559 AIX: reassembler: improve (unawaited) error catching 2025-03-16 05:47:10 -07:00
Enrico Ros d042f7b396 AIX: Asynchronous Reassembler 2025-03-16 04:56:38 -07:00
Enrico Ros d8e4c8a78c Disable animation during debug 2025-03-16 04:40:58 -07:00
Enrico Ros 1e2dcce664 Merge pull request #777 from darthalex2014/Multi-key---lite
AIX: AI: multi-key support. function getRandomKeyFromMultiKey
2025-03-16 02:41:00 -07:00
Enrico Ros ab4af50daf UX: rename the Performance toggle, and strongly advise it off 2025-03-16 02:29:54 -07:00
Enrico Ros 26c83764d9 BEAM: quitting reason 2025-03-16 02:28:40 -07:00
Enrico Ros 85ac64dea1 BEAM: selfish when solo. 2025-03-16 02:28:40 -07:00
Enrico Ros 7305c9d354 AIX: withDecimator 2025-03-16 02:28:40 -07:00
Enrico Ros b99f8e6b14 AIX: chatGenerate "maybe asynchronous" callbacks support 2025-03-16 02:17:21 -07:00
Enrico Ros eb7e2ab92a 1px is 1px 2025-03-16 02:17:20 -07:00
Enrico Ros f7edbfb5af Remove one unnecessary/invisible animation, one optional animation 2025-03-16 02:17:20 -07:00
Alex(GoD) 7c918e4735 AIX: AI: multi-key support. function getRandomKeyFromMultiKey
getRandomKeyFromMultiKey
2025-03-16 10:41:14 +05:00
Enrico Ros 7d4d1e13a0 Gemini: fix file URI 2025-03-15 06:43:34 -07:00
Enrico Ros dbe58e30c4 Gemini: fix build - this hasn't landed yet 2025-03-15 06:41:34 -07:00
Enrico Ros d2aa97b889 Gemini: wires for image generation 2025-03-15 06:38:09 -07:00
Enrico Ros 0eac3e3aca Gemini: document the next steps for Grounding 2025-03-15 04:49:02 -07:00
Enrico Ros 75d61d0604 Gemini: support setting the civic integrity block threshold 2025-03-15 04:45:43 -07:00
Enrico Ros 2f7b053f96 Gemini: grounding basic support (not on in the UI) #773
What's left:
- figure out how to turn this on/off
- figure out which models can or cannot use it, without having too much to maintain
- figure out the runtime implementation
- parse the annotation ranges
- render the original icons
- figure out how to escape the Vertex rewriting of URLs
2025-03-15 04:38:12 -07:00
Enrico Ros 5ab5a85b73 Gemini: hotfix image output 2025-03-15 03:56:16 -07:00
Enrico Ros 1d7da8fa8c Gemini: allow using 2.0 Experimental - Flash Image Generation 2025-03-15 03:26:00 -07:00
Enrico Ros 727b2edf74 Gemini: improve the parser 2025-03-15 03:25:40 -07:00
Enrico Ros 6caff0ca59 Gemini: citations (recitation detection) support: disable by default: poor websites 2025-03-15 02:24:20 -07:00
Enrico Ros b41f930d08 Gemini: improve response parsing (excl. candidates) 2025-03-15 01:19:41 -07:00
Enrico Ros 5a70d926cb Gemini: report the actual model used for the generation, not what was requested 2025-03-15 00:41:12 -07:00
Enrico Ros dbfe7b734c Gemini: gemma-3 does not support input images 2025-03-15 00:30:25 -07:00
Enrico Ros 8acf5df3aa Gemini: improve wiretypes 2025-03-15 00:23:39 -07:00
Enrico Ros f3b882ca2f AIX: update readme 2025-03-14 23:55:23 -07:00
Enrico Ros 94adf3cda6 Azure: instructions on how to add models customizations via code. #774 2025-03-14 23:23:17 -07:00
Enrico Ros bfacaa6cf8 Azure: full cleanup, supports for any OpenAI model (via auto weak mapping) #774
Also, separate from the OpenAI models, but still resolve params in the OpenAI set.
2025-03-14 23:15:08 -07:00
Enrico Ros 0033debb90 Azure: upgrade the chat generation API version (keep the deployments list), #774 2025-03-14 23:13:34 -07:00
Enrico Ros 20f2bda6ed Azure: auto-fix temperature and max_tokens, #774 2025-03-14 23:12:42 -07:00
Enrico Ros bcc278c9cf OpenAI: contrary to the docs, the Search Preview models don't support image inputs 2025-03-14 22:10:23 -07:00
Enrico Ros 75ccac221d Composer: change debounce deadlines 2025-03-13 17:31:53 -07:00
Enrico Ros d90dd90a4a Optimization: render the message labels less 2025-03-13 17:26:33 -07:00
Enrico Ros d9156ce66c Composer: improve from useDebouncer 2025-03-13 17:10:57 -07:00
Enrico Ros 61457681e1 LLMs: Gemini fix & update (Gemma 3, see notes).
Somehow the developer instruction is not enabled for Gemma3-IT, and we got this message:
"Gemini: Bad Request - Developer instruction is not enabled for models/gemma-3-27b-it"

So we convert any System message to a User message instead (see the hotfix)
2025-03-12 04:02:06 -07:00
Enrico Ros bf5019108e LLMs: Ollama: update 2025-03-12 04:02:05 -07:00
Enrico Ros 622edec2fb Improve Void fragments render 2025-03-12 03:31:07 -07:00
Enrico Ros dac02f81c0 Improve Annotations render 2025-03-12 03:25:00 -07:00
Enrico Ros d8037ebd8d Improve Chat short model names 2025-03-12 03:24:44 -07:00
Enrico Ros fba1bac8d2 OpenAI: move models 2025-03-12 03:24:32 -07:00
Enrico Ros 510fbd293b Blocks: separate Void Parts from Content parts 2025-03-12 02:26:33 -07:00
Enrico Ros ab8c974e6f Options: improve 2025-03-12 02:26:32 -07:00
Enrico Ros 870f5afcfb Options: framework and parser 2025-03-12 02:15:36 -07:00
Enrico Ros 6192bda94f DVoidWebCitation: fix immutability
Actually I don't know why I need to declare those
as readonly arrays in the base objects.
2025-03-12 01:59:44 -07:00
Enrico Ros 3f701fcee3 Void Annotations: render 2025-03-12 01:36:30 -07:00
Enrico Ros 524d049d74 AIX: Perplexity: parse citations 2025-03-12 01:35:46 -07:00
Enrico Ros 983e964e36 AIX: OpenAI: parse annotations/citations 2025-03-12 01:35:46 -07:00
Enrico Ros 84f989d6da AIX: Annotation particles reassembly 2025-03-12 01:35:46 -07:00
Enrico Ros 49356fa769 AIX: Annotation particles transmission 2025-03-12 01:35:46 -07:00
Enrico Ros 2a6a03da64 DMessage: Void Annotation fragments 2025-03-12 01:35:46 -07:00
Enrico Ros fd17860dd8 Add auto-render of domain icons 2025-03-12 01:35:46 -07:00
Enrico Ros 46fea48b6e UrlUtils: add URL domain/prettyHref extraction 2025-03-12 01:35:28 -07:00
Enrico Ros 54ef248df5 AIX: OpenAI: support for web_search_options 2025-03-11 19:18:00 -07:00
Enrico Ros 2dfb8990d2 AIX: support for Search Context & Location 2025-03-11 19:17:45 -07:00
Enrico Ros a50ac8167b OpenAI: Context & Location search parameters 2025-03-11 19:15:15 -07:00
Enrico Ros 86baab6858 AIX: Debugger: don't show particles by default (heavy) 2025-03-11 19:13:44 -07:00
Enrico Ros 67c18bb0af Utils: webGeolocation 2025-03-11 18:52:12 -07:00
Enrico Ros c4584c27ef OpenAI: update sorting 2025-03-11 17:48:26 -07:00
Enrico Ros 0022439bba OpenAI: support all Search Preview (online) models. 2025-03-11 17:38:37 -07:00
Enrico Ros 5a81ef573c Roll packages deep 2025-03-11 11:20:45 -07:00
Enrico Ros 6f7ea5c7df Roll packages 2025-03-11 11:19:19 -07:00
Enrico Ros 926452bd55 Follow Enter/newline preferences for edits. Fixes #760. Fixes #770. Closes #771. 2025-03-10 20:25:10 -07:00
Enrico Ros b5eeb6945c Vector clocks note added to Data Ownership. 2025-03-10 20:20:58 -07:00
Enrico Ros 241ba623cc Vector clock device IDs 2025-03-10 20:00:05 -07:00
Enrico Ros cbd3099fa5 YT Transcript: warn of broken downloads 2025-03-10 16:43:51 -07:00
Enrico Ros 49e12e2a0b stores/{ui, ux-labs}: move 2025-03-10 12:15:21 -07:00
Enrico Ros 4b405af0e4 Release Notes
Release notes
2025-03-07 18:30:12 -08:00
Enrico Ros 578ef40106 Tech: show shipped 2025-03-07 16:33:03 -08:00
Enrico Ros f6e76b0fb9 Version update 1.92 2025-03-07 16:30:53 -08:00
Enrico Ros 17549bfe29 Cleanups: remove magic emojis 2025-03-07 15:32:22 -08:00
Enrico Ros 7915aed388 SearchParams: for future URL state 2025-03-06 03:06:08 -08:00
Enrico Ros e26c23e238 DNT-DT: wire up 2025-03-05 19:11:22 -08:00
Enrico Ros fb5da15245 DND-DT: GlobalDragOverlay
Note: to enable, add the overlay to OptimaLayout
2025-03-05 19:11:22 -08:00
Enrico Ros 0021e4f354 DND-DT: composer without zIndex 2025-03-05 19:11:22 -08:00
Enrico Ros afa850231c PostHog: dynamic loading to reduce bundle size 2025-03-05 19:11:22 -08:00
Enrico Ros 935dc7ddaf anim: add fade In 2025-03-05 18:27:29 -08:00
Enrico Ros ac08eec0e4 restore React Strict mode 2025-03-05 18:27:29 -08:00
Enrico Ros 5deb062e5f DND-DT: move 2025-03-05 15:02:24 -08:00
Enrico Ros 8e33fdbae5 Panes: allow delete (disabled) 2025-03-05 14:44:36 -08:00
Enrico Ros 403e6fbe37 Resize handler: noop basically 2025-03-05 14:26:55 -08:00
Enrico Ros 071c43997e MOTD: use TimeAgo 2025-03-05 14:03:38 -08:00
Enrico Ros 04f9512c2a Vercel: serverless functions timeout to 60
Do it the proper way, as the vercel_Production file only caused troubles.
2025-03-05 13:49:48 -08:00
Enrico Ros b9bc4421a3 Panes: adapt API 2025-03-05 13:05:25 -08:00
Enrico Ros b2efd5af0a InlineTextarea: centered text 2025-03-05 13:05:18 -08:00
Enrico Ros 264a2f9449 MOTD: fix 2025-03-05 13:02:34 -08:00
Enrico Ros 561959e960 Panes: fading 2025-03-05 12:59:51 -08:00
Enrico Ros 41a5f9a775 Panes: Edit Title, close, close others 2025-03-05 12:59:47 -08:00
Enrico Ros 9a61e04293 Panes: adapt to API change 2025-03-05 12:59:40 -08:00
Enrico Ros 3f1e01c6f9 Panes: rename removeNonFocused 2025-03-05 12:58:12 -08:00
Enrico Ros 12eabf86cf Allow MOTD dismissal, but for every hash key. 2025-03-04 17:15:17 -08:00
Enrico Ros 82d39d3256 Env-vars: document Message of the day and new variables. 2025-03-04 16:33:16 -08:00
Enrico Ros a1921e6fa4 Add the capability to display a MOTD - message of the day. For 2025-03-04 16:32:53 -08:00
Enrico Ros a5463fabe5 Further rationalize Build env var access 2025-03-04 15:59:33 -08:00
Enrico Ros 26f71ddedd PostHog: add functionality and documentation if you want to use it 2025-03-04 15:44:36 -08:00
Enrico Ros bdc2f7e8e1 PostHog: add packages 2025-03-04 13:56:38 -08:00
Enrico Ros 2083be39da roll packages 2025-03-03 15:26:03 -08:00
Enrico Ros 521419a5aa Title: Move overlay 2025-02-28 04:34:27 -08:00
Enrico Ros 5bf9270d5d Multi-pane Titles 2025-02-28 04:18:03 -08:00
Enrico Ros 2b55921830 AIX: options override: be safe without the ref 2025-02-28 02:15:49 -08:00
Enrico Ros 707ffa162e AIX: Debugger: enable any context 2025-02-27 22:32:29 -08:00
Enrico Ros 19848da7c3 AIX: Debugger: wire more 2025-02-27 22:32:02 -08:00
Enrico Ros 334df849b3 AIX: Debugger: first version 2025-02-27 22:30:38 -08:00
Enrico Ros 801d34692b AIX: Debugger: wire Aix Client 2025-02-27 22:29:56 -08:00
Enrico Ros 0aa70f2b80 AIX: Debugger: reactive store 2025-02-27 22:26:51 -08:00
Enrico Ros 5ad11a8b75 AIX: Debugger: rename Dispatch Request loopback 2025-02-27 22:25:37 -08:00
Enrico Ros 3f1bed3b6e Anthropic: auto-limit the thinking budget 2025-02-27 18:35:03 -08:00
Enrico Ros ca3668dd60 Gemini: cleanup models 2025-02-27 18:33:17 -08:00
Enrico Ros b3ae2b1cbc Gemini: remove non existing models from our definitions, and add a check 2025-02-27 18:33:15 -08:00
Enrico Ros f6abca0663 Gemini: update models 2025-02-27 18:33:13 -08:00
Enrico Ros 084ff69239 Anthropic: update header docs 2025-02-27 18:33:10 -08:00
Enrico Ros 8d31be462a OpenAI: fix model order 2025-02-27 16:55:46 -08:00
Enrico Ros 6d010c0ef1 Metrics: show the speed section also if the wait exceeded 10 seconds 2025-02-27 16:55:39 -08:00
Enrico Ros dfc37fb2d4 Metrics: require at least 40 tokens to compute speed (and it's a very low bar 2025-02-27 16:48:39 -08:00
Enrico Ros 56cd7b0b4f Metrics: compensate reasoning tokens 2025-02-27 16:44:23 -08:00
Enrico Ros 0060739bd2 Metrics: hmm 2025-02-27 16:44:10 -08:00
Enrico Ros e98f86d878 Metrics: improve render 2025-02-27 16:35:04 -08:00
Enrico Ros 1683790315 Metrics: render tok/s and wait 2025-02-27 16:31:39 -08:00
Enrico Ros 3c32c906de Metrics: store dtStart and vTOutInner where available 2025-02-27 16:11:38 -08:00
Enrico Ros d8c9c50743 OpenAI: official 4.5 support 2025-02-27 15:44:43 -08:00
Enrico Ros 2fc6febfaf LLM types: small sort 2025-02-27 15:10:44 -08:00
Enrico Ros f49c679005 Optima Dropdown: faster, better style 2025-02-27 01:20:53 -08:00
Enrico Ros 67206a3c4d AppChat: improve borders 2025-02-27 01:20:46 -08:00
Enrico Ros ed23f1d243 roll: misc deep 2025-02-26 19:55:37 -08:00
Enrico Ros 3b8c6c8c06 roll: Lock NextJS to 15.1 2025-02-26 19:53:04 -08:00
Enrico Ros e0c956e3e7 roll: Types for React 19 2025-02-26 19:48:43 -08:00
Enrico Ros 6efff8b285 React: fix useRef for React 19 2025-02-26 19:46:18 -08:00
Enrico Ros 4422c6c803 Incognito: improve appearance 2025-02-26 18:33:24 -08:00
Enrico Ros 511b9241f5 FormLabelStart: optimize 2025-02-25 13:09:12 -08:00
Enrico Ros 89549ebeef LLM Params Editor: support simplify 2025-02-25 13:00:53 -08:00
Enrico Ros bdb24f6da1 Fragments: fix types 2025-02-25 05:00:36 -08:00
Enrico Ros d7bc03f0a9 AIX: Dispatch/CGR: adapters for Thinking Blocks (only Anthropic is implemented)
Note: the ModelAux/reasoning block is only sent if there's a signature or there is redacted data.

We could even further reduce its sending to only Anthropic llms in CGR.
2025-02-25 04:37:38 -08:00
Enrico Ros 64c18e3f68 Fragments: have to deal with this string[] 2025-02-25 04:34:46 -08:00
Enrico Ros 7bba7e0c32 AIX: TRR particle reassembler fix 2025-02-25 04:34:17 -08:00
Enrico Ros e48b3f0f8e Render Block parts 2025-02-25 03:39:37 -08:00
Enrico Ros 31da502123 AIX: Anthropic: parser: S/NS TRR particles 2025-02-25 03:39:09 -08:00
Enrico Ros 9c64bbdd60 AIX: Anthropic: parser: exhaustive checks 2025-02-25 03:21:20 -08:00
Enrico Ros f4c1b0c1da AIX: TRR particle transmitter/reassembler 2025-02-25 03:21:20 -08:00
Enrico Ros c761e9fe38 AIX: mirror the Aux fragment 2025-02-25 03:19:58 -08:00
Enrico Ros e66aaaf98a Fragments: finalize the Aux fragment 2025-02-25 03:19:58 -08:00
Enrico Ros 58b5811d9e Fragments: small fix 2025-02-25 02:38:26 -08:00
Enrico Ros 3b3429d77a LLMs: document interfaces 2025-02-25 01:32:37 -08:00
Enrico Ros 98eb1a6694 Chat AI: keep last Thinking block only (default) 2025-02-25 00:26:22 -08:00
Enrico Ros 91929a3217 Chat AI settings: renames 2025-02-24 23:30:45 -08:00
Enrico Ros 5eecbc43be Chat AI settings: categories 2025-02-24 23:27:59 -08:00
Enrico Ros 609502c545 LLMs: don't control temperature when controlling Anthropic's Thinking Budget (temp=1) 2025-02-24 19:52:41 -08:00
Enrico Ros d0b420f9a1 AIX: Anthropic: wire Response: Thinking/RedactedThinking blocks - NOT matched by AixWire_Particles AND NOR AixWire_Parts 2025-02-24 19:40:48 -08:00
Enrico Ros 1222c53a1a AIX: Anthropic: wire Request: Thinking blocks 2025-02-24 18:57:19 -08:00
Enrico Ros 7b2d51e6c9 AIX: Anthropic: adapter support for the Thinking Budget 2025-02-24 18:57:19 -08:00
Enrico Ros 46cb286839 AIX: Anthropic: framework support for Thinking Budget (nullable number) 2025-02-24 18:57:19 -08:00
Enrico Ros 2e6f0c06fb AIX: Anthropic: adapter misc (Documents, unused for now)
This pairs with the Citations mechanism, that's not yet added to the wires.
2025-02-24 18:57:19 -08:00
Enrico Ros 31c138dacb AIX: improve user-visible message 2025-02-24 18:57:19 -08:00
Enrico Ros e428683ec7 LLMs: Anthropic: add the Thinking variant 2025-02-24 18:57:07 -08:00
Enrico Ros b6462225a7 LLMs: define, edit, and optionally spec the vendor model parameter 'Anthropic thinking budget' 2025-02-24 15:54:12 -08:00
Enrico Ros dfc110ca05 LLMs: enable model variants 2025-02-24 15:21:18 -08:00
Enrico Ros f55bd26f2e Anthropic: improve flags composition 2025-02-24 14:11:14 -08:00
Enrico Ros 603b6b90df Anthropic: 3.7 dMessageUtils 2025-02-24 12:55:52 -08:00
Enrico Ros 2c132ae2cf Anthropic: auto-created-date 2025-02-24 12:55:52 -08:00
Enrico Ros c7f4ad5a31 Anthropic: update 3.7 output size 2025-02-24 12:55:52 -08:00
Enrico Ros b9d5593895 Anthropic: update models 2025-02-24 12:55:52 -08:00
Enrico Ros 6a833fc141 LLM Options: just slight better display 2025-02-24 12:55:52 -08:00
Enrico Ros 4e1ad84831 CloseablePopup: memo 2025-02-24 00:48:31 -08:00
Enrico Ros e90bcdf1a3 ERC: fix overlapping menus and non-closing menus on rmb click 2025-02-24 00:48:23 -08:00
Enrico Ros dfbb346180 BeamView: comment for LLMs 2025-02-23 22:41:12 -08:00
Enrico Ros 2d5b97f68f Draw: fix 2025-02-23 15:55:06 -08:00
Enrico Ros 32826f1e4d Draw: improve # 2025-02-23 15:41:04 -08:00
Enrico Ros b1ed1d624a Draw: image settings 2025-02-23 15:40:55 -08:00
Enrico Ros 06c4040334 No tips on draw 2025-02-23 15:24:25 -08:00
Enrico Ros b71c389f5c Uniform model icons 2025-02-23 14:37:11 -08:00
Enrico Ros 5557de6dc3 Fragments: support placeholders with purpose 2025-02-23 03:15:27 -08:00
Enrico Ros ccdcd24d22 LLMs: fix 'buttons can wrap' 2025-02-23 03:13:06 -08:00
Enrico Ros c410a655ea Fix latext/markdown rendering: preserve leading space when re-encoding for 'remark-math'. Fixes #763 2025-02-23 01:49:31 -08:00
Enrico Ros 2fd84ae57c Nav: disable incomplete 2025-02-23 01:31:37 -08:00
Enrico Ros b760b717ef Imagine: fix prompt and algo 2025-02-23 01:00:08 -08:00
Enrico Ros acf9bd8663 AppChat: Draw: "draw options" on desktop 2025-02-23 01:00:08 -08:00
Enrico Ros 7327f1440e AppChat: Draw: support N images 2025-02-23 01:00:08 -08:00
Enrico Ros 87d8c10905 AppChat: Draw: suspend other elements 2025-02-23 00:35:12 -08:00
Enrico Ros ee45f3cae9 AppChat: Draw: inline enhancements 2025-02-23 00:15:11 -08:00
Enrico Ros 195255ce9a roll packages 2025-02-23 00:15:11 -08:00
Enrico Ros 0e4fda0c5a Draw/Provider: share style 2025-02-22 23:16:53 -08:00
Enrico Ros f1babdee60 Draw/Provider: rename 2025-02-22 23:16:53 -08:00
Enrico Ros a703d85688 T2I settings: remove popup, overflows on mobile 2025-02-22 23:16:53 -08:00
Enrico Ros 0cd677cb39 T2I settings: use chips for the active service 2025-02-22 23:16:53 -08:00
Enrico Ros 9fe11fb6e2 Add FormChipControl: swappable for the Radio Control 2025-02-22 23:16:53 -08:00
Enrico Ros 58451b17dc Optima: export dropdown slotProps 2025-02-22 23:16:53 -08:00
Enrico Ros cba924a31a Phosphor: add settings 2025-02-22 23:16:53 -08:00
Enrico Ros 74e50d1cb2 Beam: don't re-run when ctrl+enter when editing 2025-02-22 23:16:53 -08:00
Enrico Ros bd1c01b4e1 Remove unused 2025-02-22 20:34:27 -08:00
Enrico Ros 541fa4aa28 Code Icon 2025-02-22 20:34:15 -08:00
Enrico Ros 4dd03c7bd6 Fix port 2025-02-22 20:34:07 -08:00
Enrico Ros 3a2de83920 Auto-scale side menu 2025-02-22 19:55:21 -08:00
Enrico Ros 2ef5d339c6 Misc simplify 2025-02-22 19:42:13 -08:00
Enrico Ros 6355098703 Backport smallie 2025-02-22 19:33:34 -08:00
Enrico Ros a10a953097 Big-AGI logos 2025-02-20 17:33:32 -08:00
Enrico Ros 99293d9841 Optima: large UI cleanups 2025-02-20 16:33:18 -08:00
Enrico Ros 6d409e4df5 Optima: Side Paneling 2025-02-20 16:33:11 -08:00
Enrico Ros 2fceef4f0c Fix max/fullscreen icons 2025-02-20 15:49:44 -08:00
Enrico Ros 7577e64085 Show last used chat mode in dev settings. 2025-02-20 14:41:40 -08:00
Enrico Ros 4a9750865f LLM domain capabilities checking: warn about proceeding with a LLM without requirements, but don't bail 2025-02-20 14:31:41 -08:00
Enrico Ros fba0685266 LLM domain autoconfiguration includes the function calling detection 2025-02-20 14:27:55 -08:00
Enrico Ros e3fa1c740d Reconfigure Code/Fast if not present after a full reconfig. 2025-02-20 14:26:59 -08:00
Enrico Ros de190f6d41 LLM Attachments: stay in tooltip 2025-02-20 14:26:19 -08:00
Enrico Ros 7a5bc39376 Gemini: fix model capabilities 2025-02-20 14:26:02 -08:00
Enrico Ros c0b67653de RenderCode: fix fullscreen 2025-02-20 14:25:48 -08:00
Enrico Ros c6b1bd2f3a Advanced AI settings: improve all settings 2025-02-20 13:57:53 -08:00
Enrico Ros ae5c30af6b FormLabelStart: support warnings 2025-02-20 13:51:39 -08:00
Enrico Ros a513378d73 autoChatFollowUps: code model only 2025-02-20 13:51:30 -08:00
Enrico Ros 5b63c12958 Gemini: thinking models do not do FC 2025-02-20 13:51:30 -08:00
Enrico Ros f3fec33085 Code model editing. 2025-02-20 13:11:08 -08:00
Enrico Ros 3a071af42d LLMs: get from domain 2025-02-20 12:49:35 -08:00
Enrico Ros a06a863745 Revert "Mic: Enter/Ctrl+Enter interceptors to Send/Beam"
This reverts commit 93f2cf4bce.
2025-02-20 12:47:14 -08:00
Enrico Ros 93f2cf4bce Mic: Enter/Ctrl+Enter interceptors to Send/Beam 2025-02-20 12:34:10 -08:00
Enrico Ros 0b70728f04 Mic: disable focus on the Composer Textarea while active 2025-02-20 12:32:03 -08:00
Enrico Ros b12f422db6 Shortcuts: Esc comes first 2025-02-20 12:31:55 -08:00
Enrico Ros 13681deaa1 Nav: strings 2025-02-20 12:05:05 -08:00
Enrico Ros d2d43af0df Nav: breadcrumbs 2025-02-20 12:05:05 -08:00
Enrico Ros 500f053afd Settings: update 2025-02-20 12:05:04 -08:00
Enrico Ros 8cf9b06d7b Remove App.pl 2025-02-20 10:32:58 -08:00
Enrico Ros 88002fd78b Rename TenantSlug 2025-02-20 09:16:30 -08:00
Enrico Ros c4684d2dab Fw compat key name 2025-02-20 08:43:44 -08:00
Enrico Ros e46a244fea Move GA 2025-02-20 08:19:08 -08:00
Enrico Ros c940de6cd7 Perplexity: update models 2025-02-19 19:52:23 -08:00
Enrico Ros c391ecc7a9 Ollama: update models 2025-02-19 19:52:23 -08:00
Enrico Ros d65ad7324d OpenAI: small text updates 2025-02-19 18:35:11 -08:00
Enrico Ros a68ffd5339 Groq: update models pricing 2025-02-19 18:26:50 -08:00
Enrico Ros 59736d19af Deepseek: update prices 2025-02-19 18:20:44 -08:00
Enrico Ros 9967f09566 Alibaba: fix pricing 2025-02-19 16:01:11 -08:00
Enrico Ros 3d7e4ebb71 Alibaba Cloud support, incl Qwen Max, Plus, Turbo. Fixes #759 2025-02-19 15:54:34 -08:00
Enrico Ros c9457f7610 Block Editor: set FORCE_ENTER_IS_NEWLINE=undefined in the code to disable Shift+Enter to save, and follow the App preferences instead. Fixes #760. 2025-02-19 14:21:48 -08:00
Enrico Ros 13aef1fd89 xAI: update models 2025-02-18 14:12:57 -08:00
Enrico Ros a9548747cd Shortcuts: fix jumpiness 2025-02-18 13:51:26 -08:00
Enrico Ros 0da4cd6eb1 Empty Inline Links renderer 2025-02-18 10:44:12 -08:00
Enrico Ros 083246bea1 FireworksAI: small doc change 2025-02-18 00:15:47 -08:00
Enrico Ros 9f372ebd72 FireworksAI: support via custom OpenAI on https://api.fireworks.ai/inference 2025-02-18 00:11:18 -08:00
Enrico Ros cdf4c96ed6 Notice on approximate tokenizer 2025-02-17 22:50:46 -08:00
Enrico Ros c757b57e07 GA: application build stats 2025-02-17 22:50:46 -08:00
Enrico Ros 6629585b32 GA: remove @next/third-parties/google 2025-02-17 22:12:21 -08:00
Enrico Ros ad96d6ce66 Dockerfile: deployment type 2025-02-17 21:26:09 -08:00
Enrico Ros 5877dc1e24 Dockerfile: build information 2025-02-17 21:09:04 -08:00
Enrico Ros 908a6b808b Dockerfile: new env=value format 2025-02-17 20:30:18 -08:00
Enrico Ros fbd41fae7f roll residuals 2025-02-17 19:25:42 -08:00
Enrico Ros f9ff37c820 roll packages 2025-02-17 19:18:19 -08:00
Enrico Ros eed91491aa Types: immutable (deeply) 2025-02-17 19:13:38 -08:00
Enrico Ros 6faf9db2ba Azure: add note about AI Foundry. #757 2025-02-17 08:30:21 -08:00
Enrico Ros 713fd7fc22 Azure: rename to Azure OpenAI. #757 2025-02-17 08:29:46 -08:00
Enrico Ros d86ce3ac2f AIX: capitalize dialect in exceptions 2025-02-17 08:17:21 -08:00
Enrico Ros 076163ccfd Diagram - improve title 2025-02-16 01:35:48 -08:00
Enrico Ros 8f74c26f77 Space between radios 2025-02-16 01:35:25 -08:00
Enrico Ros 1b37ed61e3 Update text 2025-02-16 01:14:51 -08:00
Enrico Ros c6a421e61b Panel: Zero improvement 2025-02-15 14:26:54 -08:00
Enrico Ros 550a60f4af Panes: Zero notices 2025-02-15 14:04:24 -08:00
Enrico Ros 01a6901bfe Panes: add an empty split when not branching 2025-02-15 13:48:18 -08:00
Enrico Ros e655aa5bbd Pane Manager: cleanup 2025-02-15 13:17:10 -08:00
Enrico Ros f02409c5a9 Stores: cleanup 2025-02-15 13:12:31 -08:00
Enrico Ros 8524473488 o1: re-enable streaming now that OAI supports it 2025-02-15 12:48:28 -08:00
Enrico Ros 0b039c6453 Add Toggle 2025-02-14 15:00:29 -08:00
Enrico Ros 62250abe8b Improve multichat on mobile 2025-02-14 14:21:13 -08:00
Enrico Ros 5b0fc66cb1 Groq: update models 2025-02-14 13:38:32 -08:00
Enrico Ros ffa15c274b chat-store: merge (not replace) conversations from storage 2025-02-14 13:13:17 -08:00
Enrico Ros 09596000d7 Improve multichat icon 2025-02-14 13:05:05 -08:00
Enrico Ros 8e7a5e7d60 LLMs: improve autoconfig 2025-02-14 01:36:21 -08:00
Enrico Ros fc6d485fa3 LLMs: adapt PersonaSelector 2025-02-13 22:55:16 -08:00
Enrico Ros 0ed2e7e175 LLMs: remove useChatLLM for good 2025-02-13 22:55:16 -08:00
Enrico Ros cb0a54fe2b LLMs: bits 2025-02-13 21:06:00 -08:00
Enrico Ros d9cf91d2f0 LLMs: port useFormRadioLlmType 2025-02-13 21:03:10 -08:00
Enrico Ros 3ec820f212 LLMs: ModelsList for domains 2025-02-13 20:49:19 -08:00
Enrico Ros 474f743d28 LLMs: roll models 2025-02-13 20:39:22 -08:00
Enrico Ros 3f1b508752 LLMs: update the select 2025-02-13 20:38:49 -08:00
Enrico Ros 2c49a1d8b9 LLMs: port the llm dropdown 2025-02-13 19:46:09 -08:00
Enrico Ros ab441659b2 LLMs: port select and options 2025-02-13 18:56:56 -08:00
Enrico Ros 84d843b356 LLMs: per-domain configuration 2025-02-13 18:13:16 -08:00
Enrico Ros 9b3af38326 Models: update benchmark scores 2025-02-13 15:23:51 -08:00
Enrico Ros 8226a638d9 ModelAux: disable button (prob no effect) 2025-02-13 15:06:10 -08:00
Enrico Ros 4cd2c5878c LLMs: rename .service.types 2025-02-12 13:42:49 -08:00
Enrico Ros 8242198068 LLMs: extract assignments slice 2025-02-12 10:36:04 -08:00
Enrico Ros 59be5dc807 Update MCT 2025-02-12 09:46:39 -08:00
Enrico Ros de6b6012ba AiFn: disabled summarize 2025-02-12 09:33:58 -08:00
Enrico Ros 5928c84cf4 Chat AI: change utility model 2025-02-12 01:37:35 -08:00
Enrico Ros b393469584 LLM Select: ensure a min width of 96px, and break words if push comes to shove 2025-02-12 01:24:08 -08:00
Enrico Ros 6f5cef3a6c Wizard: support 'defaults' 2025-02-12 01:03:38 -08:00
Enrico Ros 5234d78719 LocalAI: fix a p > div 2025-02-11 22:52:11 -08:00
Enrico Ros aebe64ef3d Wizard: support Local vendors 2025-02-11 22:51:53 -08:00
Enrico Ros 224a40dcb7 Wizard: improve first time experience 2025-02-11 20:39:19 -08:00
Enrico Ros 5ddb6bf718 LocalAI: large UI improvement 2025-02-11 20:39:04 -08:00
Enrico Ros 11cb61874d Ollma: improve type 2025-02-11 20:38:56 -08:00
Enrico Ros 00ed22ad28 Mistral: improve 2025-02-11 20:38:44 -08:00
Enrico Ros e263922b43 Anthropic: minor status message update 2025-02-11 19:47:00 -08:00
Enrico Ros a4172a74d1 StorageUtils: improve display 2025-02-11 19:44:10 -08:00
Enrico Ros b1fb2aeeb3 Wizard: improve selectors 2025-02-11 19:26:20 -08:00
Enrico Ros 4f3c2b7b8c Reconfigure All Models on hash changes 2025-02-11 17:06:16 -08:00
Enrico Ros ec493ee91b Wizard: Models 2025-02-11 17:04:11 -08:00
Enrico Ros 2200bb9ee8 Anthropic: less intrusive fallback message 2025-02-11 14:59:06 -08:00
Enrico Ros 588129436d Push down: cml background 2025-02-11 13:22:49 -08:00
Enrico Ros fed51d9959 Add icon 2025-02-11 12:41:40 -08:00
Enrico Ros e6af5e77f8 Models modal: simplify (disable the 'all services' button) 2025-02-11 12:40:29 -08:00
Enrico Ros 2eb230d366 Models list: verbiage 2025-02-11 12:39:47 -08:00
Enrico Ros a66ecd7660 Modal: add darken bottom 2025-02-11 12:39:38 -08:00
Enrico Ros 46a9459b7d LocalAI: mark one more 2025-02-11 12:23:11 -08:00
Enrico Ros 0a34dae6c0 Models config: improve add service ux 2025-02-11 12:23:03 -08:00
Enrico Ros 2209a76f25 Models config: improve costs display again 2025-02-11 12:22:50 -08:00
Enrico Ros ba2e27dc7e Models config: improve costs display 2025-02-11 11:57:06 -08:00
Enrico Ros 5f5cedb428 Models config: small ux fix 2025-02-11 11:34:54 -08:00
Enrico Ros a4da127078 Merge pull request #754 from jayrepo/patch-2
link ssl3 for builder
2025-02-11 08:49:29 -08:00
Jay Chen 109d0ffab6 link ssl3 for builder 2025-02-11 23:19:52 +08:00
Enrico Ros 3af2eb1b59 Gemini: update models 2025-02-10 00:45:41 -08:00
Enrico Ros 51d3f37058 Small ux hint 2025-02-09 21:41:28 -08:00
Enrico Ros 3b76018db9 LLMs: OpenAI: decouple reasoning effort an restore markdown 2025-02-09 21:33:45 -08:00
Enrico Ros 271d42c09f OpenAI: restore markdown even of missing developer messages 2025-02-09 21:26:56 -08:00
Enrico Ros ddfb7f0e88 Attach: auto-detect simplify (one button instead of N) 2025-02-07 04:24:36 -08:00
Enrico Ros 3cb8ce1b3b Attach: auto-detect URLs 2025-02-07 04:02:54 -08:00
Enrico Ros 42b00f4942 Ollama: JSON mode is dangerous, say it. Fixes #749 2025-02-03 17:30:34 -08:00
Enrico Ros 749c7ce796 OpenAI: chatgpt-4o-latest doesn't support tools 2025-02-03 11:40:58 -08:00
Enrico Ros 27ff214d04 ChatDrawer: sync once a minute so we don't get unexpected regroup flashes 2025-02-02 09:59:23 -08:00
Enrico Ros 46ff3c293a Fix link 2025-02-02 09:52:52 -08:00
Enrico Ros c034e9f2ee Link FAQs 2025-02-02 09:48:23 -08:00
Enrico Ros b2c5cebc08 Update help-faq.md 2025-02-02 09:26:08 -08:00
Enrico Ros 0017a6b0f9 Create help-faq.md 2025-02-02 09:17:28 -08:00
Enrico Ros a2c9df06de Quick update 2025-02-01 09:18:22 -08:00
Enrico Ros 4152510452 Update README.md 2025-01-31 20:27:26 -08:00
Enrico Ros d253f7279a LocalAI: improve naming, interfaces 2025-01-31 20:01:04 -08:00
Enrico Ros b186caa1d0 Mo ar re al 2025-01-31 18:14:43 -08:00
Enrico Ros f99ac2f471 Update README.md 2025-01-31 18:11:58 -08:00
Enrico Ros 409af6e23e Update README.md 2025-01-31 18:09:04 -08:00
Enrico Ros 36d81e027b Composer: fix dependency 2025-01-31 16:57:02 -08:00
Enrico Ros 2a0cb6125a Thinking: auto-detect blocks 2025-01-31 16:54:16 -08:00
Enrico Ros b65ef1289a Gemini: undocumented safety 2025-01-31 16:35:55 -08:00
Enrico Ros e67f1fb974 Perplexity: add Sonar Reasoning 2025-01-31 15:17:42 -08:00
Enrico Ros 292d7c9e05 Beam: brain-ready 2025-01-31 14:46:42 -08:00
Enrico Ros 617cb79299 DeepSeek: reasoning hint 2025-01-31 14:43:27 -08:00
Enrico Ros dbad11ad9a OpenAI o3: namings. Support complete. 2025-01-31 12:58:27 -08:00
Enrico Ros 04cb6d2538 OpenAI o3: max_completion_tokens and developer message 2025-01-31 12:56:33 -08:00
Enrico Ros b6ff3852a0 OpenAI o3: strip images 2025-01-31 12:55:53 -08:00
Enrico Ros 70a68bb676 OpenAI: models change visibility 2025-01-31 12:50:39 -08:00
Enrico Ros e04fc80b62 OpenAI: models sorting 2025-01-31 12:50:27 -08:00
Enrico Ros 35d63e7894 OpenAI: sorted models 2025-01-31 12:33:58 -08:00
Enrico Ros 9e71358ae2 OpenAI o3: models update 2025-01-31 12:27:35 -08:00
Enrico Ros 0891b103e0 Ctrl+L: attach web link 2025-01-31 12:00:16 -08:00
Enrico Ros 2480904929 Docs: add a Data Ownership guide 2025-01-31 11:13:33 -08:00
Enrico Ros da903d1879 Fix Mobile Open Pane unnecessary padding 2025-01-31 09:23:03 -08:00
Enrico Ros eafc009ff0 Optima: optimize, add 'gone' functionality 2025-01-31 08:51:08 -08:00
Enrico Ros 3023bcaf95 /tools folder 2025-01-30 22:03:15 -08:00
Enrico Ros 2d29953318 1.92.0-RC1 2025-01-30 22:02:58 -08:00
Enrico Ros 6b9ec4bc05 Fix Autocomplete issue 2025-01-30 16:43:59 -08:00
Enrico Ros 540176059a Add Mistral-3 (24B) 2025-01-30 14:57:18 -08:00
Enrico Ros 9051354c58 Mistral: hide symlinks 2025-01-30 13:00:47 -08:00
Enrico Ros 26985aeacb Mistral: update models 2025-01-30 12:58:42 -08:00
Enrico Ros c2a84c7f93 Autocomplete the tags 2025-01-29 14:06:21 -08:00
Enrico Ros 51975f6748 Ollama: match vision support 2025-01-29 13:21:51 -08:00
Enrico Ros 6fdc16c33f Ollama: update models 2025-01-29 13:10:22 -08:00
Enrico Ros ed4f347563 Ollama: add description 2025-01-29 12:41:46 -08:00
Enrico Ros a1cdb3b273 OpenPipe: extract models file 2025-01-29 11:52:32 -08:00
Enrico Ros 8b8088b74a Azure: move models function 2025-01-29 11:52:19 -08:00
Enrico Ros 94e9f2678d Together: note 2025-01-29 11:49:45 -08:00
Enrico Ros 05965e749a OpenRouter: extract models functions 2025-01-29 11:49:39 -08:00
Enrico Ros 1a9cea263f Relax status check for Azure Openai. Fixes #744 2025-01-29 10:39:59 -08:00
Enrico Ros 966c402ecc Deepseek: fix assistant message alternation 2025-01-28 23:50:24 -08:00
Enrico Ros d5e0a3e4f6 Deepseek: better namings 2025-01-28 23:41:53 -08:00
Enrico Ros 2fafca7dfd OpenRouter: support reasoning sideband 2025-01-28 23:41:42 -08:00
Enrico Ros bfbd1bcfed OpenRouter: update visibility 2025-01-28 23:20:19 -08:00
Enrico Ros c1d476a991 Together: update models 2025-01-28 23:20:12 -08:00
Enrico Ros f7b78ca855 Together: update models 2025-01-28 23:08:12 -08:00
Enrico Ros 0e1429b604 Together: update parsers 2025-01-28 23:05:22 -08:00
Enrico Ros 57f2ca6460 Groq: update models 2025-01-28 22:19:49 -08:00
Enrico Ros e1d8dabd3d Groq: extract models 2025-01-28 21:57:38 -08:00
Enrico Ros d498287f76 Fix env var parsing 2025-01-28 21:43:36 -08:00
Enrico Ros 8a3026e43e Roll packages 2025-01-28 21:40:03 -08:00
Enrico Ros 133f26c691 Optimize: StatusBar 2025-01-28 21:37:23 -08:00
Enrico Ros 9b169d1f43 Pmix: reduce verbosity 2025-01-28 21:27:37 -08:00
Enrico Ros 2c331f9a65 useDebugHook: invert params 2025-01-28 21:27:26 -08:00
Enrico Ros b9e8559002 Optimize BlockPartModelAux 2025-01-28 21:14:15 -08:00
Enrico Ros a8f843fea5 Fix reasoning chip outline 2025-01-28 21:11:22 -08:00
Enrico Ros a0da3b564f Beam: fix z-index 2025-01-28 21:10:48 -08:00
Enrico Ros bdc5e09ecc ChatMessageList: cleanup 2025-01-25 10:12:43 -08:00
Enrico Ros d88e16dccf CSV: switch to a newer lib 2025-01-24 00:35:47 -08:00
Enrico Ros 77680fcdc9 CSV: Improve Buttons 2025-01-24 00:35:43 -08:00
Enrico Ros 6afcc42c38 Beam: fix exposition of multiple fragments. 2025-01-23 14:57:25 -08:00
Enrico Ros 0bf7b86217 Reasoning style. 2025-01-23 14:48:05 -08:00
Enrico Ros fa306338aa Largely disable the NoVoid duplications (fragments, messages, conversations) 2025-01-23 13:42:33 -08:00
Enrico Ros 5921a099d9 Roll packages 2025-01-23 13:31:01 -08:00
Enrico Ros e6dd1f0c48 Inline Thinking Fragments 2025-01-23 09:54:18 -08:00
Enrico Ros ae8602a769 Gemini: show thoughts 2025-01-23 08:36:46 -08:00
Enrico Ros 8d86636a95 Gemini: access that can switch to v1alpha 2025-01-23 08:25:49 -08:00
Enrico Ros 87a9191013 Gemini: update Models 2025-01-23 08:25:49 -08:00
Enrico Ros e847933c3c AIX: Gemini: dispatch vndGeminiShowThoughts 2025-01-23 08:25:49 -08:00
Enrico Ros ad7280c065 LLMs: Parameters: add Google CoT 2025-01-23 08:24:42 -08:00
Enrico Ros b124bac190 LLMs: Parameters: apply initial values where Defined 2025-01-23 08:24:32 -08:00
Enrico Ros 6f926f4849 Perplexity: update models 2025-01-23 06:38:35 -08:00
Enrico Ros 48df9d4af6 Deep roll. 2025-01-21 18:35:01 -08:00
Enrico Ros a5d0c183a7 Roll packages 2025-01-21 18:28:31 -08:00
Enrico Ros 37354484c2 Cherry-picked the 1.16.9 release, to update GitHub desc. 2025-01-21 18:22:09 -08:00
Enrico Ros eeae13d4ba v1-dev has been fully obsoleted and removed. v1-stable is the stable v1, for Docker and big-agi.com, while v2-dev will soon become the new stable and replace v1-stable. 2025-01-21 18:19:46 -08:00
Enrico Ros c84b474632 Improve bug reporting. Require where. 2025-01-21 17:41:35 -08:00
Enrico Ros a207030899 Move puppeteer-core to non-dev dependency. Fixes #732 2025-01-21 17:38:56 -08:00
Enrico Ros b97e28ad3b YouTube transcripts: improve module 2025-01-21 17:35:16 -08:00
Enrico Ros b307adda99 DeepSeek: don't disable FC 2025-01-20 20:44:15 -08:00
Enrico Ros 069421f47a Roll AIX for reasoning & temperature changes 2025-01-20 09:31:43 -08:00
Enrico Ros 8f1a11757f DeepSeek: update Reasoner Short name. Fixes #726 2025-01-20 09:13:46 -08:00
Enrico Ros 3fa5f07f51 RenderPlainText: opti 2025-01-20 09:13:46 -08:00
Enrico Ros 8b8a200b83 Render ModelAux Void (reasoning) fragments 2025-01-20 09:13:46 -08:00
Enrico Ros 2c87d3e714 AIX & Fragments: ModelAux Void Part 2025-01-20 08:15:28 -08:00
Enrico Ros ddf3b54917 AIX: Deepseek: parse and transmit reasoning text 2025-01-20 07:28:10 -08:00
Enrico Ros 846da8e17d AIX: client, hotfix-no-temperature 2025-01-20 07:13:08 -08:00
Enrico Ros 0d0d414fc8 AIX: dispatches: optional temperature 2025-01-20 07:13:08 -08:00
Enrico Ros 0c01bce460 DeepSeek: R1 model 2025-01-20 07:13:08 -08:00
Enrico Ros 37c83ce039 LLMs: disable temperature editing by hotfix-interface 2025-01-20 07:13:08 -08:00
Enrico Ros 9e504d577e LLMs: hotfix no-temperature and nullable temperature 2025-01-20 06:58:54 -08:00
Enrico Ros ab70692c49 Mermaid: stop double quoting 2025-01-16 22:39:18 -08:00
Enrico Ros d48f594147 SVG: fix <xml>\n<svg> parsing 2025-01-16 22:39:08 -08:00
Enrico Ros 3e4e634c97 useLLMChain: make the callback optional 2025-01-15 20:04:45 -08:00
Enrico Ros 0e17a0bcd0 Roll packages deep 2025-01-15 19:29:04 -08:00
Enrico Ros 32e0d32dea Roll packages 2025-01-15 19:27:44 -08:00
Enrico Ros 1ecf355346 Relocate YouTubeURLInput 2025-01-15 19:26:13 -08:00
Enrico Ros 2ff15b54af OptimaDrawerHeader: support title click 2025-01-15 19:24:25 -08:00
Enrico Ros 30ac3f8c0a AIX: OpenRouter: multi-key support. See #653 2025-01-15 01:11:20 -08:00
868 changed files with 103224 additions and 17945 deletions
+1
View File
@@ -0,0 +1 @@
commands/code/apply-issue-main.md
+20
View File
@@ -0,0 +1,20 @@
---
description: Increment the AIX monotonic version number
allowed-tools: Bash(git add:*),Bash(git status:*),Bash(git commit:*),Edit,Write
model: haiku
disable-model-invocation: true
---
Increment `Monotonics.Aix` in `src/common/app.release.ts` and commit it.
**Pre-flight checks (MUST pass or abort):**
1. Run `git branch --show-current` - MUST be on `main` branch
2. Run `git status src/common/app.release.ts` - file MUST be unmodified (no changes on this specific file)
**Execute:**
1. Read current `Monotonics.Aix` value from `src/common/app.release.ts`
2. Increment by 1
3. Update ONLY that line
4. Run: `git add src/common/app.release.ts && git commit -m "Roll AIX"`
Confirm new version number.
@@ -0,0 +1,31 @@
---
description: Sync Anthropic API implementation with latest upstream documentation
argument-hint: specific feature to check
---
Please take a look at my API code for Anthropic: message wire types `src/modules/aix/server/dispatch/wiretypes/anthropic.wiretypes.ts`, assembly of the request messages (adapters) `src/modules/aix/server/dispatch/chatGenerate/adapters/anthropic.messageCreate.ts`, and parsing of the response in streaming or not `src/modules/aix/server/dispatch/chatGenerate/parsers/anthropic.parser.ts`.
IMPORTANT: we only support the Messages API (message create). We do NOT support other APIs such as the older Completions API.
We support Anthropic caching natively, and want to make sure tools and state (crafting the history) are also done well.
Then take a look at the newest API information available. Try these sources, and be creative if some are blocked:
**Primary Sources:**
- Docs API: https://docs.claude.com/en/api/messages
- Release notes: https://docs.claude.com/en/release-notes/api
- Tools use: https://docs.claude.com/en/docs/agents-and-tools/tool-use/overview
- Handling stop reasons: https://docs.claude.com/en/api/handling-stop-reasons
**Alternative Sources if primary blocked:**
- Anthropic TypeScript SDK: https://github.com/anthropics/anthropic-sdk-typescript
- Anthropic Python SDK: https://github.com/anthropics/anthropic-sdk-python
- Recent news and announcements: Web Search for "anthropic api changelog" or "new claude api" or "new claude api pricing"
**If all blocked:** Explain what you attempted and ask user to provide documentation manually.
$ARGUMENTS
Check carefully and look if there are any discrepancies in the protocols, the available API surface, the structure of the messages, functionality, logic, etc.
Make sure you look deep in the fields of the requests and responses, especially required fields, streaming event types, and any new response shapes.
Please point out all of the differences in the API whether it's in the final parsing and reassembly of the streaming message, or the protocol changed, etc.
Prioritize breaking changes and new capabilities that would improve the user experience.
+30
View File
@@ -0,0 +1,30 @@
---
description: Sync Google Gemini API implementation with latest upstream documentation
argument-hint: specific feature to check
---
Please take a look at my API code for Google Gemini: message wire types `src/modules/aix/server/dispatch/wiretypes/gemini.wiretypes.ts`, assembly of the request messages (adapters) `src/modules/aix/server/dispatch/chatGenerate/adapters/gemini.generateContent.ts`, and parsing of the response in streaming or not `src/modules/aix/server/dispatch/chatGenerate/parsers/gemini.parser.ts`.
IMPORTANT: we only support the generateContent API, not other Gemini APIs such as embeddings, etc.
Caching is only supported when implicit, we do not explicitly manage Gemini Caches. Same for file uploads and other systems.
Image generation happens through models, i.e. 'Gemini 2.5 Flash - Nano Banana' generates images using AIX from generateContent (chat input).
Then take a look at the newest API information available. Try these sources, and be creative if some are blocked:
**Primary Sources:**
- Docs API 1/2: https://ai.google.dev/api/generate-content
- Docs API 2/2: https://ai.google.dev/api/caching#Content
- Release notes: https://ai.google.dev/gemini-api/docs/changelog
**Alternative Sources if primary blocked:**
- Google AI JavaScript SDK: https://github.com/googleapis/js-genai (check latest commits, README, type definitions)
Recent news and announcements: Web Search for "gemini api changelog" or "nwe gemini api updates" or "new gemini api pricing"
**If all blocked:** Explain what you attempted and ask user to provide documentation manually.
$ARGUMENTS
Check carefully and look if there are any discrepancies in the protocols, the available API surface, the structure of the messages, functionality, logic, etc.
Make sure you look deep in the fields of the requests and responses, especially required fields, streaming event types, and any new response shapes.
Please point out all of the differences in the API whether it's in the final parsing and reassembly of the streaming message, or the protocol changed, etc.
Prioritize breaking changes and new capabilities that would improve the user experience.
+34
View File
@@ -0,0 +1,34 @@
---
description: Sync OpenAI API implementation with latest upstream documentation
argument-hint: specific feature to check
---
Please take a look at my API code for OpenAI: message wire types `src/modules/aix/server/dispatch/wiretypes/openai.wiretypes.ts`, assembly of the request messages (adapters) `src/modules/aix/server/dispatch/chatGenerate/adapters/openai.chatCompletions.ts`, and parsing of the response in streaming or not `src/modules/aix/server/dispatch/chatGenerate/parsers/openai.parser.ts`.
IMPORTANT: we prioritize the new Responses API, while Chat Completions is still supported but legacy.
We do NOT support other APIs such as Realtime (incl. websockets), etc.
We also do not support Agentic APIs (Agent SDK, AgentKit, ChatKit, Assistants API etc), as we perform similar functionality in AIX (server or client side).
Then take a look at the newest API information available. Try these sources, and be creative if some are blocked:
**Primary Sources:**
- Responses API (AIX prioritizes it): https://platform.openai.com/docs/api-reference/responses/create
- Chat Completions API: https://platform.openai.com/docs/api-reference/chat/create
- Changelog: https://platform.openai.com/docs/changelog
- Models: https://platform.openai.com/docs/models
- Pricing (use Copy Page button to download markdown): https://platform.openai.com/docs/pricing
**Alternative Sources if primary blocked:**
- OpenAI Node.js SDK: https://github.com/openai/openai-node
- OpenAI Python SDK: https://github.com/openai/openai-python
- OpenAI OpenAPI spec: https://github.com/openai/openai-openapi
Recent news and announcements: Web Search for "openai api changelog" or "openai new models" or "openai new prices"
**If all blocked:** Explain what you attempted and ask user to provide documentation manually.
$ARGUMENTS
Check carefully and look if there are any discrepancies in the protocols, the available API surface, the structure of the messages, functionality, logic, etc.
Make sure you look deep in the fields of the requests and responses, especially required fields, streaming event types, and any new response shapes.
Please point out all of the differences in the API whether it's in the final parsing and reassembly of the streaming message, or the protocol changed, etc.
Prioritize breaking changes and new capabilities that would improve the user experience.
@@ -0,0 +1,49 @@
---
description: Sync OpenRouter API implementation with latest upstream documentation
argument-hint: specific feature to check
---
Review the OpenRouter implementation:
- Models list: `src/modules/llms/server/openai/openrouter.wiretypes.ts` (list API response schema)
- Chat wire types: `src/modules/aix/server/dispatch/wiretypes/openai.wiretypes.ts` (OpenAI-compatible)
- Request adapter: `src/modules/aix/server/dispatch/chatGenerate/adapters/openai.chatCompletions.ts` ('openrouter' dialect)
- Response parser: `src/modules/aix/server/dispatch/chatGenerate/parsers/openai.parser.ts` (shared OpenAI parser)
- Vendor config: `src/modules/llms/vendors/openrouter/openrouter.vendor.ts`
GOAL: Ensure complete support for OpenRouter's API including advanced features like reasoning/thinking tokens, tool use, search integration, and multi-modal capabilities. OpenRouter is OpenAI-compatible but has important extensions and differences.
Use Task tool with subagent_type=Explore and thoroughness="very thorough" to discover:
1. Map API structure - all endpoints, parameters, capabilities from https://openrouter.ai/docs
2. **Advanced features** - How to use: reasoning/thinking tokens (o1, DeepSeek R1), tool use/function calling, search integration, multi-modal (vision/audio)
3. Changelog location - How does OpenRouter communicate API updates and breaking changes?
4. Model metadata - What capabilities are exposed in the models list API? How to detect feature support?
5. OpenAI deviations - Extensions, special headers (HTTP-Referer, X-Title), response fields, streaming differences
Then check the latest API information. Try these sources (be creative if blocked):
**Primary Sources:**
- API Reference: https://openrouter.ai/docs/api-reference
- Chat Completions: https://openrouter.ai/docs/api-reference#chat-completions
- Models List: https://openrouter.ai/docs/api-reference#models-list
- Parameters Guide: https://openrouter.ai/docs/parameters
- Announcements: https://openrouter.ai/announcements (feature launches, API updates, new models)
- Models Directory: https://openrouter.ai/models (check metadata for capabilities)
**Alternative Sources:**
- GitHub: https://github.com/OpenRouterTeam (SDKs, examples, issues for recent changes)
- Web Search: "openrouter api changelog" or "openrouter reasoning tokens" or "openrouter tool use"
**If blocked:** Ask user to provide documentation.
$ARGUMENTS
Focus on discrepancies and gaps:
- **Request/Response structure**: New fields, changed requirements, streaming event types
- **Feature support**: Thinking tokens format, tool calling protocol, search parameters
- **Model capabilities**: How to detect and enable advanced features per model
- **OpenRouter extensions**: Headers, routing, fallbacks, rate limiting (free vs paid)
- **Breaking changes**: Protocol updates, deprecated fields, new required parameters
Report differences in wire types, adapter logic, parser handling, or dialect-specific quirks.
Prioritize new capabilities that improve user experience (reasoning visibility, better tool use, etc.).
When making changes, add comments with date: `// [OpenRouter, 2026-MM-DD]: explanation`
+56
View File
@@ -0,0 +1,56 @@
---
description: Sync xAI Responses API implementation with latest upstream documentation
argument-hint: specific feature to check
---
Review the xAI Responses API implementation:
- xAI wire types: `src/modules/aix/server/dispatch/wiretypes/xai.wiretypes.ts` (xAI-specific request schema, tools)
- Request adapter: `src/modules/aix/server/dispatch/chatGenerate/adapters/xai.responsesCreate.ts` (AIX → xAI Responses API)
- Response parser: `src/modules/aix/server/dispatch/chatGenerate/parsers/openai.responses.parser.ts` (shared with OpenAI Responses)
- Dispatch routing: `src/modules/aix/server/dispatch/chatGenerate/chatGenerate.dispatch.ts` (dialect='xai' routing)
- OpenAI shared types: `src/modules/aix/server/dispatch/wiretypes/openai.wiretypes.ts` (InputItem/OutputItem schemas reused by xAI)
IMPORTANT context:
- We use ONLY the xAI Responses API (`POST /v1/responses`). We do NOT use the Chat Completions API (`/v1/chat/completions`) for xAI anymore.
- xAI's Responses API is similar to OpenAI's but has key differences - the skill should find what changed since our last sync.
- Response streaming/parsing reuses the OpenAI Responses parser since the format is compatible.
- We do NOT implement: Files API, Collections Search, Remote MCP tools, Voice Agent API, Image/Video generation, Batch API, or Deferred Completions.
Then take a look at the newest API information available. Try these sources, and be creative if some are blocked:
**Primary Sources (guide pages work well with WebFetch despite being JS-rendered):**
- Responses API Guide: https://docs.x.ai/docs/guides/chat
- Stateful Responses: https://docs.x.ai/docs/guides/responses-api
- Tools Overview: https://docs.x.ai/docs/guides/tools/overview
- Search Tools (web_search, x_search): https://docs.x.ai/docs/guides/tools/search-tools
- Code Execution Tool: https://docs.x.ai/docs/guides/tools/code-execution-tool
- Function Calling: https://docs.x.ai/docs/guides/function-calling
- Streaming: https://docs.x.ai/docs/guides/streaming-response
- Reasoning: https://docs.x.ai/docs/guides/reasoning
- Structured Outputs: https://docs.x.ai/docs/guides/structured-outputs
- Models & Pricing: https://docs.x.ai/developers/models
- Release Notes: https://docs.x.ai/developers/release-notes
- API Reference: https://docs.x.ai/developers/api-reference#create-new-response
**Alternative Sources if primary blocked:**
- xAI Python SDK: https://github.com/xai-org/xai-sdk-python
- Web Search for "xai grok api changelog 2026" or "xai responses api new features"
**If all blocked:** Explain what you attempted and ask user to provide documentation manually.
$ARGUMENTS
Check carefully for discrepancies between our implementation and the current API docs:
1. **Request fields**: Compare `XAIWire_API_Responses.Request_schema` against current docs - any new, changed, or deprecated parameters?
2. **Tool definitions**: Compare `XAIWire_Responses_Tools` - any new parameters on web_search/x_search/code_interpreter? Any new hosted tool types?
3. **Input/Output item types**: Any xAI-specific output items not handled by the shared OpenAI parser (e.g., x_search_call, web_search_call, code_interpreter_call)?
4. **Streaming events**: Any xAI-specific SSE event types beyond what the OpenAI Responses parser handles?
5. **Response shape**: Usage reporting differences, new fields in the response object?
6. **Adapter logic**: Message role mapping, content type handling, system message approach - still correct?
7. **Include options**: Any new values for the `include` array?
8. **Reasoning config**: Which models support it and with what values?
Prioritize breaking changes and new capabilities that would improve the user experience.
When making changes, add comments with date: `// [xAI, 2026-MM-DD]: explanation`
**Self-update this skill**: After completing the sync, if your research reveals that assumptions in THIS skill file (`.claude/commands/aix/sync-xai-api.md`) are wrong or outdated - e.g., new APIs we now implement, new tool types added, URLs moved, file paths changed - update this skill file to stay accurate for next time.
+63
View File
@@ -0,0 +1,63 @@
---
description: Search git history for commits that introduce or remove an exact string, within a commit range
argument-hint: "[search-string] [ancestor-commit]"
allowed-tools: Bash(git *)
---
Search git history using `git log -S` (pickaxe) to find commits that add or remove an exact string.
This repo has 7000+ commits, so pickaxe searches can take 30-60+ seconds - this is expected.
## Parameters
- `$0` - The exact string to search for in file contents (not commit messages). Examples: `getLabsSUDO`, `EXPERIMENT_ON_SUDO`, `myFunctionName`
- `$1` - A commit hash or unique commit message substring to identify the start of the range. Examples: `5af80b96a8`, `"Sudo Mode": 10-click`
## Example
```
/code:grep-history EXPERIMENT_ON_SUDO "Sudo Mode": 10-click
```
This searches all commits between the `"Sudo Mode": 10-click` commit and HEAD for any that add or remove the string `EXPERIMENT_ON_SUDO` in file contents.
## Procedure
### Step 1: Resolve the ancestor commit
If `$1` looks like a commit hash (hex string), use it directly.
Otherwise, search for it by message, restricting to ancestors of HEAD:
```bash
git log --oneline --grep='$1' HEAD | head -5
```
This only walks commits reachable from HEAD, so every result is a guaranteed ancestor - no verification loop needed.
If multiple results, pick the oldest (last listed) since it represents the earliest matching commit.
If none, report the error and stop.
### Step 2: Run pickaxe search
```bash
git log -S "$0" --oneline <resolved_ancestor>..HEAD
```
This finds commits where the count of `$0` in the codebase changes (i.e., it was added or removed).
This can be slow on 7000+ commits - wait for it.
### Step 3: Check endpoints
Also check whether the string exists at HEAD and at the ancestor commit:
```bash
git grep -l "$0" HEAD 2>/dev/null || echo "(not found at HEAD)"
git grep -l "$0" <resolved_ancestor> 2>/dev/null || echo "(not found at ancestor)"
```
### Step 4: Report
Present results concisely:
- Number of commits found (or "none")
- List of matching commits (hash + subject line)
- Whether the string exists at HEAD and/or at the ancestor
- If found, suggest next steps (e.g., `git show <hash>` to inspect specific commits)
+34
View File
@@ -0,0 +1,34 @@
---
description: Review in-flight changes for coherence, completeness, and quality
---
Review the current in-flight changes in the big-agi-private repository (dev branch, continuously rebased ~1800 commits on top of main).
**Step 1: Scope and read**
`git diff --stat` + `git status` for breadth. Then full `git diff` (if empty: `git diff --cached`, then `git diff HEAD~1`).
For every file in the diff, read surrounding context in the actual source file - the diff alone hides bugs in adjacent untouched code.
**Step 2: Reverse-engineer the intent**
From the diff, determine the **what**, **how**, and **why**. Present this concisely so the author can confirm or correct,
but don't stop here, continue to the full review in the same response.
**Step 3: Validate**
Run `tsc --noEmit --pretty` and `npm run lint` (in parallel). Report any errors with the review.
If the diff removes/renames identifiers, grep the codebase for stale references to the OLD names. This catches broken guards, stale imports, and incomplete migrations.
**Step 4: Deep review**
Evaluate every file in the diff.
Leave no rocks unturned - correctness, coherence, completeness, excess, generalization, maintenance burden,
codebase consistency, etc.
**Step 5: Prioritized next steps**
Think about what happens when the next developer touches this code.
Rank findings by severity (bug > correctness > cleanup > cosmetic). Be specific about what to change and where.
Remember: design values for this codebase: orthogonal features, features that generalize well, modularized and reusable code,
type-discriminated data, optimized code, zero maintenance burden. Minimize future pain, etc.
+57
View File
@@ -0,0 +1,57 @@
---
description: Show a hierarchical progress tree of the current conversation thread
---
Analyze this conversation thread and produce a **hierarchical progress tree** - a vertical breadcrumb of the chat and actions from the very start to now.
**Format:**
A tree, where every rabbithole that was taken adds a level.
```
[ ] Brief initial phase/ask/goal description
[x] Specific thing done or decided - "user quote if relevant"
[x] Another step
[ ] Sub-phase/rabbithole/etc
[x] Done step (if important)
[ ] Sub-sub-phase
[ ] Current step doing <-- HERE
[ ] Next step since this sub-sub-phase was broken out
[ ] Remaining step
[ ] ...
[ ] Missing, back to the main goal
[ ] ...
### What do we rewind the rabbithole to (once the current level is complete)?
...
### What's up (towards user value) and down (towards deeper code levels) the rabbithole?
...
### What's a good hyphenated title for this chat?
...
```
**Rules:**
- `[x]` done, `[ ]` not done. Parent is done only when ALL children on the next level are `[x]`
- Each node: a few words, specific. Quote the user briefly when it captures the intent
- Group by logical phases or rabbitholes (when descending to a deeper level of implementation or going off for a temporary tangent or sub-quest), not by messages
- Earlier levels that are fully completed don't need to be expanded in subtasks
- Root nodes/completed nodes need to show what was "wanted" from them, not being checked because they are shown as earlier phases (i.e. upper hierarchy contains more)
- Some earlier sub-phases or even levels of rabbitholes can be marked as done as indented [x] below each other (do not add non-major bullets on already completed nodes)
- Insert newlines in between large groups of items
- Decisions: state what was chosen, not the alternatives
- If a former phase produced no code change or decision, omit
- Very important to insert incomplete `[ ]` items for things that wre mentioned and are likely useful but mentioned at higher levels of the rabbithole so they must come after, when unwinding the stack
- Keep it short, tight (min 0 max item count below *ONE QUARTER the user messages*). This is a navigation aid, not a transcript
It's important for this to represent a high-level sequence of important actions and turns and pivots and rabbiholes, all focuses on trying to solve something.
First think through it looking at all the chat from the back to the front, then front to back, user requests, and understand the main storybeats. This is useful especially to remove already done leaves that don't add much if shown.
So think about the full list, so you have it all in front of you when you do the last pass to show it to me.
It's important to see the progress of what we were doing (e.g. see that we set out to do something at the beginning, but a few items of those are still incomplete, also because we took 2 detours to fix more things in the meantime...).
At the end anser the questions in the Format, with brief bullet points.
+63
View File
@@ -0,0 +1,63 @@
---
description: Sync LLM parameter options between full model dialog and chat side panel
---
Audit and sync LLM parameter configurations between the two UI editors. Goal: identical `value` fields in option arrays + equivalent onChange logic. Labels/descriptions can differ for UI space.
**Files to Compare:**
1. **Full Model Dialog**: `src/modules/llms/models-modal/LLMParametersEditor.tsx` (main branch)
2. **Chat Side Panel**: `src/apps/chat/components/layout-panel/ChatPanelModelParameters.tsx` (main derived branches only)
**Reference Documentation:**
- Parameter system: `kb/systems/LLM-parameters-system.md`
- Parameter registry: `src/common/stores/llms/llms.parameters.ts`
**Task: Perform a comprehensive audit**
1. **Read both files** and extract all option arrays (e.g., `_reasoningEffortOptions`, `_antEffortOptions`, `_geminiThinkingLevelOptions`, etc.)
2. **Check for missing parameters:**
- Parameters handled in `LLMParametersEditor.tsx` but NOT in `ChatPanelModelParameters.tsx`
- Parameters in `ChatPanelModelParameters.tsx`'s `_interestingParameters` array but missing UI controls
- Note: The side panel intentionally shows only "interesting" parameters - focus on those listed in `_interestingParameters`
3. **Check for value mismatches** between corresponding option arrays:
- Different number of options (e.g., 3 vs 4 options)
- Same label but different `value` (this causes the bug in issue #926)
- Different labels for the same `value`
- Missing `_UNSPECIFIED`/Default option in one but not the other
4. **Check onChange handler consistency:**
- Both should remove parameter on `_UNSPECIFIED` selection
- Both should set explicit values the same way
- Watch for conditions like `value === 'high'` that may differ
**Output Format:**
```
## Parameter Sync Audit Report
### Missing Parameters
- [ ] `llmVndXyz` - In full dialog, missing from side panel
### Value Mismatches
- [ ] `_xyzOptions`:
- Full dialog: [values...]
- Side panel: [values...]
- Issue: [description]
### Handler Inconsistencies
- [ ] `llmVndXyz` onChange differs: [explanation]
### Recommended Fixes
1. [Specific fix with code snippet if needed]
```
**Fix Direction:** Full dialog is source of truth. Update side panel to match its values when mismatched.
**Notes:**
- Side panel uses shorter descriptions (space-constrained) - that's fine
- Variable names may differ (e.g., `_anthropicEffortOptions` vs `_antEffortOptions`) - that's fine, but same is better
- `value` fields must be identical sets
- `_UNSPECIFIED` must mean the same thing in both
- onChange: remove on `_UNSPECIFIED`, set explicit value otherwise
@@ -0,0 +1,20 @@
---
description: Update Alibaba model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/openai/models/alibaba.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Primary Sources:**
- Models & Pricing: https://www.alibabacloud.com/help/en/model-studio/models
- Billing Guide: https://www.alibabacloud.com/help/en/model-studio/billing-for-model-studio
**Fallbacks if blocked:**
- Search "alibaba model studio latest pricing", "alibaba latest models", "qwen models pricing", or search GitHub for latest model prices and context windows
**Important:**
- Review the full model list for additions, removals, and price changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,49 @@
---
description: Update Anthropic model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/anthropic/anthropic.models.ts` with latest model definitions.
Reference files (for context only, do not modify):
- `src/modules/llms/server/llm.server.types.ts`
- `src/modules/llms/server/models.mappings.ts`
- `src/common/stores/llms/llms.parameters.ts`
**Workflow: Start with recent changes, then verify the full model list.**
**Primary Sources (append `.md` to any path for clean markdown):**
1. Recent changes: https://platform.claude.com/docs/en/release-notes/overview.md
2. Models & IDs: https://platform.claude.com/docs/en/about-claude/models/overview.md
3. Pricing (base, cache, batch, long context): https://platform.claude.com/docs/en/about-claude/pricing.md
4. Deprecations & retirement dates: https://platform.claude.com/docs/en/about-claude/model-deprecations.md
**Discovering feature docs:** The release notes and models overview markdown
contain inline links to feature-specific pages (thinking modes, effort,
context windows, what's-new pages, etc.). When a new capability is
referenced, follow those links - append `.md` to get markdown. Examples of
pages you might discover this way:
- `about-claude/models/whats-new-claude-*` - per-generation changes
- `build-with-claude/extended-thinking` - thinking budget configuration
- `build-with-claude/effort` - effort parameter levels
- `build-with-claude/adaptive-thinking` - adaptive thinking mode
**Fallback web pages** (crawl if `.md` paths break or structure changes):
- https://platform.claude.com/docs/en/about-claude/models/overview
- https://platform.claude.com/docs/en/about-claude/pricing
- https://platform.claude.com/docs/en/release-notes/overview
- https://claude.com/pricing
**Fallbacks if blocked:** Check the Anthropic TypeScript SDK at
https://github.com/anthropics/anthropic-sdk-typescript, or web-search
for "anthropic models latest pricing" / "anthropic latest models".
**Important:**
- Review the full model list for additions, removals, and price changes
- For new models: check which `parameterSpecs` are needed (thinking mode,
effort levels, 1M context, skills, web tools) by reading the linked
feature docs and comparing with existing model entries
- When thinking/effort semantics change between generations
(e.g. adaptive vs manual thinking), document in comments
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,22 @@
---
description: Update DeepSeek model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/openai/models/deepseek.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Primary Sources:**
- Pricing: https://api-docs.deepseek.com/quick_start/pricing
- Model List: https://api-docs.deepseek.com/api/list-models
- Release Notes: https://api-docs.deepseek.com/updates (check for version updates like V3.2-Exp)
**Note:** DeepSeek frequently releases new versions with significant pricing changes. Always check release notes first.
**Fallbacks if blocked:** Search "deepseek api latest pricing", "deepseek latest models", "deepseek models list" or search GitHub for latest model prices and context windows
**Important:**
- Review the full model list for additions, removals, and price changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,91 @@
---
description: Update/validate dynamic vendor model parsers (OpenRouter, TogetherAI, Alibaba, Azure, Novita, ChutesAI, FireworksAI, TLUS, LM Studio, LocalAI, FastAPI)
---
Validate that the dynamic (API-fetched) vendor model parsers are up to date and not silently broken.
These vendors do NOT have hardcoded model lists - they fetch models from APIs at runtime. But their parsers, filters, heuristic detection, and capability mapping can break if upstream APIs change. This skill covers all dynamic vendors NOT covered by the other `llms:update-models-{vendor}` skills.
## Vendors to Validate
### High Risk
**OpenRouter** - `src/modules/llms/server/openai/models/openrouter.models.ts`
- Most complex parser. Vendor-specific parameter inheritance (Anthropic thinking variants, Gemini thinking/image, OpenAI reasoning effort, xAI/DeepSeek reasoning).
- Hardcoded family ordering list (lines ~24-37) - check if new leading vendors are missing.
- Hardcoded old/deprecated model hiding list (lines ~39-49) - check if stale.
- Cache pricing detection (Anthropic-style vs OpenAI-style) - verify format still valid.
- Variant injection for Anthropic thinking/non-thinking - verify still correct.
- Reference: https://openrouter.ai/docs/models
### Medium Risk
**Novita** - `src/modules/llms/server/openai/models/novita.models.ts`
- Features array mapping (`function-calling`, `reasoning`, `structured-outputs`) and input modalities parsing.
- Pricing unit conversion (hundredths of cent per million → dollars per 1K).
- Hostname heuristic: `novita.ai`.
**ChutesAI** - `src/modules/llms/server/openai/models/chutesai.models.ts`
- Custom `max_model_len` field for context window.
- Assumes all models support Vision + Functions (aggressive).
- Hostname heuristic: `.chutes.ai`.
**FireworksAI** - `src/modules/llms/server/openai/models/fireworksai.models.ts`
- Relies on provider capability flags: `supports_chat`, `supports_image_input`, `supports_tools`.
- Hostname heuristic: `fireworks.ai/`.
**TogetherAI** - `src/modules/llms/server/openai/models/together.models.ts`
- Type allow-list (`type: 'chat'`), vision detection by string match.
- Custom wire schema with pricing conversion.
**TLUS** - `src/modules/llms/server/openai/models/tlusapi.models.ts`
- Detected by response structure (`total_models`, `free_models`, `pro_models` fields).
- Capability enum mapping (`text`, `vision`, `audio`, `tool-calling`, `reasoning`, `websearch`).
- Tier-based pricing (`free` vs paid).
**Alibaba** - `src/modules/llms/server/openai/models/alibaba.models.ts`
- Model list was cleared (dynamic-only). Exclusion patterns for non-chat models.
- Assumes 128K context and Vision+Functions for all models (overly permissive).
- Check if hardcoded data should be restored now that naming has stabilized.
### Low Risk (local/generic - validate only if issues reported)
**Azure** - `src/modules/llms/server/openai/models/azure.models.ts`
- Custom deployments API, not `/v1/models`. User-specific. Deployment name fallback logic.
**LM Studio** - `src/modules/llms/server/openai/models/lmstudio.models.ts`
- Local service, native API (`/api/v1/models`). GGUF metadata parsing, capability flags.
**LocalAI** - `src/modules/llms/server/openai/models/localai.models.ts`
- Local service. String-based hide list, vision/reasoning detection by name pattern.
**FastAPI** - `src/modules/llms/server/openai/models/fastapi.models.ts`
- Generic passthrough. Detected by `owned_by === 'fastchat'`. Minimal parsing.
## Validation Checklist
For each vendor (prioritize High > Medium > Low):
1. **Read the parser file** and check for:
- Deny/allow lists that may be stale (new model families missing)
- Capability assumptions that may be wrong (e.g. "all models support vision")
- Field names that may have changed upstream
- Pricing conversion math that may use wrong units
2. **Check upstream docs** (where available) for:
- API response schema changes
- New model types or capability fields
- Deprecated fields
3. **Cross-reference with OpenRouter** (aggregator):
- OpenRouter surfaces models from many of these vendors
- If OpenRouter shows capabilities that a vendor's parser misses, the parser is stale
4. **Fix issues found** - update parsers, filters, deny lists as needed.
5. Run `tsc --noEmit` after changes.
**Important:**
- Do NOT convert dynamic vendors to hardcoded lists - the dynamic approach is intentional
- Focus on parser correctness, not model coverage
- Flag any vendor whose API response format seems to have changed substantially
@@ -0,0 +1,21 @@
---
description: Update Gemini model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/gemini/gemini.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.types.ts`, `src/modules/llms/server/llm.server.types.ts`, and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Primary Sources:**
- Models: https://ai.google.dev/gemini-api/docs/models
- Pricing: https://ai.google.dev/gemini-api/docs/pricing
- Changelog: https://ai.google.dev/gemini-api/docs/changelog
**Fallbacks if blocked:** Check Google AI JS SDK at https://github.com/googleapis/js-genai, search "gemini models latest pricing", "gemini latest models", or search GitHub for latest model prices and context windows
**Important:**
- Ignore context windows (auto-determined at runtime) and training cutoffs (not supported)
- Review the full model list for additions, removals, and price changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review, do NOT remove comments
- Flag broken links or unexpected content
@@ -0,0 +1,19 @@
---
description: Update Groq model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/openai/models/groq.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Primary Source:**
- Fetch https://console.groq.com/docs/models.md directly (markdown format, no search needed)
- Pricing: https://groq.com/pricing/
**Do NOT use web search.** The `.md` endpoint provides structured markdown content directly.
**Important:**
- Review the full model list for additions, removals, and price changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,19 @@
---
description: Update Kimi model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/openai/models/moonshot.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Primary Sources (fetch directly, no search needed):**
- Pricing: https://platform.moonshot.ai/docs/pricing/chat
- API Reference: https://platform.moonshot.ai/docs/api/chat
**Do NOT use web search.** Fetch the URLs directly, or ask the user to provide data, if unaccessible.
**Important:**
- Review the full model list for additions, removals, and price changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,26 @@
---
description: Update MiniMax model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/openai/models/minimax.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Primary Sources:**
- Models & Changelog: https://platform.minimax.io/docs/release-notes/models.md
- Pricing: https://platform.minimax.io/docs/guides/pricing-paygo.md
- Pricing Overview: https://platform.minimax.io/docs/pricing/overview.md
- Text Generation API: https://platform.minimax.io/docs/guides/text-generation.md
**Note:** MiniMax is a hardcoded-only vendor (no `/v1/models` API yet). All model IDs, context windows, and pricing must be manually maintained from the docs. Pay attention to new model releases (M-series), highspeed variants, and deprecated models.
**Fallbacks if blocked:** Search "minimax api models pricing", "minimax m2 m3 models", "minimax api changelog" or check https://openrouter.ai models list for MiniMax entries.
**Important:**
- Models are `ModelDescriptionSchema[]` objects (not ManualMappings) - match existing pattern in the file
- Review the full model list for additions, removals, and price changes
- Check for new `-highspeed` variants and new model families
- Verify context window sizes and max completion tokens against docs
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,24 @@
---
description: Update Mistral model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/openai/models/mistral.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Primary Sources:**
- Models: https://docs.mistral.ai/getting-started/models/models_overview/
- Pricing: https://mistral.ai/pricing#api-pricing
- Changelog: https://docs.mistral.ai/getting-started/changelog/
**Fallbacks if blocked:**
- Search "mistral [model-name] latest pricing", "mistral api latest pricing", "mistral latest models", or search GitHub for latest model prices and context windows
- Cross-reference: pricepertoken.com, helicone.ai, artificialanalysis.ai
- Check Mistral API list models response
- As last resort: Use Chrome DevTools MCP to render pricing table
**Important:**
- Review the full model list for additions, removals, and price changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,41 @@
---
description: Update Ollama model definitions with latest featured models
---
Update `src/modules/llms/server/ollama/ollama.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Automated Workflow:**
```bash
# 1. Fetch the HTML to a cross-platform temp path (sorted by newest for stable ordering)
curl -s "https://ollama.com/library?sort=newest" -o "$(node -p "require('os').tmpdir()")/ollama-newest.html"
# 2. Parse it with the script (auto-finds the file in os.tmpdir())
node .claude/scripts/parse-ollama-models.js 2>&1
```
The parser outputs: `modelName|pulls|capabilities|sizes`
- Example: `deepseek-r1|66200000|tools,thinking|1.5b,7b,8b,14b,32b,70b,671b`
**Primary Sources:**
- Model Library: https://ollama.com/library?sort=newest
- Parser script: `.claude/scripts/parse-ollama-models.js`
**Fallbacks if blocked:** Check https://github.com/ollama/ollama, search "ollama featured models", "ollama latest models", or search GitHub for latest model info
**Important:**
- Parser filtering rules:
- Top 30 newest models are always included (regardless of pull count)
- After top 30, only models with 50K+ pulls are included
- Models with 'cloud' capability are automatically excluded
- Models with 'embedding' capability are automatically excluded
- Sort them in the EXACT same order as the source (newest first, for stable ordering)
- Extract tags: 'tools' → hasTools, 'vision' → hasVision, 'embedding' → isEmbeddings (note the 's'), 'thinking' → tags only
- Extract 'b' tags (1.5b, 7b, 32b) to tags field
- Set today's date (YYYYMMDD format) for newly added models only
- Update OLLAMA_LAST_UPDATE constant to today's date
- Do NOT change dates of existing models
- Review the full model list for additions, removals, and changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments and newlines to make diffs easy to review
@@ -0,0 +1,26 @@
---
description: Update OpenAI model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/openai/models/openai.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Manual hint:** For pricing page, expand all tables before copying content.
**Primary Sources:**
- Models: https://platform.openai.com/docs/models (use Copy Page button)
- Pricing: https://platform.openai.com/docs/pricing (expand tables first)
**Known Issue:** OpenAI docs block automated access (403 Forbidden). Manual browser access required.
**Fallbacks if blocked:**
- Search "openai models latest pricing", "openai latest models" for third-party aggregators, or search GitHub for latest model prices and context windows
- OpenAI Node SDK (https://github.com/openai/openai-node) has limited model metadata only
- As last resort: Use Chrome DevTools MCP to navigate and extract from official docs
**Important:**
- Review the full model list for additions, removals, and price changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,19 @@
---
description: Update OpenPipe model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/openai/models/openpipe.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Primary Sources:**
- Base Models: https://docs.openpipe.ai/base-models
- Pricing: https://docs.openpipe.ai/pricing/pricing
**Fallbacks if blocked:** Search "openpipe models latest pricing", "openpipe latest models", "openpipe base models", or search GitHub for latest model prices and context windows
**Important:**
- Review the full model list for additions, removals, and price changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,20 @@
---
description: Update Perplexity model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/openai/models/perplexity.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Primary Sources:**
- Models: https://docs.perplexity.ai/getting-started/models
- Pricing: https://docs.perplexity.ai/getting-started/pricing
- Changelog: https://docs.perplexity.ai/changelog/changelog
**Fallbacks if blocked:** Search "perplexity api latest pricing", "perplexity latest models", or search GitHub for latest model prices and context windows
**Important:**
- Review the full model list for additions, removals, and price changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,23 @@
---
description: Update xAI model definitions with latest pricing and capabilities
---
Update `src/modules/llms/server/openai/models/xai.models.ts` with latest model definitions.
Reference `src/modules/llms/server/llm.server.types.ts` and `src/modules/llms/server/models.mappings.ts` for context only. Focus on the model file, do not descend into other code.
**Primary Sources:**
- Models & Pricing: https://docs.x.ai/docs/models?cluster=us-east-1#detailed-pricing-for-all-grok-models
**Known Issue:** docs.x.ai blocks automated access (403 Forbidden). Use fallbacks below.
**Fallbacks if blocked:**
- Search "xai grok latest pricing", "xai latest models", "xai api models", or search GitHub for latest model prices and context windows
- Random sites? https://the-rogue-marketing.github.io/grok-api-latest-llms-pricing-october-2025/ (find a newer version), https://langdb.ai/app/providers/xai/ (browse by model, limited coverage)
- As last resort: Use Chrome DevTools MCP to access docs.x.ai
**Important:**
- Review the full model list for additions, removals, and price changes
- Minimize whitespace/comment changes, focus on content
- Preserve comments to make diffs easy to review
- Flag broken links or unexpected content
@@ -0,0 +1,66 @@
---
description: Verify model parameterSpecs match API-validated sweep data
argument-hint: openai | anthropic | gemini | xai (or empty for all)
---
# Verify LLM Parameters
Compare model `parameterSpecs` in definition files against API-validated sweep data.
If `$ARGUMENTS` provided, verify only that dialect, which includes reading the pair of sweep results and model defintions. Otherwise verify all four, and read the pairs in sequence.
## Files
**Sweep results** (source of truth for select parameters):
- `tools/develop/llm-parameter-sweep/llm-{dialect}-parameters-sweep.json`
By the time you see these files, the repo owner has already updated them via `tools/develop/llm-parameter-sweep/sweep.sh` (very long running, 15 min per vendor).
**Model definitions (source of truth for model defintions for the user and application, including constants, interfaces, supported parameters and sometimes allowed parameter values)**:
- OpenAI: `src/modules/llms/server/openai/models/openai.models.ts`
- Anthropic: `src/modules/llms/server/anthropic/anthropic.models.ts`
- Gemini: `src/modules/llms/server/gemini/gemini.models.ts`
- xAI: `src/modules/llms/server/openai/models/xai.models.ts`
## Task
The sweep data is the source of truth for allowed model parameter values or value ranges, and for the `fn` function-calling capability probe.
For each model in the sweep, verify the model definition exposes exactly those capabilities - no more, no less. This includes:
- The parameter is present in parameterSpecs
- The paramId variant covers exactly the values from the sweep, if applicable
- `LLM_IF_OAI_Fn` in `interfaces` matches `"roundtrip"` in the sweep's `fn` array (see below)
- etc.
Report models where the definition doesn't match the sweep.
## Parameter Mapping
Example parameter mapping. Note that new parameters may have been added to both the definition, and the sweep.
The objective of the sweep is to hint at model definition values, but the model definitions are what matters for Big-AGI,
and need to be carefully updated, otherwise thousands of clients may break.
| Dialect | Sweep Key | Model paramId |
|-----------|--------------------------|------------------------------|
| OpenAI | `oai-reasoning-effort` | `llmVndOaiEffort` |
| OpenAI | `oai-verbosity` | `llmVndOaiVerbosity` |
| OpenAI | `oai-image-generation` | `llmVndOaiImageGeneration` |
| OpenAI | `oai-web-search` | `llmVndOaiWebSearchContext` |
| Anthropic | `ant-effort` | `llmVndAntEffort` |
| Anthropic | `ant-thinking-budget` | `llmVndAntThinkingBudget` |
| Gemini | `gemini-thinking-level` | `llmVndGemEffort` |
| Gemini | `gemini-thinking-budget` | `llmVndGeminiThinkingBudget` |
| xAI | `xai-web-search` | `llmVndXaiWebSearch` |
## Function-Calling Capability (`fn`)
The sweep `fn` array is a capability probe (not a paramId). `"roundtrip"` is the authoritative signal - full tool-call -> response -> coherent follow-up. `LLM_IF_OAI_Fn` in the model's `interfaces` must track `"roundtrip"`: present iff present.
Flag:
- `"roundtrip"` in sweep but `LLM_IF_OAI_Fn` missing (or vice versa)
- `fn` contains `"auto"`/`"required"` without `"roundtrip"` - partial capability, call it out
## Output
Report first for every model the expected values from the sweep, then the actual values from the definition, then the mismatches.
Finally make one table for each dialect listing all models with mismatches and the specific issues.
+56
View File
@@ -0,0 +1,56 @@
---
description: Generate changelog bullets for big-agi.com/changes
argument-hint: date like "2026-01-10" or empty for auto-detect
---
Generate changelog bullets for a single entry in https://big-agi.com/changes
**Step 1: Find the starting date**
IMPORTANT: This repo rebases frequently, so commits are INTERLEAVED throughout history.
New commits can appear at line 10, 500, or 1800. Use AUTHOR DATE (`%ad`) to filter - it's preserved during rebases.
If `$ARGUMENTS` provided, use it as the cutoff date.
If NO argument:
1. Fetch https://big-agi.com/changes to get the most recent changelog date
2. Use that date as the cutoff
**Step 2: Get commits by author date**
Filter commits by author date to catch ALL new commits regardless of position in history:
```bash
# For commits after Jan 10, 2026 (adjust date pattern as needed)
git log --oneline --no-merges --format="%h %ad %s" --date=short | grep "2026-01-1[1-9]\|2026-01-2\|2026-02"
# Verify interleaving by checking line numbers
git log --oneline --no-merges --format="%h %ad %s" --date=short | grep -n "2026-01-1[1-9]"
```
The line numbers prove commits are scattered (e.g., lines 14, 638, 1156, 1803 = interleaved).
**Step 3: Write bullets**
Real examples from big-agi.com/changes:
- "Gemini 3 Flash support with 4-level thinking: high, medium, low, minimal"
- "Cloud Sync launched! - long awaited and top requested"
- "Deepseek V3.2 Speciale comes with almost Gemini 3 Pro performance but 20 times cheaper"
- "Anthropic Opus 4.5 with controls for effort (speed tradeoff), thinking budget, search"
- "Login with email, via magic link"
- "Mobile UX fixes for popups drag/interaction"
**Rules:**
1. **Order by importance** - most significant changes first, minor fixes last
2. **Feature-first, no verb prefixes** - "Gemini 3 support" not "Add Gemini 3 support"
3. **Model names lead** when it's about LLMs
4. **Specific details** - "4-level thinking: high, medium, low, minimal" not "multiple thinking levels"
5. **One-liners** - short, no fluff
6. **Consolidate commits** - 10 persona editor commits = 1 bullet
7. **No corporate speak** - no "enhanced", "streamlined", "robust", "leverage"
**Skip:** WIP, internal refactors, KB docs, automation, review cleanups, trivial fixes, deps bumps, CI changes.
**Output:** Just bullets, ready to paste. 2-5 bullets but adapt depending on scope, especially
in relation to the usual https://big-agi.com/changes entries.
+149
View File
@@ -0,0 +1,149 @@
---
description: Execute the Big-AGI release process
argument-hint: version like "2.0.4" or empty to auto-increment patch
---
Execute the release process for Big-AGI. Go step-by-step, waiting for user approval between major steps.
## Step 1: Determine Version
If `$ARGUMENTS` provided, use it. Otherwise, read `package.json` and increment patch version.
## Step 2: Gather Context
Before drafting, gather what changed:
1. `git log --oneline` since last release tag to see all commits
2. Fetch https://big-agi.com/changes to see what daily entries already covered
3. `gh issue list --state closed --search "closed:>LAST_RELEASE_DATE"` to find closed issues
4. Check auto-generated release notes (`gh release create --generate-notes --draft`) for community PRs and new contributors
## Step 3: Update Files
1. **package.json** - Update `version` field
2. **src/common/app.release.ts** - Increment `Monotonics.NewsVersion` (e.g., 203 → 204)
3. **src/apps/news/news.data.tsx** - Add new entry at top of `NewsItems` array
For the news entry, ask user for release name and key highlights.
**News entry style** - Draft is a starting point, user will refine:
- Models lead when model-heavy, grouped together
- Callout features get own bullet with colon explanation
- UX items grouped, minimal bold
- Fixes last, brief
- Release name stays subtle - don't oversell the theme
- Apply the draft, then let the user edit manually and re-read after - don't over-iterate
Use `<B>`, `<B issue={N}>`, `<B href='url'>`. Re-read file after user edits.
4. User runs `npm i` to update lockfile
## Step 4: README
Update `README.md`:
- Line ~46: Update model examples if new flagship models
- Line ~147: Add release bullet above previous version
**Style:** `- Open X.Y.Z: **Name** feature1, feature2, feature3`
## Step 5: Git Operations
User commits changes, then:
```bash
git tag vX.Y.Z
git push opensource vX.Y.Z
```
## Step 6: GitHub Release
Create release with `gh release create` using `--notes` (not `--body`).
**Structure** - discursive intro paragraph, then themed sections, not a generic "What's New" header:
```
# Big-AGI X.Y.Z - Name
### Theme tagline.
1-2 sentence discursive paragraph setting the release theme - what it means, not a feature list.
### Section Name (e.g., Models & Parameters)
- Bullet points for specifics
- Group by theme, not by commit order
### Vendor/Platform Section (when enough substance)
- Give a vendor its own section if 3+ related changes (e.g., Anthropic, AWS Bedrock)
### Also New
- Remaining features, scannable
## New Contributors
* @user made their first contribution (brief description) in PR_URL
**Full Changelog**: https://github.com/enricoros/big-AGI/compare/vPREV...vNEW
## Get Started
Available now at [big-agi.com](https://big-agi.com), via Docker, or self-host from source.
```
## Step 7: Changelog (big-agi.com/changes)
The Open release entry on big-agi.com/changes is lightweight - just 1-2 bullets announcing the stable release, since daily entries already covered the individual features. Use `/rel:changelog` to generate.
**Style:** `- Open X.Y.Z Name stable release on GitHub and Docker`
followed by 1 bullet summarizing what landed in the final days since the last daily entry.
## Step 8: Announcements
Draft for user to post:
**Twitter** - Thematic, not feature dumps. Talk about what it means, not what it lists:
```
Big-AGI Open X.Y.Z is out!
[Theme - e.g., "Lots of love to models: native support, latest protocols, total configuration - puts you in control."]
[One more angle, natural prose]
[Optional link]
```
**Discord** - Structured with bold headers:
```
## :partyblob: Big-AGI **Open** X.Y.Z
**Category:** Items
**Category:** Items
**More:** Count of commits/fixes
```
## Step 9: Cover Image Prompts
Offer cover image prompt alternatives for the release. Read past prompts from `news.data.tsx` comments (lines ~24-37) for the pattern.
**Pattern:** Always a capybara sculpture made of crystal glass, wearing rayban-like oversized black sunglasses. Each release has a unique theme/activity that symbolizes the release.
**Shared prefix:** `High-key white scene, very clean, hero framing. A close-up photo of a capybara sculpture made of crystal glass. The capybara wears rayban-like oversized black sunglasses.`
**Also offer future release concepts** tied to vision vectors from `kb/vision-inlined.md` (e.g., agency, inhabitation, sculpting, safe exploration).
## Tone Guide
**Good:**
- "Lots of love to models: native support, latest protocols, total configuration"
- "UX quality of life improvements, from Google Drive to message reorder"
- "Gemini 3 Flash support with 4-level thinking: high, medium, low, minimal"
**Bad:**
- "Rolling out the red carpet for top models!" (too salesy)
- "Enhanced and streamlined the robust model experience" (corporate speak)
- "Added support for Gemini 3 Flash model with multiple thinking levels" (verb prefix, vague)
## Reference
Find previous copy at:
- **GitHub releases:** https://github.com/enricoros/big-AGI/releases
- **News entries:** `src/apps/news/news.data.tsx`
- **README:** `README.md` release notes section
- **Changelog:** https://big-agi.com/changes
Match the existing tone - professional but human, specific not generic, features not marketing.
+115
View File
@@ -0,0 +1,115 @@
#!/usr/bin/env node
/**
* Parse Ollama models from HTML (sorted by newest for stable ordering)
*
* Usage:
* 1. Fetch HTML: curl -s "https://ollama.com/library?sort=newest" -o /tmp/ollama-newest.html
* 2. Parse: node .claude/scripts/parse-ollama-models.js
*
* Outputs: pipe-delimited format: modelName|pulls|capabilities|sizes
* Example: deepseek-r1|66200000|tools,thinking|1.5b,7b,8b,14b,32b,70b,671b
*
* Filtering rules:
* - Top 30 newest models are always included (regardless of pull count)
* - After top 30, only models with 50K+ pulls are included
* - Models with 'cloud' capability are always excluded
* - Models with 'embedding' capability are always excluded
*
* Pull counts are rounded to significant figures for stable diffs:
* - >=10M: round to 100K (e.g., 109,123,456 -> 109,100,000)
* - >=1M: round to 10K (e.g., 5,432,100 -> 5,430,000)
* - <1M: round to 1K (e.g., 88,700 -> 89,000)
*/
const fs = require('fs');
const os = require('os');
const path = require('path');
const htmlPath = process.argv[2] || path.join(os.tmpdir(), 'ollama-newest.html');
const TOP_N_ALWAYS_INCLUDE = 30;
const MIN_PULLS_THRESHOLD = 50000;
if (!fs.existsSync(htmlPath)) {
console.error(`Error: HTML file not found at ${htmlPath}`);
console.error('Please fetch it first with:');
console.error(' curl -s "https://ollama.com/library?sort=newest" -o /tmp/ollama-newest.html');
process.exit(1);
}
const html = fs.readFileSync(htmlPath, 'utf8');
// Split into model sections - each starts with <a href="/library/
const modelSections = html.split(/<a href="\/library\//);
const allParsedModels = [];
for (let i = 1; i < modelSections.length; i++) {
const section = modelSections[i].substring(0, 5000); // Large enough window to capture all data
// Extract model name (first quoted string)
const nameMatch = section.match(/^([^"]+)"/);
if (!nameMatch) continue;
const name = nameMatch[1];
// Extract pulls using x-test-pull-count
const pullsMatch = section.match(/x-test-pull-count>([^<]+)</);
let pulls = 0;
if (pullsMatch) {
const pullStr = pullsMatch[1].replace(/,/g, '');
if (pullStr.includes('M')) {
pulls = Math.floor(parseFloat(pullStr) * 1000000);
} else if (pullStr.includes('K')) {
pulls = Math.floor(parseFloat(pullStr) * 1000);
} else {
pulls = parseInt(pullStr);
}
}
// Extract capabilities (tools, vision, embedding, thinking, cloud)
const capabilities = [];
const capabilityRegex = /x-test-capability[^>]*>([^<]+)</g;
let capMatch;
while ((capMatch = capabilityRegex.exec(section)) !== null) {
capabilities.push(capMatch[1].trim());
}
// Extract sizes (1.5b, 7b, etc.)
const sizes = [];
const sizeRegex = /x-test-size[^>]*>([^<]+)</g;
let sizeMatch;
while ((sizeMatch = sizeRegex.exec(section)) !== null) {
sizes.push(sizeMatch[1].trim());
}
// Skip models with 'cloud' or 'embedding' capability
if (capabilities.includes('cloud') || capabilities.includes('embedding')) {
continue;
}
allParsedModels.push({ name, pulls: roundPulls(pulls), capabilities, sizes });
}
// Apply filtering: top 30 always included, rest need 50K+ pulls
const models = allParsedModels.filter((model, index) => {
return index < TOP_N_ALWAYS_INCLUDE || model.pulls >= MIN_PULLS_THRESHOLD;
});
/**
* Round pulls to significant figures for stable output.
* This reduces churn from daily fluctuations while preserving magnitude.
*/
function roundPulls(pulls) {
if (pulls >= 10000000) return Math.round(pulls / 100000) * 100000; // >=10M: round to 100K
if (pulls >= 1000000) return Math.round(pulls / 10000) * 10000; // >=1M: round to 10K
return Math.round(pulls / 1000) * 1000; // <1M: round to 1K
}
// Output in pipe-delimited format (in the order they appear on the page)
models.forEach(m => {
const caps = m.capabilities.join(',');
const tags = m.sizes.join(',');
console.log(`${m.name}|${m.pulls}|${caps}|${tags}`);
});
const topNCount = Math.min(TOP_N_ALWAYS_INCLUDE, allParsedModels.length);
const thresholdCount = models.length - topNCount;
console.error(`\nTotal models: ${models.length} (top ${topNCount} newest + ${thresholdCount} with ${MIN_PULLS_THRESHOLD / 1000}K+ pulls)`);
+49
View File
@@ -0,0 +1,49 @@
{
"permissions": {
"allow": [
"Bash(cat:*)",
"Bash(cp:*)",
"Bash(curl:*)",
"Bash(eslint:*)",
"Bash(find:*)",
"Bash(gh issue list:*)",
"Bash(gh issue view:*)",
"Bash(git branch:*)",
"Bash(git cherry-pick:*)",
"Bash(git describe:*)",
"Bash(git grep:*)",
"Bash(git log:*)",
"Bash(git ls-tree:*)",
"Bash(git mv:*)",
"Bash(git show:*)",
"Bash(grep:*)",
"Bash(head:*)",
"Bash(ls:*)",
"Bash(mkdir:*)",
"Bash(node:*)",
"Bash(npm install)",
"Bash(npm install:*)",
"Bash(npm run:*)",
"Bash(npx eslint:*)",
"Bash(npx tsc:*)",
"Bash(rg:*)",
"Bash(rm:*)",
"Bash(sed:*)",
"Bash(tail:*)",
"Bash(tree:*)",
"Bash(tsc:*)",
"Read(//tmp/**)",
"Skill(llms:update-models*)",
"WebFetch",
"WebFetch(domain:big-agi.com)",
"WebSearch",
"mcp__chrome-devtools",
"mcp__github",
"mcp__ide__getDiagnostics"
],
"deny": [
"Read(node_modules)",
"Read(node_modules/**)"
]
}
}
+15 -40
View File
@@ -1,43 +1,18 @@
# big-AGI non-code files
/docs/
/dist/
README.md
*
# Ignore build and log files
Dockerfile
/.dockerignore
!app/
!kb/
!pages/
!public/
!src/
!tools/
# Node build artifacts
/node_modules
/.pnp
.pnp.js
!*.mjs
!middleware_BASIC_AUTH.ts
!middleware.ts
!next.config.ts
!package*.json
!tsconfig.json
# next.js
/.next/
/out/
# production
/build
# versioning
.git/
.github/
# IDEs
.idea/
# debug
npm-debug.log*
yarn-debug.log*
yarn-error.log*
.pnpm-debug.log*
# local env files
.env*.local
# vercel
.vercel
# typescript
*.tsbuildinfo
next-env.d.ts
!LICENSE
!README.md
-3
View File
@@ -1,3 +0,0 @@
{
"extends": "next/core-web-vitals"
}
+70
View File
@@ -0,0 +1,70 @@
name: 🔥 Make AI Fix This
description: Bug, question, or feedback - AI analyzes and changes Big-AGI appropriately
labels: [ 'claude-triage' ]
body:
- type: markdown
attributes:
value: |
Thanks for opening an issue! Our AI will analyze it and change Big-AGI appropriately.
**What happens next:**
- AI searches the codebase and documentation
- You get a response, typically within 30 minutes
- Ticket gets follow-up and community votes
- type: textarea
attributes:
label: What's happening?
description: Describe the bug, feature request, or question. Be as detailed as you can.
placeholder: |
Bug example: "In Beam, Anthropic models seem to have search off..."
Model request: "Add Claude Opus 4.5 out today, see https://..."
Feature example: "Add the option to to save frequent prompt templates for reuse..."
validations:
required: true
- type: dropdown
attributes:
label: Where does this happen?
description: If this is a bug or issue, where are you experiencing it?
options:
- Big-AGI Pro (big-agi.com)
- Self-deployed from GitHub
- Docker deployment
- Local development
- Not applicable (question/feedback)
- Other
validations:
required: false
- type: dropdown
attributes:
label: Impact on your workflow
description: How does this affect your use of Big-AGI?
options:
- Blocking - Can't use Big-AGI
- High - Major feature broken
- Medium - Workaround exists
- Low - Minor inconvenience
- None - Just a question/suggestion
validations:
required: false
- type: textarea
attributes:
label: Environment (if applicable)
description: Device, OS, browser - only if reporting a bug
placeholder: |
Device: Macbook Pro M3
OS: macOS 15.2
Browser: Chrome 131
validations:
required: false
- type: textarea
attributes:
label: Additional context
description: Screenshots, error messages, or anything else that helps
placeholder: Paste screenshots or error messages here
validations:
required: false
+19 -2
View File
@@ -5,14 +5,29 @@ labels: [ 'type: bug' ]
body:
- type: markdown
attributes:
value: Thank you for reporting a bug.
value: Thank you for reporting a bug. Please help us by providing accurate environment information.
- type: dropdown
attributes:
label: Environment
description: (required) Where are you experiencing this issue?
options:
- Big-AGI Pro (big-agi.com)
- Self-deployed from GitHub
- Docker container (specify in description)
- Local development
- Other
validations:
required: true
- type: textarea
attributes:
label: Description
description: (required) Please provide a clear description. Please also provide the steps to reproduce.
description: (required) Please provide a clear description and **steps to reproduce**.
placeholder: 'Concise description + steps to reproduce.'
validations:
required: true
- type: textarea
attributes:
label: Device and browser
@@ -20,10 +35,12 @@ body:
placeholder: 'Device: (e.g., iPhone 16, Pixel 9, PC, Macbook...), OS: (e.g., iOS 17, Windows 12), Browser: (e.g., Chrome 119, Safari 18, Firefox..)'
validations:
required: true
- type: textarea
attributes:
label: Screenshots and more
placeholder: 'Attach screenshots, or add any additional context here.'
- type: checkboxes
attributes:
label: Willingness to Contribute
@@ -32,7 +32,6 @@ assignees: enricoros
- [ ] verify deployment on Vercel
- [ ] verify container on GitHub Packages
- [ ] update the GitHub release
- [ ] push as stable `git push opensource main:main-stable`
- Announce:
- [ ] Discord announcement
- [ ] Twitter announcement
@@ -51,7 +50,7 @@ To familiarize yourself with the application, the following are the Website and
```
- paste the URL: https://big-agi.com
- drag & drop: [README.md](https://raw.githubusercontent.com/enricoros/big-AGI/v2-dev/README.md)
- drag & drop: [README.md](https://raw.githubusercontent.com/enricoros/big-AGI/main/README.md)
```markdown
I am announcing a new version, 1.2.3.
+69
View File
@@ -0,0 +1,69 @@
version: 2
updates:
- package-ecosystem: docker
directory: /
schedule:
interval: weekly
commit-message:
prefix: "chore(deps)"
ignore:
- dependency-name: "node"
versions: [">=25", "<26"] # Node 25 breaks the build because of a dummy localStorage object
- package-ecosystem: github-actions
directory: /
schedule:
interval: weekly
commit-message:
prefix: "chore(deps)"
# Disabled npm updates for now - will need precise package pinning, as some packages changed behavior upstream
# - package-ecosystem: npm
# directory: /
# schedule:
# interval: weekly
# commit-message:
# prefix: "chore(deps)"
# cooldown:
# semver-patch: 3
# semver-minor: 7
# semver-major: 14
# # Ignore packages intentionally pinned due to upstream issues
# ignore:
# # Issue #857: v11.6+ breaks streaming; tried 11.4.4/11.6/11.7, only 11.5.1 works
# - dependency-name: "@trpc/*"
# versions: [">=11.5.1", "<12"]
# # Pinned during tRPC #857 debugging - may be safe to unpin, test first
# - dependency-name: "@tanstack/react-query"
# versions: [">=5.90.10", "<6"]
# # Pinned because 5.0.8 changes signatures so return set({ .. }) != void;
# - dependency-name: "zustand"
# versions: [">=5.0.7", "<6"]
# groups:
# next:
# patterns:
# - "@next/*"
# - "eslint-config-next"
# - "next"
# react:
# patterns:
# - "react"
# - "react-dom"
# - "@types/react"
# - "@types/react-dom"
# emotion:
# patterns:
# - "@emotion/*"
# mui:
# patterns:
# - "@mui/*"
# dnd-kit:
# patterns:
# - "@dnd-kit/*"
# prisma:
# patterns:
# - "@prisma/*"
# - "prisma"
# vercel:
# patterns:
# - "@vercel/*"
+59
View File
@@ -0,0 +1,59 @@
name: Claude Code DM
on:
issues:
types: [opened, assigned]
issue_comment:
types: [created]
pull_request_review:
types: [submitted]
pull_request_review_comment:
types: [created]
jobs:
claude-dm:
# Only allow repository owner to trigger DMs with @claude (blocks other users and bots)
if: |
github.actor == 'enricoros' &&
github.triggering_actor == 'enricoros' &&
((github.event_name == 'issues' && (contains(github.event.issue.body, '@claude') || contains(github.event.issue.title, '@claude'))) ||
(github.event_name == 'issue_comment' && contains(github.event.comment.body, '@claude')) ||
(github.event_name == 'pull_request_review' && contains(github.event.review.body, '@claude')) ||
(github.event_name == 'pull_request_review_comment' && contains(github.event.comment.body, '@claude')))
runs-on: ubuntu-latest
timeout-minutes: 30
permissions:
contents: write # Required for code creation and commits
issues: write
pull-requests: write
actions: read # Required for Claude to read CI results on PRs
id-token: write # required to use OIDC to authenticate to Claude Code API
steps:
- name: Checkout repository
uses: actions/checkout@v6
with:
fetch-depth: 0 # 1 -> 0: full history helps with git blame, etc.
- name: Run Claude Code DM Response
id: claude
uses: anthropics/claude-code-action@v1
with:
claude_code_oauth_token: ${{ secrets.CLAUDE_CODE_OAUTH_TOKEN }}
# Security: Only users with write access can trigger (DMs allow code execution)
# Note: contents:write permission enables code creation and commits
# This is an optional setting that allows Claude to read CI results on PRs
additional_permissions: |
actions: read
# Optional: Add claude_args to customize behavior and configuration
# See https://github.com/anthropics/claude-code-action/blob/main/docs/usage.md
# or https://docs.claude.com/en/docs/claude-code/cli-reference for available options
claude_args: |
--model claude-opus-4-6
--max-turns 100
--allowedTools "Edit,Read,Write,WebFetch,WebSearch,Bash(cat:*),Bash(cp:*),Bash(find:*),Bash(git branch:*),Bash(grep:*),Bash(ls:*),Bash(mkdir:*),Bash(npm run:*),Bash(gh issue:*),Bash(gh search:*),Bash(gh label:*),Bash(gh pr:*),SlashCommand"
+83
View File
@@ -0,0 +1,83 @@
name: Claude Code Auto-Triage Issues
on:
issues:
types: [ opened ]
jobs:
claude-issue-triage:
# Optional: Skip for bot users and direct mentions in the body (handled by claude-dm.yml)
if: |
github.event.issue.user.type != 'Bot' &&
!contains(github.event.issue.body, '@claude')
runs-on: ubuntu-latest
timeout-minutes: 30
permissions:
contents: read
issues: write
pull-requests: read # was write, but we're not altering PRs here
actions: read
id-token: write # required to use OIDC to authenticate to Claude Code API
steps:
- name: Checkout repository
uses: actions/checkout@v6
with:
fetch-depth: 0 # 1 -> 0: full history helps with git blame, etc.
- name: Analyze issue and provide help
uses: anthropics/claude-code-action@v1
with:
claude_code_oauth_token: ${{ secrets.CLAUDE_CODE_OAUTH_TOKEN }}
# Security: Allow any user to trigger triage (automated issue help is safe)
github_token: ${{ secrets.GITHUB_TOKEN }}
allowed_non_write_users: '*'
# track_progress: true # Enables tracking comments
show_full_output: ${{ github.event.repository.private }} # security: do not log verbosely in private repo
# This is an optional setting that allows Claude to read CI results on PRs
additional_permissions: |
actions: read
prompt: |
REPO: ${{ github.repository }}
ISSUE NUMBER: #${{ github.event.issue.number }}
A user has reported an issue. Please help them by:
1. Deep think about the issue:
**Understand the problem**: Analyze the issue description and any error messages
**Search for context**:
- Use the repository's CLAUDE.md for high level guidance and especially kb/ documentation
- Look in relevant code files, including kb/ documentation
**Use web search**: When potentially outside Big-AGI (e.g. user configuration), search the web for similar errors or related issues
**Provide a solution**:
- Provide multiple solutions if uncertain, and say so
- Analyze the code and suggest specific fixes with code examples
- If possible also suggest fixes or workarounds for immediate relief
- Reference specific files and line numbers
- Suggest workarounds for immediate relief if applicable
- Use web search to find similar issues and solutions
- Test selectively and even npm install and run build if needed to verify the solution
2. Always add the 'claude-triage' issue label to indicate this issue was triaged by Claude
3. Comment with:
- Very brief thank you note, if applicable
- Initial assessment
- Next steps or clarification needed
- Link duplicates if found
Remember: design values for this codebase: orthogonal features, features that generalize well, modularized and reusable code,
type-discriminated data, optimized code, zero maintenance burden. Minimize future pain, etc.
IMPORTANT: You are in READ-ONLY triage mode. Analyze and suggest solutions in your comment, but do NOT attempt to push code changes.
If you're uncertain, say so and suggest next steps.
Be welcoming, helpful, professional, solution-focused and no-BS.
# See https://github.com/anthropics/claude-code-action/blob/main/docs/usage.md
# or https://docs.claude.com/en/docs/claude-code/cli-reference for available options
claude_args: |
--model claude-opus-4-6
--max-turns 75
--allowedTools "Edit,Read,Write,WebFetch,WebSearch,Bash(cat:*),Bash(cp:*),Bash(find:*),Bash(git branch:*),Bash(grep:*),Bash(ls:*),Bash(mkdir:*),Bash(npm run:*),Bash(gh issue:*),Bash(gh search:*),Bash(gh label:*),Bash(gh pr:*),SlashCommand"
+134 -43
View File
@@ -12,38 +12,130 @@ name: Create and publish Docker images
on:
push:
branches:
- v2-dev
#- v1-dev # Disabled because this is not needed anymore
#- v1-stable # Disabled as the v* tag is used for stable releases
- main # Primary branch (Big-AGI Open)
tags:
- 'v*' # Trigger on version tags (e.g., v1.7.0)
- 'v2.*' # Stable releases (v2.0.0, v2.1.0, etc.)
env:
REGISTRY: ghcr.io
IMAGE_NAME: ${{ github.repository }}
concurrency:
group: ${{ github.workflow }}-${{ github.ref }}
cancel-in-progress: true
jobs:
build-and-push-image:
runs-on: ubuntu-latest
# Build job: runs on native runners for each platform (no QEMU emulation)
build:
strategy:
fail-fast: false
matrix:
include:
- platform: linux/amd64
runner: ubuntu-latest
- platform: linux/arm64
runner: ubuntu-24.04-arm
runs-on: ${{ matrix.runner }}
name: Build ${{ matrix.platform }}
timeout-minutes: 30
permissions:
contents: read
packages: write
security-events: write
steps:
- name: Checkout repository
uses: actions/checkout@v4
with:
fetch-depth: 0
- name: Prepare
run: |
platform=${{ matrix.platform }}
echo "PLATFORM_PAIR=${platform//\//-}" >> $GITHUB_ENV
echo "IMAGE_NAME_LC=${IMAGE_NAME,,}" >> $GITHUB_ENV
- name: Set up QEMU
uses: docker/setup-qemu-action@v3
- name: Checkout repository
uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
with:
fetch-depth: 1
- name: Set up Docker Buildx
uses: docker/setup-buildx-action@v3
uses: docker/setup-buildx-action@4d04d5d9486b7bd6fa91e7baf45bbb4f8b9deedd # v4.0.0
- name: Log in to the Container registry
uses: docker/login-action@v3
uses: docker/login-action@4907a6ddec9925e35a0a9e82d7399ccc52663121 # v4.1.0
with:
registry: ${{ env.REGISTRY }}
username: ${{ github.actor }}
password: ${{ secrets.GITHUB_TOKEN }}
- name: Extract metadata for Docker
id: meta
uses: docker/metadata-action@030e881283bb7a6894de51c315a6bfe6a94e05cf # v6.0.0
with:
images: ${{ env.REGISTRY }}/${{ env.IMAGE_NAME }}
labels: |
org.opencontainers.image.title=Big-AGI Open
org.opencontainers.image.description=Big-AGI Open - Multi-model AI workspace for experts who need to think broader, decide smarter, and build with confidence.
org.opencontainers.image.source=${{ github.server_url }}/${{ github.repository }}
org.opencontainers.image.documentation=https://big-agi.com
- name: Build and push by digest
id: build
uses: docker/build-push-action@bcafcacb16a39f128d818304e6c9c0c18556b85f # v7.1.0
with:
context: .
file: Dockerfile
platforms: ${{ matrix.platform }}
labels: ${{ steps.meta.outputs.labels }}
tags: ${{ env.REGISTRY }}/${{ env.IMAGE_NAME_LC }}
build-args: |
NEXT_PUBLIC_GA4_MEASUREMENT_ID=${{ secrets.GA4_MEASUREMENT_ID }}
NEXT_PUBLIC_BUILD_HASH=${{ github.sha }}
NEXT_PUBLIC_BUILD_REF_NAME=${{ github.ref_name }}
outputs: type=image,push-by-digest=true,name-canonical=true,push=true,oci-mediatypes=true
provenance: false
cache-from: type=gha,scope=${{ github.repository }}-${{ matrix.platform }}
cache-to: type=gha,scope=${{ github.repository }}-${{ matrix.platform }},mode=max
- name: Export digest
run: |
mkdir -p ${{ runner.temp }}/digests
digest="${{ steps.build.outputs.digest }}"
touch "${{ runner.temp }}/digests/${digest#sha256:}"
- name: Upload digest
uses: actions/upload-artifact@043fb46d1a93c77aae656e7c1c64a875d1fc6a0a # v7.0.1
with:
name: digests-${{ env.PLATFORM_PAIR }}
path: ${{ runner.temp }}/digests/*
if-no-files-found: error
retention-days: 1
# Merge job: combines platform-specific images into a unified multi-arch manifest
merge:
name: Merge manifests
runs-on: ubuntu-latest
timeout-minutes: 10
needs: build
permissions:
contents: read
packages: write
steps:
- name: Prepare
run: echo "IMAGE_NAME_LC=${IMAGE_NAME,,}" >> $GITHUB_ENV
- name: Download digests
uses: actions/download-artifact@3e5f45b2cfb9172054b4087a40e8e0b5a5461e7c # v8.0.1
with:
path: ${{ runner.temp }}/digests
pattern: digests-*
merge-multiple: true
- name: Set up Docker Buildx
uses: docker/setup-buildx-action@4d04d5d9486b7bd6fa91e7baf45bbb4f8b9deedd # v4.0.0
- name: Log in to the Container registry
uses: docker/login-action@4907a6ddec9925e35a0a9e82d7399ccc52663121 # v4.1.0
with:
registry: ${{ env.REGISTRY }}
username: ${{ github.actor }}
@@ -51,35 +143,34 @@ jobs:
- name: Extract metadata (tags, labels) for Docker
id: meta
uses: docker/metadata-action@v5
uses: docker/metadata-action@030e881283bb7a6894de51c315a6bfe6a94e05cf # v6.0.0
with:
images: ${{ env.REGISTRY }}/${{ env.IMAGE_NAME }}
tags: |
type=raw,value=development,enable=${{ github.ref == 'refs/heads/v2-dev' }} # For v2-dev branch
type=raw,value=stable,enable=${{ github.ref == 'refs/heads/v1-stable' }}
type=ref,event=tag # Use the tag name as a tag for tag builds
type=semver,pattern={{version}} # Generate semantic versioning tags for tag builds
type=sha,format=short,prefix=sha- # Just in case none of the above applies
labels: |
org.opencontainers.image.title=Big-AGI
org.opencontainers.image.description=Generative AI suite powered by state-of-the-art models
org.opencontainers.image.source=${{ github.server_url }}/${{ github.repository }}
org.opencontainers.image.documentation=https://big-agi.com
# Development: main branch
type=raw,value=development,enable=${{ github.ref == 'refs/heads/main' }}
- name: Build and push Docker image
uses: docker/build-push-action@v6
with:
context: .
file: Dockerfile
platforms: linux/amd64,linux/arm64
push: true
tags: ${{ steps.meta.outputs.tags }}
labels: ${{ steps.meta.outputs.labels }}
build-args: |
NEXT_PUBLIC_GA4_MEASUREMENT_ID=${{ secrets.GA4_MEASUREMENT_ID }}
# Enable build cache (future)
#cache-from: type=gha
#cache-to: type=gha,mode=max
# Enable provenance and SBOM (future)
#provenance: true
#sbom: true
# Latest: v2.x releases (safe default)
type=raw,value=latest,enable=${{ startsWith(github.ref, 'refs/tags/v2.') }}
# Stable: v2.x releases (alias)
type=raw,value=stable,enable=${{ startsWith(github.ref, 'refs/tags/v2.') }}
# Version tags (v2.0.0, 2.0.0)
type=ref,event=tag
type=semver,pattern={{version}}
- name: Create manifest list and push
working-directory: ${{ runner.temp }}/digests
run: |
docker buildx imagetools create \
$(jq -cr '.tags | map("-t " + .) | join(" ")' <<< "$DOCKER_METADATA_OUTPUT_JSON") \
--annotation='index:org.opencontainers.image.title=Big-AGI Open' \
--annotation='index:org.opencontainers.image.description=Big-AGI Open - Multi-model AI workspace for experts who need to think broader, decide smarter, and build with confidence.' \
--annotation='index:org.opencontainers.image.source=${{ github.server_url }}/${{ github.repository }}' \
--annotation='index:org.opencontainers.image.documentation=https://big-agi.com' \
$(printf '${{ env.REGISTRY }}/${{ env.IMAGE_NAME_LC }}@sha256:%s ' *)
- name: Inspect image
run: |
docker buildx imagetools inspect ${{ env.REGISTRY }}/${{ env.IMAGE_NAME_LC }}:${{ steps.meta.outputs.version }}
+3
View File
@@ -53,3 +53,6 @@ next-env.d.ts
.env*.local
/.run/dev (ENV).run.xml
/src/modules/3rdparty/aider/scratch*
# Ignore temporary CC files
/tmpclaude*
-3
View File
@@ -1,3 +0,0 @@
overrides=@mui/material@^5.0.0:
dependencies:
@mui/material: replaced-by=@mui/joy
+1
View File
@@ -0,0 +1 @@
24
Symlink
+1
View File
@@ -0,0 +1 @@
CLAUDE.md
+239
View File
@@ -0,0 +1,239 @@
# CLAUDE.md
Guidance to Claude Code when working with code in this repository.
## Architecture Overview
Big-AGI is a Next.js 15 application with a sophisticated modular architecture built for professional AI interactions.
### Development Commands
Dev servers may be already running on ports 3000, 3001, 3002, or 3003 (not always this app - other projects may occupy these ports). Never start or stop dev servers, let the user do it.
```bash
# Validate (~5s, safe while dev server runs, do NOT use `next build` ~45s for same checks)
tsc --noEmit --pretty && npm run lint # Type check (~3.5s) + ESLint (~2s)
eslint src/path/to/file.ts # Lint specific file
# Full build (~60s+, only when suspecting runtime/bundle issues)
npm run build # next build runs compile+lint+types but stops at first type-error file; tsc shows all at once
# Database & External Services
# npm run supabase:local-update-types # Generate TypeScript types
# npm run stripe:listen # Listen for Stripe webhooks
```
### Git/GitHub remotes
The `gh` command is available to interact with GitHub from the terminal, but **NEVER PUSH TO ANY BRANCH**. The user manages all 'write' git operations.
- `opensource` -> `enricoros/big-AGI` (public, default branch: `main`, MIT) - community issues/PRs/releases
- `private` -> `big-agi/big-agi-private` (private, default branch: `dev`) - main dev repo with `dev`->`staging`->`prod` pipeline
- **Always use `git mv` instead of `mv`** when renaming or moving files - preserves git history tracking
- **NEVER run `git stash`** - it causes work loss
### Core Directory Structure
You are started from the root of the repository (i.e. where the git folder is or scripts should be run from).
**ISSUE ALL COMMANDS FROM THE ROOT, OMITTING 'cd' COMMANDS. DO NOT CHAIN CD AND OTHER COMMANDS**
**NEVER RUN COMPOUND `cd` COMMANDS LIKE `cd some-folder && command` - ONLY RUN `command` FROM THE ROOT, ALWAYS.**
The directory structure is as follows:
```
/app/api/ # Next.js App Router (API routes only, mostly -> /src/server/)
/pages/ # Next.js Pages Router (file-based, mostly -> /src/apps/)
/src/
├── apps/ # Feature applications (self-contained modules)
├── modules/ # Reusable business logic and integrations
├── common/ # Shared infrastructure and utilities
└── server/ # Backend API layer with tRPC
/kb/ # Knowledge base for modules, architectures
```
### Key Technologies
- **Frontend**: Next.js 15, React 18, Material-UI Joy, Emotion (CSS-in-JS)
- **State Management**: Zustand with localStorage/IndexedDB (single cell) persistence
- **API Layer**: tRPC with TanStack React Query for type-safe communication
- **Runtime**: Edge Runtime for AI operations, Node.js for data processing
### "Apps" Architecture Pattern
Each app in `/src/apps/` is a self-contained feature module:
- Main component (`App*.tsx`)
- Local state store (`store-app-*.ts`)
- Feature-specific components and layouts
- Runtime configurations
Example apps: `chat/`, `call/`, `beam/`, `draw/`, `personas/`, `settings-modal/`
### Modules Architecture Pattern
Modules in `/src/modules/` provide reusable business logic:
- **`aix/`** - AI communication framework for real-time streaming
- **`beam/`** - Multi-model AI reasoning system (scatter/gather pattern)
- **`blocks/`** - Content rendering (markdown, code, images, etc.)
- **`llms/`** - Language model abstraction supporting 20+ vendors
### Key Subsystems & Their Patterns
#### AIX - Real-time AI Communication
**Location**: `/src/modules/aix/`
**Pattern**: Client-server streaming architecture with provider abstraction
- **Client** -> tRPC -> **Server** -> **AI Providers**
- Handles streaming/non-streaming responses with batching and error recovery
- Particle-based streaming: `AixWire_Particles` -> `ContentReassembler` -> `DMessage`
- Provider-agnostic through adapter pattern (OpenAI, Anthropic, Gemini protocols)
#### Beam - Multi-Model Reasoning
**Location**: `/src/modules/beam/`
**Pattern**: Scatter/Gather for parallel AI processing
- **Scatter**: Multiple models (rays) process input in parallel
- **Gather**: Fusion algorithms combine outputs
- Real-time UI updates via vanilla Zustand stores
- BeamStore per conversation via ConversationHandler
#### Conversation Management
**Location**: `/src/common/stores/chat/` and `/src/common/chat-overlay/`
**Pattern**: Overlay architecture with handler per conversation
- `ConversationHandler` orchestrates chat, beam, ephemerals
- Per-chat stores: `PerChatOverlayStore` + `BeamStore`
- Message structure: `DMessage` -> `DMessageFragment[]`
- Supports multi-pane with independent conversation states
#### Layout System ("Optima")
The Optima layout system provides:
- **Responsive design** adapting desktop/mobile
- **Drawer(left)/Toolbar/Panel(right)** composition
- **Portal-based rendering** for flexible component placement
Located in `/src/common/layout/optima/`
### Storage System
Big-AGI uses a local-first architecture with Zustand + IndexedDB:
- **Zustand** stores for in-memory state management
- **localStorage** for persistent settings/all storage (via Zustand persist middleware)
- **IndexedDB** for persistent chat-only storage (via Zustand persist middleware) on a single key-val cell
- **Local-first** architecture with offline capability
Key storage patterns:
- Stores use `createIDBPersistStorage()` for IndexedDB persistence
- Version-based migrations handle data structure changes
- Partialize/merge functions control what gets persisted
- Rehydration logic repairs and upgrades data on load
Located in `/src/common/stores/` with stores like:
- `chat/store-chats.ts`: Conversations and messages
- `llms/store-llms.ts`: Model configurations
### State Management Patterns
1. **Global Stores** (Zustand with IndexedDB persistence)
- `store-chats`: Conversations and messages
- `store-llms`: Model configurations
- `store-ux-labs`: UI preferences and labs features
- **Zustand pattern**: Always wrap multi-property selectors with `useShallow` from `zustand/react/shallow` to prevent re-renders on reference changes
2. **Per-Instance Stores** (Vanilla Zustand)
- `store-beam_vanilla`: Beam scatter/gather state
- `store-perchat_vanilla`: Chat overlay state
- `store-attachment-drafts_vanilla`: Attachment drafts
- High-performance, no React integration
3. **Module Stores**
- Feature-specific configuration and state
- Example: `store-module-beam`, `store-module-t2i`
### User Flows & Interdependencies
#### Chat Message Flow
1. User input -> `Composer` -> `DMessage` creation
2. `ConversationHandler.messageAppend()` -> Store update
3. `_handleExecute()` / `ConversationHandler.executeChatMessages()` -> AIX client request
4. AIX streaming -> `ContentReassembler` -> UI updates
5. Zustand auto-persistence -> IndexedDB
#### Beam Multi-Model Flow
1. User triggers Beam -> `BeamStore.open()` state update
2. Scatter: Parallel `aixChatGenerateContent()` to N models
3. Real-time ray updates -> UI progress
4. Gather: User selects fusion -> Combined output
5. Result -> New message in conversation
### Development Patterns
#### TypeScript & Code Quality
- Type-safe through strict TypeScript interfaces
- Clear interface-first approach for modules and components
- Use latest TypeScript 5.9+ features
- Use forward-looking patterns to minimize future refactors (e.g., discriminated unions, `satisfies` operator, as const assertions)
- Type guards and exhaustiveChecks for robustness
- Type inference where possible
- Runtime validation with Zod schemas for API inputs/outputs (usually server-side, with the client importing as types the inferred types)
#### Module Integration
- Modules register with central registries (e.g., `vendors.registry.ts`)
- Configuration objects define module behavior
#### API Patterns
- **tRPC routers** for type-safe API endpoints
- **Zod schemas** for runtime validation
- **tRPC procedures middleware** for authorization and logging (authorization is on a httpOnly cookie)
- **Edge functions** for performance-critical operations
#### Security Considerations
- API keys in environment variables only (server-side); on the client they're in localStorage for now, but we want to move away from this
- XSS protection through proper content escaping
#### Writing Style
- **Never use emdashes (—).** Use normal dashes (-) instead, in all generated text, code comments, and documentation.
## Common Development Tasks
### Testing & Quality
- Run `npm run lint` before committing
- Type-check with `tsc --noEmit`
- Test critical user flows manually
### Debugging Storage Issues
- Check IndexedDB: DevTools -> Application -> IndexedDB -> `app-chats`
- Monitor Zustand state: Use Zustand DevTools
- Check migration logs in console during rehydration
## Server Architecture
The server uses a split architecture with two tRPC routers:
### Edge Network (`trpc.router-edge`)
Distributed edge runtime for low-latency AI operations:
- **AIX** [1] - AI streaming and communication
- **LLM Routers** [1] - Vendor-specific operations such as list models (OpenAI, Anthropic, Gemini, Ollama)
- **Speex** [1] - Unified TTS router (ElevenLabs, Inworld, and other TTS vendors)
- **External Services** - Google Search, YouTube transcripts
[1]: also supports client-side fetch (CSF) via client-side inclusion (rebundling with stubs),
for direct browser-to-API communication when possible (CORS), to reduce latency and network barriers
Located at `/src/server/trpc/trpc.router-edge.ts`
### Cloud Network (`trpc.router-cloud`)
Centralized server for data processing operations:
- **Browse** - Web scraping and content extraction
- **Trade** - Import/export functionality (ChatGPT, markdown, JSON)
Located at `/src/server/trpc/trpc.router-cloud.ts`
**Key Pattern**: Edge runtime for AI (fast, distributed), Cloud runtime for data ops (centralized, Node.js)
@kb/KB.md
@kb/vision-inlined.md
As a side note, the product tiers (independent, non-VC-funded) are: **Open** (self-host, MIT) · **Free** (big-agi.com) · **Pro** (paid, includes Sync + backup). All tiers use the user's own API keys.
+36 -11
View File
@@ -1,6 +1,9 @@
# syntax=docker/dockerfile:1
# check=skip=CopyIgnoredFile
# Base
FROM node:22-alpine AS base
ENV NEXT_TELEMETRY_DISABLED 1
FROM node:24-alpine AS base
ENV NEXT_TELEMETRY_DISABLED=1
# Dependencies
FROM base AS deps
@@ -14,7 +17,7 @@ COPY src/server/prisma ./src/server/prisma
RUN sh -c '[ ! -e /lib/libssl.so.3 ] && ln -s /usr/lib/libssl.so.3 /lib/libssl.so.3 || echo "Link already exists"'
# Install dependencies, including dev (release builds should use npm ci)
ENV NODE_ENV development
ENV NODE_ENV=development
RUN npm ci
@@ -22,20 +25,37 @@ RUN npm ci
FROM base AS builder
WORKDIR /app
# Deployment type marker
ENV NEXT_PUBLIC_DEPLOYMENT_TYPE=docker
# Optional build version arguments at build time
ARG NEXT_PUBLIC_BUILD_HASH
ENV NEXT_PUBLIC_BUILD_HASH=${NEXT_PUBLIC_BUILD_HASH}
ARG NEXT_PUBLIC_BUILD_REF_NAME
ENV NEXT_PUBLIC_BUILD_REF_NAME=${NEXT_PUBLIC_BUILD_REF_NAME}
# Optional argument to configure GA4 at build time (see: docs/deploy-analytics.md)
ARG NEXT_PUBLIC_GA4_MEASUREMENT_ID
ENV NEXT_PUBLIC_GA4_MEASUREMENT_ID=${NEXT_PUBLIC_GA4_MEASUREMENT_ID}
# Optional argument to configure PostHog at build time (see: docs/deploy-analytics.md)
ARG NEXT_PUBLIC_POSTHOG_KEY
ENV NEXT_PUBLIC_POSTHOG_KEY=${NEXT_PUBLIC_POSTHOG_KEY}
# Optional argument to configure Google Drive Picker at build time (can reuse AUTH_GOOGLE_ID value)
ARG NEXT_PUBLIC_GOOGLE_DRIVE_CLIENT_ID
ENV NEXT_PUBLIC_GOOGLE_DRIVE_CLIENT_ID=${NEXT_PUBLIC_GOOGLE_DRIVE_CLIENT_ID}
# Copy development deps and source
COPY --from=deps /app/node_modules ./node_modules
COPY . .
# Build the application
ENV NODE_ENV production
ENV NODE_ENV=production
RUN npm run build
# Reduce installed packages to production-only
RUN npm prune --production
RUN npm prune --omit=dev
# Runner
@@ -43,18 +63,23 @@ FROM base AS runner
WORKDIR /app
# As user
RUN addgroup --system --gid 1001 nodejs
RUN adduser --system --uid 1001 nextjs
RUN addgroup --system --gid 1001 nodejs \
&& adduser --system --uid 1001 nextjs \
&& apk add --no-cache openssl
# Copy Built app
COPY --from=builder --chown=nextjs:nodejs /app/public ./public
COPY --from=builder --chown=nextjs:nodejs /app/.next ./.next
COPY --from=builder --chown=nextjs:nodejs /app/node_modules ./node_modules
COPY --from=builder --chown=nextjs:nodejs /app/src/server/prisma ./src/server/prisma
# Instead of `COPY --from=builder --chown=nextjs:nodejs /app/.next ./.next`, we only extract some parts, excluding .next/cache which is build time only:
COPY --from=builder --chown=nextjs:nodejs /app/.next/BUILD_ID ./.next/
COPY --from=builder --chown=nextjs:nodejs /app/.next/static ./.next/static
COPY --from=builder --chown=nextjs:nodejs /app/.next/server ./.next/server
COPY --from=builder --chown=nextjs:nodejs /app/.next/types ./.next/types
COPY --from=builder --chown=nextjs:nodejs /app/.next/*.json ./.next/
# Minimal ENV for production
ENV NODE_ENV production
ENV PATH $PATH:/app/node_modules/.bin
ENV NODE_ENV=production
# Run as non-root user
USER nextjs
@@ -63,4 +88,4 @@ USER nextjs
EXPOSE 3000
# Start the application
CMD ["next", "start"]
CMD ["/app/node_modules/.bin/next", "start"]
+1 -1
View File
@@ -1,6 +1,6 @@
MIT License
Copyright (c) 2023-2024 Enrico Ros
Copyright (c) 2023-2026 Enrico Ros
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
+263 -111
View File
@@ -1,41 +1,196 @@
# BIG-AGI 🧠✨
<div align="center">
Welcome to big-AGI, the AI suite for professionals that need function, form,
simplicity, and speed. Powered by the latest models from 12 vendors and
open-source servers, `big-AGI` offers best-in-class Chats,
[Beams](https://github.com/enricoros/big-AGI/issues/470),
and [Calls](https://github.com/enricoros/big-AGI/issues/354) with AI personas,
visualizations, coding, drawing, side-by-side chatting, and more -- all wrapped in a polished UX.
<img width="256" height="256" alt="Big-AGI Logo" src="https://big-agi.com/assets/logo-bright-github.svg" />
Stay ahead of the curve with big-AGI. 🚀 Pros & Devs love big-AGI. 🤖
<h1><a href="https://big-agi.com">Big-AGI</a></h1>
[![Official Website](https://img.shields.io/badge/BIG--AGI.com-%23096bde?style=for-the-badge&logo=vercel&label=launch)](https://big-agi.com)
[![Use Free ⋅ Go Pro](https://img.shields.io/badge/Use_Free-Get_Pro-d5ec31?style=for-the-badge&logo=rocket&logoColor=white&labelColor=000)](https://big-agi.com)
[![Deploy on Docker](https://img.shields.io/badge/Self--Host-Docker-blue?style=for-the-badge&logo=docker&logoColor=white&labelColor=000)](https://github.com/enricoros/big-AGI/pkgs/container/big-agi)
[![Deploy on Vercel](https://img.shields.io/badge/Vercel-Deploy-blue?style=for-the-badge&logo=vercel&logoColor=white&labelColor=000)](https://vercel.com/new/clone?repository-url=https://github.com/enricoros/big-agi)
[![Discord](https://img.shields.io/discord/1098796266906980422?style=for-the-badge&label=Discord&logo=discord&logoColor=white&labelColor=000000&color=purple)](https://discord.gg/MkH4qj2Jp9)
<br/>
[![GitHub Monthly Commits](https://img.shields.io/github/commit-activity/m/enricoros/big-agi?style=for-the-badge&x=3&logo=github&logoColor=white&label=commits&labelColor=000&color=green)](https://github.com/enricoros/big-agi/commits)
[![GHCR Pulls](https://img.shields.io/badge/ghcr.io-800k_dl-12b76a?style=for-the-badge&logo=Xdocker&logoColor=white&labelColor=000&color=A8E6CF)](https://github.com/enricoros/big-AGI/pkgs/container/big-agi)
[![Contributors](https://img.shields.io/github/contributors/enricoros/big-agi?style=for-the-badge&x=2&logo=Xgithub&logoColor=white&label=cooks&labelColor=000&color=A8E6CF)](https://github.com/enricoros/big-AGI/graphs/contributors)
[![License: MIT](https://img.shields.io/badge/License-MIT-A8E6CF?style=for-the-badge&labelColor=000)](https://opensource.org/licenses/MIT)
<br/>
> 🚀 Big-AGI 2 is launching Q4 2024. Be the first to experience it before the public release.
>
> 👉 [Apply for Early Access](https://y2rjg0zillz.typeform.com/to/ZSADpr5u?utm_source=gh-2&utm_medium=readme&utm_campaign=ea2)
[![Open an Issue](https://img.shields.io/badge/Open_Issue-AI_Will_Help-ff8c00?style=for-the-badge&logo=fireship&logoColor=fff&labelColor=8b0000)](https://github.com/enricoros/big-agi/issues/new?template=ai-triage.yml)
Or fork & run on Vercel
[//]: # ([![Uptime Robot ratio &#40;30 days&#41;]&#40;https://img.shields.io/uptimerobot/ratio/m801796948-868b22ed7ceaa0acac4dc765?style=for-the-badge&labelColor=000&color=green&#41;]&#40;https://stats.uptimerobot.com/59MXcnmjrM&#41;)
[//]: # ([![Open Version]&#40;https://img.shields.io/github/v/release/enricoros/big-AGI?label=Open+Release&style=flat-square&logo=github&logoColor=white&labelColor=000&#41;]&#40;https://github.com/enricoros/big-AGI/releases/latest&#41;)
[//]: # (![GitHub Stars]&#40;https://img.shields.io/github/stars/enricoros/big-agi?style=flat-square&logo=github&logoColor=white&labelColor=000&color=yellow&#41;)
[//]: # ([![GitHub Forks]&#40;https://img.shields.io/github/forks/enricoros/big-agi?style=flat-square&logo=github&logoColor=white&labelColor=000&#41;]&#40;#&#41;)
[//]: # ([![Follow on X]&#40;https://img.shields.io/twitter/follow/enricoros?style=flat-square&logo=X&logoColor=white&labelColor=000&color=000&#41;]&#40;https://x.com/enricoros&#41;)
[![Deploy with Vercel](https://vercel.com/button)](https://vercel.com/new/clone?repository-url=https%3A%2F%2Fgithub.com%2Fenricoros%2Fbig-AGI&env=OPENAI_API_KEY&envDescription=Backend%20API%20keys%2C%20optional%20and%20may%20be%20overridden%20by%20the%20UI.&envLink=https%3A%2F%2Fgithub.com%2Fenricoros%2Fbig-AGI%2Fblob%2Fmain%2Fdocs%2Fenvironment-variables.md&project-name=big-AGI)
</div>
### New Version
<br/>
This repository contains two main versions:
# Big-AGI Open 🧠
- Big-AGI 2: next-generation, bringing the most advanced AI experience
- `v2-dev`: V2 development branch, the exciting one, future default
- Big-AGI Stable: as deployed on big-agi.com
- `v1-dev`: V1 development branch (this branch)
- `v1-stable`: Current stable version
This is the open-source foundation of **Big-AGI**, ___the multi-model AI workspace for experts___.
Note: After the V2 release in Q4, `v2-dev` will become the default branch and `v1-dev` will reach EOL.
Big-AGI is the multi-model AI workspace for experts: Engineers architecting systems. Founders making decisions. Researchers validating hypotheses.
You need to think broader, decide faster, and build with confidence, then you need Big-AGI.
### Quick links: 👉 [roadmap](https://github.com/users/enricoros/projects/4/views/2) 👉 [installation](docs/installation.md) 👉 [documentation](docs/README.md)
It comes packed with **world-class features** like Beam, and is praised for its **best-in-class AI chat UX**.
**As an independent, non-VC-funded project, Pro subscriptions at $10.99/mo fund development for everyone, including the free and open-source tiers.**
### What's New in 1.16.1...1.16.8 · Sep 13, 2024 (patch releases)
![LLM Vendors](https://img.shields.io/badge/20+_LLM_Services-500+_Models-black?style=for-the-badge&logo=anthropic&logoColor=white&labelColor=purple)&nbsp;
[![Feature Beam](https://img.shields.io/badge/AI--Validation-BEAM-000?style=for-the-badge&labelColor=purple)](https://big-agi.com/beam)&nbsp;
[![Feature Inspector](https://img.shields.io/badge/Expert_Mode-AI_Inspector-000?style=for-the-badge&labelColor=purple)](https://big-agi.com/inspector)
- 1.16.8: OpenAI ChatGPT-4o Latest (o1-preview and o1-mini are supported in Big-AGI 2)
### What makes Big-AGI different:
**Intelligence**: with [Beam & Merge](https://big-agi.com/beam) for multi-model de-hallucination, native search, and bleeding-edge AI models like Opus 4.7, Nano Banana Pro, Kimi K2.6 or GPT 5.4 -
**Control**: with personas, data ownership, requests inspection, unlimited usage with API keys, and *no vendor lock-in* -
and **Speed**: with a local-first, over-powered, zero-latency, madly optimized web app.
<table>
<tr>
<td align="center" width="25%">
<b>🧠 Intelligence</b><br/>
<img src="https://img.shields.io/badge/Multi--Model-Trust-4285F4?style=for-the-badge" alt="Multi-Model"/>
</td>
<td align="center" width="25%">
<b>✨ Experience</b><br/>
<img src="https://img.shields.io/badge/Clean-UX-34A853?style=for-the-badge" alt="Clean UX"/>
</td>
<td align="center" width="25%">
<b>⚡ Performance</b><br/>
<img src="https://img.shields.io/badge/Zero-Latency-EA4335?style=for-the-badge" alt="Zero Latency"/>
</td>
<td align="center" width="25%">
<b>🔒 Control</b><br/>
<img src="https://img.shields.io/badge/No-Lock--in-FBBC04?style=for-the-badge" alt="No Lock-in"/>
</td>
</tr>
<tr>
<td align="center" valign="top">
Beam & Merge<br/>
No context junk<br/>
Purest AI outputs
</td>
<td align="center" valign="top">
Flow-state interface<br/>
Highly customizable<br/>
Best-in-class UX
</td>
<td align="center" valign="top">
Local-first<br/>
Highly parallel<br/>
Madly optimized
</td>
<td align="center" valign="top">
No vendor lock-in<br/>
Your API keys<br/>
AI Inspector
</td>
</tr>
</table>
### Who uses Big-AGI:
Loved by engineers, founders, researchers, self-hosters, and IT departments for its power, reliability, and transparency.
<img width="830" height="370" alt="image" src="https://github.com/user-attachments/assets/513c4f77-0970-4a56-b23b-1416c8246174" />
Choose Big-AGI because you don't need another clone or slop - you need an AI tool that scales with you.
### Show me a screenshot:
Sure - here is real-world screeengrab as I'm writing this, while running a Beam to extract SVG from an image with Sonnet 4.5, Opus 4.1, GPT 5.1, Gemini 2.5 Pro, Nano Banana, etc.
<img alt="Real-world screen capture as of Nov 15 2025, 2am" src="https://github.com/user-attachments/assets/853f4160-27cb-4ac9-826b-402f1e63d4af" />
## Get Started
| Tier | Best For | What You Get | Setup |
|------------------------------------------------------|-------------------|---------------------------------------------------------------|-------------|
| Big-AGI Open (self-host) | **IT** | First to get new models support. Maximum control and privacy. | 5-30 min |
| [big-agi.com](https://big-agi.com) Free | **Everyone** | Full core experience, improved Beam, new Personas, best UX. | **2 min**\* |
| **[big-agi.com](https://big-agi.com) Pro** $10.99/mo | **Professionals** | Everything + **Sync** across unlimited devices + 1GB storage | **2 min**\* |
\*: **Configuration requires your API keys**. *Big-AGI does not charge for model usage or limit your access*.
**Why Pro?** As an independent project, Pro subscriptions fund all development. Early subscribers shape the roadmap directly.
[![Use Free ⋅ Go Pro](https://img.shields.io/badge/Use_Free-Get_Pro-d5ec31?style=for-the-badge&logo=rocket&logoColor=white&labelColor=000)](https://big-agi.com)
**Self-host and developers** (full control)
- Develop locally or self-host with Docker on your own infrastructure [guide](docs/installation.md)
- Or fork & run on Vercel:
[![Deploy on Vercel](https://img.shields.io/badge/Deploy-black?style=for-the-badge&logo=vercel&logoColor=white&labelColor=000)](https://vercel.com/new/clone?repository-url=https%3A%2F%2Fgithub.com%2Fenricoros%2Fbig-AGI&env=OPENAI_API_KEY&envDescription=Backend%20API%20keys%2C%20optional%20and%20may%20be%20overridden%20by%20the%20UI.&envLink=https%3A%2F%2Fgithub.com%2Fenricoros%2Fbig-AGI%2Fblob%2Fmain%2Fdocs%2Fenvironment-variables.md&project-name=big-AGI)
[//]: # (**For the latest Big-AGI:**)
[//]: # (- [**Big-AGI Open**]&#40;https://github.com/enricoros/big-AGI/tree/main&#41; - Open Source, latest models and features &#40;main branch&#41;)
[//]: # (- [**Big-AGI Pro**]&#40;https://big-agi.com&#41; - Hosted with Cloud Sync)
---
## Our Philosophy
We're an independent, non-VC-funded project with a simple belief: **AI should elevate you, not replace you**.
This is why we built Big-AGI to be **local-first**, madly optimized to 0-latency, launched multi-model first to
defeat hallucinations, designed Beam around the **humans in the loop**, re-wrote frameworks and abstractions
so you **are not vendor locked-in**, and obsessed over a powerful UI that works, just works.
NOTE: this is a powerful tool - if you need a toy UI or clone, this ain't it.
---
## Release Notes
👉 **[See the Live Release Notes](https://big-agi.com/changes)**
- Open 2.0.4: **Hyper Params** **Opus 4.6**, **GPT-5.4**, **Gemini 3.1 Pro**, AWS Bedrock, parameter accuracy, Anthropic continuation/Fast mode
- Open 2.0.3: **Red Carpet** **Kimi K2.5**, **Gemini 3 Flash**, **GPT 5.2**, Google Drive, Inworld, Novita.ai, Speech/UX improvements
- Open 2.0.2: **Speex** multi-vendor speech synthesis, **Opus 4.5**, **Gemini 3 Pro**, **Nano Banana Pro**, **Grok 4.1**, **GPT-5.1**, **Kimi K2** + 280 fixes
### What's New in 2.0 · Oct 31, 2025 · Open
- **Big-AGI Open** is ready and more productive and faster than ever, with:
- **Beam 2**: multi-modal, program-based, follow-ups, save presets
- Top-notch AI models support including **agentic models** and **reasoning models**
- **Image Generation** and editing with Nano Banana and gpt-image-1
- **Web Search** with citations for supported models
- **UI** & Mobile UI overhaul with peeking and side panels
- And all of the [Big-AGI 2 changes](https://github.com/enricoros/big-AGI/issues/567#issuecomment-2262187617) and more
- Built for the future, madly optimized
<img width="830" height="385" alt="image" src="https://github.com/user-attachments/assets/ad52761d-7e3f-44d8-b41e-947ce8b4faa1" />
#### **Open** links: 👉 [changelog](https://big-agi.com/changes) 👉 [installation](docs/installation.md) 👉 [roadmap](https://github.com/users/enricoros/projects/4/views/2) 👉 [documentation](docs/README.md)
**For teams and institutions:** Need shared prompts, SSO, or managed deployments? Reach out at enrico@big-agi.com. We're actively collecting requirements from research groups and IT departments.
<details>
<summary>5,000 Commits Milestone</summary>
Hit 5k commits last week. That's a lot of code.
Recent work has been intense:
- Chain of thought reasoning across multiple LLMs: **OpenAI o3** and o1, **DeepSeek R1**, **Gemini 2.0 Flash Thinking**, and more
- Beam is real - ~35% of our users run it daily to compare models
- New AIX framework lets us scale features we couldn't before
- UI is faster than ever. Like, terminal-fast
The new architecture is solid and the speed improvements are real.
![5000e-830px](https://github.com/user-attachments/assets/42f7420b-9331-421b-9a18-2e653aaa7d9b)
</details>
<details>
<summary>What's New in 1.16.1...1.16.13 · (patch releases)</summary>
- 1.16.13: Docker fix ([#840](https://github.com/enricoros/big-AGI/issues/840))
- 1.16.12: Dockerfile update ([#840](https://github.com/enricoros/big-AGI/issues/840))
- 1.16.11: v1 final release, documentation updates
- 1.16.10: OpenRouter models support
- 1.16.9: Docker Gemini fix, R1 models support
- 1.16.8: OpenAI ChatGPT-4o Latest, o1 models support
- 1.16.7: OpenAI support for GPT-4o 2024-08-06
- 1.16.6: Groq support for Llama 3.1 models
- 1.16.5: GPT-4o Mini support
@@ -48,7 +203,10 @@ Note: After the V2 release in Q4, `v2-dev` will become the default branch and `v
- 1.16.2: Updates to Beam
- 1.16.1: Support for the new OpenAI GPT-4o 2024-05-13 model
### What's New in 1.16.0 · May 9, 2024 · Crystal Clear
</details>
<details>
<summary>What's New in 1.16.0 · May 9, 2024 · Crystal Clear</summary>
- [Beam](https://big-agi.com/blog/beam-multi-model-ai-reasoning) core and UX improvements based on user feedback
- Chat cost estimation 💰 (enable it in Labs / hover the token counter)
@@ -59,14 +217,20 @@ Note: After the V2 release in Q4, `v2-dev` will become the default branch and `v
- Models update: **Anthropic**, **Groq**, **Ollama**, **OpenAI**, **OpenRouter**, **Perplexity**
- Code soft-wrap, chat text selection toolbar, 3x faster on Apple silicon, and more [#517](https://github.com/enricoros/big-AGI/issues/517), [507](https://github.com/enricoros/big-AGI/pull/507)
#### 3,000 Commits Milestone · April 7, 2024
</details>
<details>
<summary>3,000 Commits Milestone · April 7, 2024</summary>
![big-AGI Milestone](https://github.com/enricoros/big-AGI/assets/32999/47fddbb1-9bd6-4b58-ace4-781dfcb80923)
- 🥇 Today we <b>celebrate commit 3000</b> in just over one year, and going stronger 🚀
- 📢️ Thanks everyone for your support and words of love for Big-AGI, we are committed to creating the best AI experiences for everyone.
### What's New in 1.15.0 · April 1, 2024 · Beam
</details>
<details>
<summary>What's New in 1.15.0 · April 1, 2024 · Beam</summary>
- ⚠️ [**Beam**: the multi-model AI chat](https://big-agi.com/blog/beam-multi-model-ai-reasoning). find better answers, faster - a game-changer for brainstorming, decision-making, and creativity. [#443](https://github.com/enricoros/big-AGI/issues/443)
- Managed Deployments **Auto-Configuration**: simplify the UI models setup with backend-set models. [#436](https://github.com/enricoros/big-AGI/issues/436)
@@ -76,6 +240,8 @@ Note: After the V2 release in Q4, `v2-dev` will become the default branch and `v
- 1.15.1: Support for Gemini Pro 1.5 and OpenAI Turbo models
- Beast release, over 430 commits, 10,000+ lines changed: [release notes](https://github.com/enricoros/big-AGI/releases/tag/v1.15.0), and changes [v1.14.1...v1.15.0](https://github.com/enricoros/big-AGI/compare/v1.14.1...v1.15.0)
</details>
<details>
<summary>What's New in 1.14.1 · March 7, 2024 · Modelmorphic</summary>
@@ -83,7 +249,7 @@ Note: After the V2 release in Q4, `v2-dev` will become the default branch and `v
- New **[Perplexity](https://www.perplexity.ai/)** and **[Groq](https://groq.com/)** integration (thanks @Penagwin). [#407](https://github.com/enricoros/big-AGI/issues/407), [#427](https://github.com/enricoros/big-AGI/issues/427)
- **[LocalAI](https://localai.io/models/)** deep integration, including support for [model galleries](https://github.com/enricoros/big-AGI/issues/411)
- **Mistral** Large and Google **Gemini 1.5** support
- Performance optimizations: runs [much faster](https://twitter.com/enricoros/status/1756553038293303434?utm_source=localhost:3000&utm_medium=big-agi), saves lots of power, reduces memory usage
- Performance optimizations: runs [much faster](https://x.com/enricoros/status/1756553038293303434?utm_source=localhost:3000&utm_medium=big-agi), saves lots of power, reduces memory usage
- Enhanced UX with auto-sizing charts, refined search and folder functionalities, perfected scaling
- And with more UI improvements, documentation, bug fixes (20 tickets), and developer enhancements
@@ -146,99 +312,85 @@ https://github.com/enricoros/big-AGI/assets/1590910/a6b8e172-0726-4b03-a5e5-10cf
</details>
For full details and former releases, check out the [changelog](docs/changelog.md).
For full details and former releases, check out the [archived versions changelog](docs/changelog.md).
## 👉 Key Features ✨
## 👉 Supported Models & Integrations
| ![Advanced AI](https://img.shields.io/badge/Advanced%20AI-32383e?style=for-the-badge&logo=ai&logoColor=white) | ![100+ AI Models](https://img.shields.io/badge/100%2B%20AI%20Models-32383e?style=for-the-badge&logo=ai&logoColor=white) | ![Flow-state UX](https://img.shields.io/badge/Flow--state%20UX-32383e?style=for-the-badge&logo=flow&logoColor=white) | ![Privacy First](https://img.shields.io/badge/Privacy%20First-32383e?style=for-the-badge&logo=privacy&logoColor=white) | ![Advanced Tools](https://img.shields.io/badge/Fun%20To%20Use-f22a85?style=for-the-badge&logo=tools&logoColor=white) |
Delightful UX with latest models exclusive features like Beam for **multi-model AI validation**.
> ![LLM Vendors](https://img.shields.io/badge/20_LLM_Services-500+_Models-black?style=for-the-badge&logo=openai&logoColor=white&labelColor=purple)&nbsp;
> [![Feature Beam](https://img.shields.io/badge/AI--Validation-BEAM-000?style=for-the-badge&logo=anthropic&labelColor=purple)](https://big-agi.com/beam)
| ![Advanced AI](https://img.shields.io/badge/Advanced%20AI-32383e?style=for-the-badge&logo=ai&logoColor=white) | ![500+ AI Models](https://img.shields.io/badge/500%2B%20AI%20Models-32383e?style=for-the-badge&logo=ai&logoColor=white) | ![Flow-state UX](https://img.shields.io/badge/Flow--state%20UX-32383e?style=for-the-badge&logo=flow&logoColor=white) | ![Privacy First](https://img.shields.io/badge/Privacy%20First-32383e?style=for-the-badge&logo=privacy&logoColor=white) | ![Advanced Tools](https://img.shields.io/badge/Fun%20To%20Use-f22a85?style=for-the-badge&logo=tools&logoColor=white) |
|---------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------|
| **Chat**<br/>**Call**<br/>**Beam**<br/>**Draw**, ... | Local & Cloud<br/>Open & Closed<br/>Cheap & Heavy<br/>Google, Mistral, ... | Attachments<br/>Diagrams<br/>Multi-Chat<br/>Mobile-first UI | Stored Locally<br/>Easy self-Host<br/>Local actions<br/>Data = Gold | AI Personas<br/>Voice Modes<br/>Screen Capture<br/>Camera + OCR |
![big-AGI screenshot](docs/pixels/big-AGI-compo-20240201_small.png)
You can easily configure 100s of AI models in big-AGI:
### AI Models & Vendors
| **AI models** | _supported vendors_ |
|:--------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Opensource Servers | [LocalAI](https://localai.io/) (multimodal) · [Ollama](https://ollama.com/) |
| Local Servers | [LM Studio](https://lmstudio.ai/) |
| Multimodal services | [Azure](https://azure.microsoft.com/en-us/products/ai-services/openai-service) · [Google Gemini](https://ai.google.dev/) · [OpenAI](https://platform.openai.com/docs/overview) |
| Language services | [Anthropic](https://anthropic.com) · [Groq](https://wow.groq.com/) · [Mistral](https://mistral.ai/) · [OpenRouter](https://openrouter.ai/) · [Perplexity](https://www.perplexity.ai/) · [Together AI](https://www.together.ai/) |
| Image services | [Prodia](https://prodia.com/) (SDXL) |
| Speech services | [ElevenLabs](https://elevenlabs.io) (Voice synthesis / cloning) |
Configure 100s of AI models from 20+ providers:
Add extra functionality with these integrations:
| **AI models** | _supported vendors_ |
|:--------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Opensource Servers | [LocalAI](https://localai.io/) · [Ollama](https://ollama.com/) |
| Local Servers | [LM Studio](https://lmstudio.ai/) (non-open) |
| Multimodal services | [Anthropic](https://anthropic.com) · [AWS Bedrock](https://aws.amazon.com/bedrock/) · [Azure](https://azure.microsoft.com/en-us/products/ai-services/openai-service) · [Google Gemini](https://ai.google.dev/) · [OpenAI](https://platform.openai.com/docs/overview) |
| LLM services | [Alibaba](https://www.alibabacloud.com/en/product/modelstudio) · [DeepSeek](https://deepseek.com) · [Groq](https://wow.groq.com/) · [Mistral](https://mistral.ai/) · [Moonshot](https://www.moonshot.cn/) · [OpenPipe](https://openpipe.ai/) · [OpenRouter](https://openrouter.ai/) · [Perplexity](https://www.perplexity.ai/) · [Together AI](https://www.together.ai/) · [xAI](https://x.ai/) · [Z.ai](https://z.ai/) |
| OpenAI-compatible | Any OpenAI-compatible endpoint - models, pricing, and capabilities are auto-detected |
| Image services | OpenAI · Google Gemini (Nano Banana) · LocalAI |
| Speech services | [ElevenLabs](https://elevenlabs.io) · [Inworld](https://inworld.ai) · [OpenAI TTS](https://platform.openai.com/docs/guides/text-to-speech) · LocalAI · Browser (Web Speech API) |
| **More** | _integrations_ |
|:-------------|:---------------------------------------------------------------------------------------------------------------|
| Web Browse | [Browserless](https://www.browserless.io/) · [Puppeteer](https://pptr.dev/)-based |
| Web Search | [Google CSE](https://programmablesearchengine.google.com/) |
| Code Editors | [CodePen](https://codepen.io/pen/) · [StackBlitz](https://stackblitz.com/) · [JSFiddle](https://jsfiddle.net/) |
| Sharing | [Paste.gg](https://paste.gg/) (Paste chats) |
| Tracking | [Helicone](https://www.helicone.ai) (LLM Observability) |
### Additional Integrations
[//]: # (- [x] **Flow-state UX** for uncompromised productivity)
[//]: # (- [x] **AI Personas**: Tailor your AI interactions with customizable personas)
[//]: # (- [x] **Sleek UI/UX**: A smooth, intuitive, and mobile-responsive interface)
[//]: # (- [x] **Efficient Interaction**: Voice commands, OCR, and drag-and-drop file uploads)
[//]: # (- [x] **Privacy First**: Self-host and use your own API keys for full control)
[//]: # (- [x] **Advanced Tools**: Execute code, import PDFs, and summarize documents)
[//]: # (- [x] **Seamless Integrations**: Enhance functionality with various third-party services)
[//]: # (- [x] **Open Roadmap**: Contribute to the progress of big-AGI)
<br/>
## 🚀 Installation
To get started with big-AGI, follow our comprehensive [Installation Guide](docs/installation.md).
The guide covers various installation options, whether you're spinning it up on
your local computer, deploying on Vercel, on Cloudflare, or rolling it out
through Docker.
Whether you're a developer, system integrator, or enterprise user, you'll find step-by-step instructions
to set up big-AGI quickly and easily.
[![Installation Guide](https://img.shields.io/badge/Installation%20Guide-blue?style=for-the-badge&logo=read-the-docs&logoColor=white)](docs/installation.md)
Or bring your API keys and jump straight into our free instance on [big-AGI.com](https://big-agi.com).
<br/>
# 🌟 Get Involved!
[//]: # ([![Official Discord]&#40;https://img.shields.io/discord/1098796266906980422?label=discord&logo=discord&logoColor=%23fff&style=for-the-badge&#41;]&#40;https://discord.gg/MkH4qj2Jp9&#41;)
[![Official Discord](https://discordapp.com/api/guilds/1098796266906980422/widget.png?style=banner2)](https://discord.gg/MkH4qj2Jp9)
- [ ] 📢️ [**Chat with us** on Discord](https://discord.gg/MkH4qj2Jp9)
- [ ]**Give us a star** on GitHub 👆
- [ ] 🚀 **Do you like code**? You'll love this gem of a project! [_Pick up a task!_](https://github.com/users/enricoros/projects/4/views/4) - _easy_ to _pro_
- [ ] 💡 Got a feature suggestion? [_Add your roadmap ideas_](https://github.com/enricoros/big-agi/issues/new?&template=roadmap-request.md)
- [ ] ✨ [Deploy](docs/installation.md) your [fork](docs/customizations.md) for your friends and family, or [customize it for work](docs/customizations.md)
<br/>
[//]: # ([![GitHub stars]&#40;https://img.shields.io/github/stars/enricoros/big-agi&#41;]&#40;https://github.com/enricoros/big-agi/stargazers&#41;)
[//]: # ([![GitHub forks]&#40;https://img.shields.io/github/forks/enricoros/big-agi&#41;]&#40;https://github.com/enricoros/big-agi/network&#41;)
[//]: # ([![GitHub pull requests]&#40;https://img.shields.io/github/issues-pr/enricoros/big-agi&#41;]&#40;https://github.com/enricoros/big-agi/pulls&#41;)
[//]: # ([![License]&#40;https://img.shields.io/github/license/enricoros/big-agi&#41;]&#40;https://github.com/enricoros/big-agi/LICENSE&#41;)
## 📜 Licensing
Big-AGI incorporates third-party software components that are subject
to separate license terms. For detailed information about these
components and their respective licenses, please refer to
the [Third-Party Notices](src/modules/3rdparty/THIRD_PARTY_NOTICES.md).
| **More** | _integrations_ |
|:--------------|:---------------------------------------------------------------------------------------------------------------|
| Web Browse | [Browserless](https://www.browserless.io/) · [Puppeteer](https://pptr.dev/)-based |
| Web Search | [Google CSE](https://programmablesearchengine.google.com/) |
| Observability | [Helicone](https://www.helicone.ai) |
---
2023-2024 · Enrico Ros x [Big-AGI](https://big-agi.com) · Like this project? Leave a star! 💫⭐
## 🚀 Installation
Self-host with Docker, deploy on Vercel, or develop locally. Full setup guide:
[![Installation Guide](https://img.shields.io/badge/Installation%20Guide-blue?style=for-the-badge&logo=read-the-docs&logoColor=white)](docs/installation.md)
Or use the hosted version at [big-agi.com](https://big-agi.com) with your API keys.
---
## 👋 Community & Contributing
### Connect
[![Official Discord](https://discordapp.com/api/guilds/1098796266906980422/widget.png?style=banner2)](https://discord.gg/MkH4qj2Jp9)
⭐ [Star the repo](https://github.com/enricoros/big-agi) if Big-AGI is useful to you
### Contribute
**🤖 AI-Powered Issue Assistance**
When you open an issue, our custom AI triage system (powered by [Claude Code](https://github.com/anthropics/claude-code-action) with Big-AGI architecture documentation) analyzes it, searches the codebase, and provides solutions - typically within 30 minutes. We've trained the system on our modules and subsystems so it handles most issues effectively. Your feedback drives development!
[![Open an Issue](https://img.shields.io/badge/Open_Issue-AI_Will_Help-ff8c00?style=for-the-badge&logo=fireship&logoColor=fff&labelColor=8b0000)](https://github.com/enricoros/big-agi/issues/new?template=ai-triage.yml)
[![Request Feature](https://img.shields.io/badge/Request_Feature-Roadmap_Idea-orange?style=for-the-badge&logo=lightbulb&logoColor=white)](https://github.com/enricoros/big-agi/issues/new?&template=roadmap-request.md)
[![Good First Issues](https://img.shields.io/badge/Good_First_Issues-Start-blue?style=for-the-badge&logo=github&logoColor=white)](https://github.com/users/enricoros/projects/4/views/4)
[![Customization](https://img.shields.io/badge/Fork_&_Customize-Your_Own-purple?style=for-the-badge&logo=git&logoColor=white)](docs/customizations.md)
[![Roadmap](https://img.shields.io/badge/Open_Roadmap-View-0366d6?style=for-the-badge&logo=github&logoColor=white)](https://github.com/users/enricoros/projects/4/views/2)
#### Contributors
<a href="https://github.com/enricoros/big-agi/graphs/contributors">
<img src="https://contrib.rocks/image?repo=enricoros/big-agi&max=48&columns=12" />
</a>
---
## License
MIT License · [Third-Party Notices](src/modules/3rdparty/THIRD_PARTY_NOTICES.md)
**2023-2026** · [Enrico Ros](https://www.enricoros.com) × [Token Fabrics](https://www.tokenfabrics.com)
+22 -7
View File
@@ -2,23 +2,38 @@ import { fetchRequestHandler } from '@trpc/server/adapters/fetch';
import { appRouterCloud } from '~/server/trpc/trpc.router-cloud';
import { createTRPCFetchContext } from '~/server/trpc/trpc.server';
import { posthogServerSendException } from '~/server/posthog/posthog.server';
const handlerNodeRoutes = (req: Request) => fetchRequestHandler({
endpoint: '/api/cloud',
router: appRouterCloud,
req,
createContext: createTRPCFetchContext,
onError:
process.env.NODE_ENV === 'development'
? ({ path, error }) => console.error(`❌ tRPC-cloud failed on ${path ?? 'unk-path'}: ${error.message}`)
: undefined,
onError: async function({ path, error, type, ctx }) {
// -> DEV error logging
if (process.env.NODE_ENV === 'development')
console.error(`❌ tRPC-cloud failed on ${path ?? 'unk-path'}: ${error.message}`);
// -> Capture node errors
await posthogServerSendException(error, undefined, {
domain: 'trpc-onerror',
runtime: 'nodejs',
endpoint: path ?? 'unknown',
method: req.method,
url: req.url,
additionalProperties: {
error_code: error.code,
error_type: type,
},
});
},
});
// NOTE: the following statement breaks the build on non-pro deployments, and conditionals don't work either
// so we resorted to raising the timeout from 10s to 25s in the vercel.json file instead
// export const maxDuration = 25;
// so we resorted to raising the timeout from 10s to 60s in the vercel.json file instead
// export const maxDuration = 60;
export const runtime = 'nodejs';
export const dynamic = 'force-dynamic';
export { handlerNodeRoutes as GET, handlerNodeRoutes as POST };
+3 -1
View File
@@ -10,9 +10,11 @@ const handlerEdgeRoutes = (req: Request) => fetchRequestHandler({
createContext: createTRPCFetchContext,
onError:
process.env.NODE_ENV === 'development'
? ({ path, error }) => console.error(`❌ tRPC-edge failed on ${path ?? 'unk-path'}: ${error.message}`)
? ({ path, error }) => console.error(`\n❌ tRPC-edge failed on ${path ?? 'unk-path'}: ${error.message}`)
: undefined,
});
// NOTE: we don't set maxDuration explicitly here - however we set it in the Vercel project settings, raising to the limit of 300s
// export const maxDuration = 60;
export const runtime = 'edge';
export { handlerEdgeRoutes as GET, handlerEdgeRoutes as POST };
+1 -4
View File
@@ -1,8 +1,6 @@
# Very simple docker-compose file to run the app on http://localhost:3000 (or http://127.0.0.1:3000).
#
# For more examples, such runnin big-AGI alongside a web browsing service, see the `docs/docker` folder.
version: '3.9'
# For more examples, such running big-AGI alongside a web browsing service, see the `docs/docker` folder.
services:
big-agi:
@@ -11,4 +9,3 @@ services:
- "3000:3000"
env_file:
- .env
command: [ "next", "start", "-p", "3000" ]
+4
View File
@@ -1,3 +1,7 @@
---
unlisted: true
---
# AIX dispatch server - API features comparison
This is updated as of 2024-07-09, and includes the latest features and capabilities of the three major AI APIs: Anthropic, Gemini, and OpenAI.
+41 -22
View File
@@ -2,36 +2,54 @@
Information you need to get started, configure, and use big-AGI productively.
👉 **[Changelog](https://big-agi.com/changes)** - See what's new
## Getting Started
Guides for basic big-AGI features:
Essential guides:
- **[Enabling Microphone for Speech Recognition](help-feature-microphone.md)**: Instructions to
allow speech recognition in browsers and apps.
- **[FAQ](help-faq.md)**: Common questions and answers
- **[Enabling Microphone](help-feature-microphone.md)**: Configure speech recognition in your browser
- **[Data Ownership](help-data-ownership.md)**: How your data is stored and managed
- **[Live File](help-feature-livefile.md)**: Live file attachment feature
## AI Model Configuration
## AI Services
Detailed guides to configure AI models and advanced features in big-AGI.
How to set up AI models and features in big-AGI.
> 👉 The following applies to users of big-AGI.com, as the public instance is empty and requires user configuration.
- **Cloud AI Services**:
- **[Azure OpenAI](config-azure-openai.md)**
- **[OpenRouter](config-openrouter.md)**
- Easy API key setup: **Anthropic**, **Deepseek**, **Google AI**, **Groq**, **Mistral**, **OpenAI**, **OpenPipe**, **Perplexity**, **TogetherAI**, **xAI**
- Easy API key configuration:
[Alibaba](https://bailian.console.alibabacloud.com/?apiKey=1#/api-key),
[Anthropic](https://console.anthropic.com/settings/keys),
[AWS Bedrock](https://console.aws.amazon.com/bedrock/),
[Deepseek](https://platform.deepseek.com/api_keys),
[Google Gemini](https://aistudio.google.com/app/apikey),
[Groq](https://console.groq.com/keys),
[Mistral](https://console.mistral.ai/api-keys/),
[Moonshot](https://platform.moonshot.cn/console/api-keys),
[OpenAI](https://platform.openai.com/api-keys),
[OpenPipe](https://app.openpipe.ai/settings),
[Perplexity](https://www.perplexity.ai/settings/api),
[TogetherAI](https://api.together.xyz/settings/api-keys),
[xAI](https://x.ai/api),
[Z.ai](https://z.ai/)
- **[Azure OpenAI](config-azure-openai.md)** guide
- **[OpenRouter](config-openrouter.md)** guide
- **OpenAI-compatible endpoints**: Any provider with an OpenAI-compatible API works out of the box - models, pricing, and capabilities are auto-detected
- **Local AI Integrations**:
- **[LocalAI](config-local-localai.md)**
- **[LM Studio](config-local-lmstudio.md)**
- **[Ollama](config-local-ollama.md)**
- [LocalAI](config-local-localai.md), [LM Studio](config-local-lmstudio.md), [Ollama](config-local-ollama.md)
- **Enhanced AI Features**:
- **[Web Browsing](config-feature-browse.md)**: Enable web page download through third-party services or your own cloud (advanced)
- **[Web Browsing](config-feature-browse.md)**: Enable web page download through third-party services or your own cloud
- **Web Search**: Google Search API (see '[Environment Variables](environment-variables.md)')
- **Image Generation**: DALL·E 3 and 2, or Prodia API for Stable Diffusion XL
- **Voice Synthesis**: ElevenLabs API for voice generation
- **Image Generation**: GPT Image (gpt-image-1), Nano Banana, DALL·E 3 and 2
- **Voice Synthesis**: ElevenLabs, Inworld, OpenAI TTS, LocalAI, or browser Web Speech API
- **[Google Drive](config-feature-google-drive.md)**: Attach files from Google Drive
## Deployment & Customization
@@ -39,24 +57,25 @@ Detailed guides to configure AI models and advanced features in big-AGI.
For deploying a custom big-AGI instance:
- **[Installation Guide](installation.md)**: Set up your own big-AGI instance
- **[Installation Guide](installation.md)**, including:
- Set up your own big-AGI instance
- Source build or pre-built options
- Local, cloud, or on-premises deployment
- **Advanced Setup**:
- **[Source Code Customization Guide](customizations.md)**: Modify the source code
- **[Source Code Customization](customizations.md)**: Modify the source code
- **[Access Control](deploy-authentication.md)**: Optional, add basic user authentication
- **[Database Setup](deploy-database.md)**: Optional, enables "Chat Link Sharing"
- **[Reverse Proxy](deploy-reverse-proxy.md)**: Optional, enables custom domains and SSL
- **[Docker Deployment](deploy-docker.md)**: Deploy with Docker containers
- **[Kubernetes](deploy-k8s.md)**: Deploy on Kubernetes clusters
- **[Analytics](deploy-analytics.md)**: Set up usage analytics
- **[Environment Variables](environment-variables.md)**: Pre-configures models and services
## Community & Support
Connect with the growing big-AGI community:
- Check the [changelog](https://big-agi.com/changes) for the latest updates
- Visit our [GitHub repository](https://github.com/enricoros/big-AGI) for source code and issue tracking
- Check the latest updates and features on [Changelog](changelog.md) or the in-app [News](https://get.big-agi.com/news)
- Connect with us and other users on [Discord](https://discord.gg/MkH4qj2Jp9) for discussions, help, and sharing your experiences with big-AGI
- Join our [Discord](https://discord.gg/MkH4qj2Jp9) for discussions and help
Thank you for choosing big-AGI. We're excited to give you the best tools to amplify yourself.
Let's build something great.
+23 -9
View File
@@ -1,18 +1,33 @@
## Changelog
## Archived Versions - Changelog
This is a high-level changelog. Calls out some of the high level features batched
by release.
- For the live changelog, see [big-agi.com/changes](https://big-agi.com/changes)
- For the live roadmap, please see [the GitHub project](https://github.com/users/enricoros/projects/4/views/2)
### 1.17.0 - Jun 2024
> NOTE: with the release of 2.0.0 we switching to [big-agi.com/changes](https://big-agi.com/changes) for the
> continuously updated changelog.
- milestone: [1.17.0](https://github.com/enricoros/big-agi/milestone/17)
- work in progress: [big-AGI open roadmap](https://github.com/users/enricoros/projects/4/views/2), [help here](https://github.com/users/enricoros/projects/4/views/4)
### What's New in 2 · Oct 31, 2025 · Open
### What's New in 1.16.1...1.16.8 · Sep 13, 2024 (patch releases)
- **Big-AGI Open** is ready and more productive and faster than ever, with:
- **Beam 2**: multi-modal, program-based, follow-ups, save presets
- Top-notch AI models support including **agentic models** and **reasoning models**
- **Image Generation** and editing with Nano Banana and gpt-image-1
- **Web Search** with citations for supported models
- **UI** & Mobile UI overhaul with peeking and side panels
- And all of the [Big-AGI 2 changes](https://github.com/enricoros/big-AGI/issues/567#issuecomment-2262187617) and more
- Built for the future, madly optimized
- 1.16.8: OpenAI ChatGPT-4o Latest (o1-preview and o1-mini are supported in Big-AGI 2)
### What's New in 1.16.1...1.16.13 · (patch releases)
- 1.16.13: Docker fix (#840)
- 1.16.12: Dockerfile update (#840)
- 1.16.11: v1 final release, documentation updates
- 1.16.10: OpenRouter models support
- 1.16.9: Docker Gemini fix, R1 models support
- 1.16.8: OpenAI ChatGPT-4o Latest, o1 models support
- 1.16.7: OpenAI support for GPT-4o 2024-08-06
- 1.16.6: Groq support for Llama 3.1 models
- 1.16.5: GPT-4o Mini support
@@ -46,7 +61,7 @@ by release.
### What's New in 1.15.0 · April 1, 2024 · Beam
- ⚠️ [**Beam**: the multi-model AI chat](https://big-agi.com/blog/beam-multi-model-ai-reasoning). find better answers, faster - a game-changer for brainstorming, decision-making, and creativity. [#443](https://github.com/enricoros/big-AGI/issues/443)
- Managed Deployments **Auto-Configuration**: simplify the UI mdoels setup with backend-set models. [#436](https://github.com/enricoros/big-AGI/issues/436)
- Managed Deployments **Auto-Configuration**: simplify the UI models setup with backend-set models. [#436](https://github.com/enricoros/big-AGI/issues/436)
- Message **Starring ⭐**: star important messages within chats, to attach them later. [#476](https://github.com/enricoros/big-AGI/issues/476)
- Enhanced the default Persona
- Fixes to Gemini models and SVGs, improvements to UI and icons
@@ -58,7 +73,7 @@ by release.
- New **[Perplexity](https://www.perplexity.ai/)** and **[Groq](https://groq.com/)** integration (thanks @Penagwin). [#407](https://github.com/enricoros/big-AGI/issues/407), [#427](https://github.com/enricoros/big-AGI/issues/427)
- **[LocalAI](https://localai.io/models/)** deep integration, including support for [model galleries](https://github.com/enricoros/big-AGI/issues/411)
- **Mistral** Large and Google **Gemini 1.5** support
- Performance optimizations: runs [much faster](https://twitter.com/enricoros/status/1756553038293303434?utm_source=localhost:3000&utm_medium=big-agi), saves lots of power, reduces memory usage
- Performance optimizations: runs [much faster](https://x.com/enricoros/status/1756553038293303434?utm_source=localhost:3000&utm_medium=big-agi), saves lots of power, reduces memory usage
- Enhanced UX with auto-sizing charts, refined search and folder functionalities, perfected scaling
- And with more UI improvements, documentation, bug fixes (20 tickets), and developer enhancements
- [Release notes](https://github.com/enricoros/big-AGI/releases/tag/v1.14.0), and changes [v1.13.1...v1.14.0](https://github.com/enricoros/big-AGI/compare/v1.13.1...v1.14.0) (233 commits, 8,000+ lines changed)
@@ -216,7 +231,6 @@ For Developers:
- **[Install Mobile APP](../docs/pixels/feature_pwa.png)** 📲 looks like native (@harlanlewis)
- **[UI language](../docs/pixels/feature_language.png)** with auto-detect, and future app language! (@tbodyston)
- **PDF Summarization** 🧩🤯 - ask questions to a PDF! (@fredliubojin)
- **Code Execution: [Codepen](https://codepen.io/)** 💻 (@harlanlewis)
- **[SVG Drawing](../docs/pixels/feature_svg_drawing.png)** - draw with AI 🎨
- Chats: multiple chats, AI titles, Import/Export, Selection mode
- Rendering: Markdown, SVG, improved Code blocks
+48 -28
View File
@@ -14,7 +14,7 @@ If you have an `API Endpoint` and `API Key`, you can configure big-AGI as follow
1. Launch the `big-AGI` application
2. Go to the **Models** settings
3. Add a Vendor and select **Azure OpenAI**
- Enter the Endpoint (e.g., 'https://your-openai-api-1234.openai.azure.com/')
- Enter the Endpoint (e.g., 'https://your-resource-name.openai.azure.com')
- Enter the API Key (e.g., 'fd5...........................ba')
The deployed models are now available in the application. If you don't have a configured
@@ -23,6 +23,36 @@ Azure OpenAI service instance, continue with the next section.
In addition to using the UI, configuration can also be done using
[environment variables](environment-variables.md).
## Server Configuration
For server deployments, set these environment variables:
```bash
AZURE_OPENAI_API_ENDPOINT=https://your-resource-name.openai.azure.com
AZURE_OPENAI_API_KEY=your-api-key
```
This enables Azure OpenAI for all users without requiring individual API keys. For more details, see [environment-variables.md](environment-variables.md).
## Azure OpenAI API Versions
Azure OpenAI supports both traditional deployment-based API and the next-generation v1 API:
### Next-Generation v1 API (Default)
- **Enabled by default** for GPT-5-like models (GPT-5, GPT-6, o3, o4, etc.)
- Uses direct `/openai/v1/responses` endpoint without deployment IDs
- Optimized for advanced reasoning models and new features
- Can be disabled by setting `AZURE_OPENAI_DISABLE_V1=true`
### Traditional Deployment-Based API
- Uses `/openai/deployments/{deployment-name}/...` endpoints
- Required for older models and when v1 API is disabled
- Needs deployment ID for all API calls
### Known Limitations
- **Web Search Tool**: Azure OpenAI does not support the `web_search_preview` tool that's available in OpenAI's API
- Models with web search capabilities will have this feature automatically disabled on Azure
## Setting Up Azure
### Step 1: Azure Account & Subscription
@@ -34,18 +64,7 @@ In addition to using the UI, configuration can also be done using
- Fill in the required fields and click on **Create**
- Note down the **Subscription ID** (e.g., `12345678-1234-1234-1234-123456789012`)
### Step 2: Apply for Azure OpenAI Service
We'll now be creating "OpenAI"-specific resources on Azure. This requires to 'apply',
and acceptance should be quick (even as low as minutes).
1. Visit [Azure OpenAI Service](https://aka.ms/azure-openai)
2. Click on **Apply for access**
- Fill in the required fields (including the subscription ID) and click on **Apply**
Once your application is accepted, you can create OpenAI resources on Azure.
### Step 3: Create Azure OpenAI Resource
### Step 2: Create Azure OpenAI Resource
For more information, see [Azure: Create and deploy OpenAI](https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/create-resource?pivots=web-portal)
@@ -55,31 +74,32 @@ For more information, see [Azure: Create and deploy OpenAI](https://learn.micros
![Creating an OpenAI service](pixels/config-azure-openai-create.png)
- Select the subscription
- Select a resource group or create a new one
- Select the region. Note that the region determines the available models.
> For instance, **Canada East** offers GPT-4-32k models, For the full list, see [GPT-4 models](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models)
- Select the region. **Important**: The region determines which models are available.
> Popular regions like **East US**, **West Europe**, and **Australia East** typically have the best model availability. For the latest model availability by region, see [Azure OpenAI Model Availability](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models)
- Name the service (e.g., `your-openai-api-1234`)
- Select a pricing tier (e.g., `S0` for standard)
- Select: "All networks, including the internet, can access this resource."
- Click on **Review + create** and then **Create**
After creating the resource, you can access the API Keys and Endpoints. At any point, you can go to
the OpenAI Service instance page to get this information.
After creating the resource, you can access the API Keys and Endpoints:
- Click on **Go to resource**
- Click on **Develop**
- Copy the `Endpoint`, called "Language API", e.g. 'https://your-openai-api-1234.openai.azure.com/'
- Copy `KEY 1`
1. Click on **Go to resource** (or navigate to your Azure OpenAI resource)
2. In the left sidebar, under **Resource Management**, click on **Keys and Endpoint**
3. Copy the required information:
- **Endpoint**: e.g., 'https://your-resource-name.openai.azure.com/'
- **Key**: Copy either KEY 1 or KEY 2 (both work identically)
### Step 4: Deploy Models
### Step 3: Deploy Models
By default, Azure OpenAI resource instances don't have models available. You need to deploy the models you want to use.
1. Click on **Model Deployments > Manage Deployments**
2. Click on **+Create New Deployment**
![Deploying a model](pixels/config-azure-openai-deploy.png)
- Select the model you want to deploy
- Optionally select a version
- name the model, e.g., `gpt4-32k-0613`
1. In your Azure OpenAI resource, click on **Model deployments** in the left sidebar
2. Click on **Create new deployment**
3. Fill in the deployment details:
- **Select a model**: Choose from available models
- **Model version**: Select the latest version or a specific one
- **Deployment name**: Give it a meaningful name
4. Click **Deploy**
Repeat as necessary for each model you want to deploy.
+55
View File
@@ -0,0 +1,55 @@
# Google Drive Integration
Attach files from Google Drive directly in the chat composer.
## Setup
### 1. Enable APIs
In [Google Cloud Console](https://console.cloud.google.com/):
1. Go to **APIs & Services > Library**
2. Enable **Google Drive API** and **Google Picker API**
### 2. Configure OAuth
1. Go to **APIs & Services > OAuth consent screen**
2. Create consent screen (External or Internal)
3. Add scope: `https://www.googleapis.com/auth/drive.file`
4. Add test users if in testing mode
### 3. Create Credentials
1. Go to **APIs & Services > Credentials**
2. Create **OAuth client ID** (Web application)
3. Add JavaScript origins:
- `http://localhost:3000` (dev)
- `https://your-domain.com` (prod)
### 4. Set Environment Variable
```bash
NEXT_PUBLIC_GOOGLE_DRIVE_CLIENT_ID=your-client-id.apps.googleusercontent.com
```
## Usage
- Click **Drive** button in attachment menu
## Supported Files
| Type | Export Format |
|-----------------|---------------------|
| Regular files | Downloaded directly |
| Google Docs | Markdown (.md) |
| Google Sheets | CSV (.csv) |
| Google Slides | PDF (.pdf) |
| Google Drawings | SVG (.svg) |
## Troubleshooting
**Picker won't open**: Check `NEXT_PUBLIC_GOOGLE_DRIVE_CLIENT_ID` is set and APIs are enabled.
**OAuth errors**: Verify your domain is in authorized JavaScript origins. Add yourself as test user if app is in testing mode.
**Download fails**: Check file permissions and that Drive API is enabled.
+3 -1
View File
@@ -41,6 +41,8 @@ In addition to using the UI, configuration can also be done using
### Integration: Models Gallery
> Note: The Gallery Admin feature described below may have been removed or renamed in recent versions of big-AGI.
If the running LocalAI instance is configured with a [Model Gallery](https://localai.io/models/):
- Go to Models > LocalAI
@@ -54,7 +56,7 @@ If the running LocalAI instance is configured with a [Model Gallery](https://loc
At the time of writing, LocalAI does not publish the model `context window size`.
Every model is assumed to be capable of chatting, and with a context window of 4096 tokens.
Please update the [src/modules/llms/transports/server/openai/models/models.data.ts](../src/modules/llms/server/openai/models/models.data.ts)
Please update the [src/modules/llms/server/models.mappings.ts](../src/modules/llms/server/models.mappings.ts)
file with the mapping information between LocalAI model IDs and names/descriptions/tokens, etc.
# 🤝 Support
+4 -5
View File
@@ -1,8 +1,7 @@
# OpenRouter Configuration
[OpenRouter](https://openrouter.ai) is a standalone, premium service
that provides access to <Link href='https://openrouter.ai/docs#models' target='_blank'>exclusive AI models</Link>
such as GPT-4 32k, Claude, and more. These models are typically not available to the public.
that provides access to a wide range of AI models from multiple providers through a single API.
This document details the process of integrating OpenRouter with big-AGI.
### 1. OpenRouter Account Setup and API Key Generation
@@ -20,7 +19,7 @@ This document details the process of integrating OpenRouter with big-AGI.
![feature-openrouter-add.png](pixels/feature-openrouter-add.png)
3. Input the API key into the **OpenRouter API Key** field, and load the Models.
![feature-openrouter-configure.png](pixels/feature-openrouter-configure.png)
4. OpenAI GPT4-32k and other models will now be accessible and selectable in the application.
4. Models from all supported providers will now be accessible and selectable in the application.
In addition to using the UI, configuration can also be done using
[environment variables](environment-variables.md).
@@ -30,5 +29,5 @@ In addition to using the UI, configuration can also be done using
OpenRouter independently manages its service and pricing and is not affiliated with big-AGI.
For more detailed information, please visit [this page](https://openrouter.ai/docs#models).
Please note that running large models such as GPT-4 32k can be costly and may rapidly consume
credits - a single prompt may cost $1 or more, at the time of writing.
Please note that running large models can be costly and may rapidly consume credits.
Check model pricing on the OpenRouter website before use.
+30 -9
View File
@@ -31,17 +31,14 @@ At time of writing, big-AGI has only 2 operations that run on Node.js Functions:
browsing (fetching web pages) and sharing. They both can exceed 10 seconds, especially
when fetching large pages or waiting for websites to be completed.
We provide `vercel_PRODUCTION.json` to raise the duration to 25 seconds (from a default of 10), to use it,
make sure to rename it to `vercel.json` before build.
From the Vercel Project > Settings > General > Build & Development Settings,
you can for instance set the build command to:
```bash
mv vercel_PRODUCTION.json vercel.json; next build
next build
```
### Change the Personas
### Change the Personas (v1.x only)
Edit the `src/data.ts` file to customize personas. This file houses the default personas. You can add, remove, or modify these to meet your project's needs.
@@ -52,20 +49,44 @@ Edit the `src/data.ts` file to customize personas. This file houses the default
Adapt the UI to match your project's aesthetic, incorporate new features, or exclude unnecessary ones.
- [ ] Adjust `src/common/app.theme.ts` for theme changes: colors, spacing, button appearance, animations, etc
- [ ] Modify `src/common/app.config.tsx` to alter the application's name
- [ ] Update `src/common/app.nav.tsx` to revise the navigation bar
- [ ] Modify `src/common/app.release.ts` to alter the application's name
- [ ] Update `src/common/app.nav.ts` to revise the navigation bar
### Add a Message of the Day
You can display a temporary announcement banner at the top of the app using the `NEXT_PUBLIC_MOTD` environment variable.
- Set this variable in your deployment environment
- The message supports template variables:
- `{{app_build_hash}}`: Current git commit hash
- `{{app_build_pkgver}}`: Package version
- `{{app_build_time}}`: Build timestamp as date
- `{{app_deployment_type}}`: Deployment type (local, docker, vercel, etc.)
- Users can dismiss the message (until next page refresh)
- Use it for version announcements, maintenance notices, or feature highlights
Example: `NEXT_PUBLIC_MOTD=🚀 New features available in {{app_build_pkgver}}! Try the improved Beam.`
## Testing & Deployment
Test your application thoroughly using local development (refer to README.md for local build instructions). Deploy using your preferred hosting service. big-AGI supports deployment on platforms like Vercel, Docker, or any Node.js-compatible service, especially those supporting NextJS's "Edge Runtime."
- [deploy-cloudflare.md](deploy-cloudflare.md): for Cloudflare Workers deployment
- [deploy-cloudflare.md](deploy-cloudflare.md): for Cloudflare Pages deployment (limited support)
- [deploy-docker.md](deploy-docker.md): for Docker deployment instructions and examples
- [deploy-k8s.md](deploy-k8s.md): for Kubernetes deployment instructions and examples
## Debugging
We introduced the `/info/debug` page that provides a detailed overview of the application's environment, including the API keys, environment variables, and other configuration settings.
The application includes a client-side logging system. You can view recent logs via the UI (Settings > Tools > Logs).
For deeper debugging during development:
1. **Debug Page**: Access the `/info/debug` page for an overview of the application's environment, configuration, API status, and environment variables available to the client.
2. **Conditional Breakpoints**: To automatically pause execution in your browser's developer tools when critical errors (`error`, `critical`, `DEV` levels) are logged to the console, set the following environment variable in your local `.env.local` file and restart your development server:
```bash
NEXT_PUBLIC_DEBUG_BREAKS=true
```
This allows you to inspect the application state at the exact moment an important error occurs. This feature only works in development mode (`npm run dev`) and requires the environment variable to be explicitly set to `true`.
<br/>
+51 -34
View File
@@ -2,8 +2,9 @@
The open-source big-AGI project provides support for the following analytics services:
- **Vercel Analytics**: automatic when deployed to Vercel
- **Google Analytics 4**: manual setup required
- **PostHog Analytics**: manual setup required
- **Vercel Analytics**: automatic when deployed to Vercel
The following is a quick overview of the Analytics options for the deployers of this open-source project.
big-AGI is deployed to many large-scale and enterprise though various ways (custom builds, Docker, Vercel, Cloudflare, etc.),
@@ -11,32 +12,6 @@ and this guide is for its customization.
## Service Configuration
### Vercel Analytics
- Why: understand coarse traction, and identify deployment issues - all without tracking individual users
- What: top pages, top referrers, country of origin, operating system, browser, and page speed metrics
Vercel Analytics and Speed Insights are local API endpoints deployed to your domain, so everything stays within your
domain. Furthermore, the Vercel Analytics service is privacy-friendly, and does not track individual users.
This service is avaialble to system administrators when deploying to Vercel. It is automatically enabled when deploying to Vercel.
The code that activates Vercel Analytics is located in the `src/pages/_app.tsx` file:
```tsx
const MyApp = ({ Component, emotionCache, pageProps }: MyAppProps) => <>
...
{isVercelFromFrontend && <VercelAnalytics debug={false} />}
{isVercelFromFrontend && <VercelSpeedInsights debug={false} sampleRate={1 / 2} />}
...
</>;
```
When big-AGI is served on Vercel hosts, the ```process.env.NEXT_PUBLIC_VERCEL_URL``` environment variable is trueish, and
analytics will be sent by default to the Vercel Analytics service which is deployed by Vercel IF configured from the
Vercel project dashboard.
In summary: to turn it on: activate the `Analytics` service in the Vercel project dashboard.
### Google Analytics 4
- Why: user engagement and retention, performance insights, personalization, content optimization
@@ -51,13 +26,55 @@ server/container will be able to report analytics to your Google Analytics 4 pro
As of Feb 27, 2024, this feature is in development.
### PostHog Analytics
- Why: feature usage tracking, user journeys, conversion optimization, product analytics
- What: page views, page leave events, user interactions, and deployment context
PostHog provides comprehensive product analytics with privacy controls. It helps understand how users interact with big-AGI's features, identify opportunities for improvement, and optimize the user experience.
To enable PostHog, set the `NEXT_PUBLIC_POSTHOG_KEY` environment variable at build time. PostHog is configured with tracking optimization and privacy in mind:
- Uses a proxy endpoint (`/a/ph`) to avoid ad blockers
- Respects user opt-out preferences via local storage
- Tracks only essential information without PII
- Adds deployment context for better segmentation
The implementation follows PostHog's best practices for Next.js applications and includes manual page view tracking for proper single-page application support.
### Vercel Analytics
- Why: understand coarse traction, and identify deployment issues - all without tracking individual users
- What: top pages, top referrers, country of origin, operating system, browser, and page speed metrics
Vercel Analytics and Speed Insights are local API endpoints deployed to your domain, so everything stays within your
domain. Furthermore, the Vercel Analytics service is privacy-friendly, and does not track individual users.
This service is avaialble to system administrators when deploying to Vercel. It is automatically enabled when deploying to Vercel.
The code that activates Vercel Analytics is located in the `pages/_app.tsx` file:
```tsx
const MyApp = ({ Component, emotionCache, pageProps }: MyAppProps) => <>
...
{Is.Deployment.VercelFromFrontend && <VercelAnalytics debug={false} />}
{Is.Deployment.VercelFromFrontend && <VercelSpeedInsights debug={false} sampleRate={1 / 2} />}
...
</>;
```
When big-AGI is served on Vercel hosts, the `process.env.NEXT_PUBLIC_VERCEL_URL` environment variable is trueish, and
analytics will be sent by default to the Vercel Analytics service which is deployed by Vercel IF configured from the
Vercel project dashboard.
In summary: to turn it on: activate the `Analytics` service in the Vercel project dashboard.
## Configurations
| Scope | Default | Description / Instructions |
|-----------------------------------------------------------------------------------------|------------------|-------------------------------------------------------------------------------------------------------------------------|
| Your source builds of big-AGI | None | **Vercel**: enable Vercel Analytics from the dashboard. · **Google Analytics**: set environment variable at build time. |
| Your docker builds of big-AGI | None | **Vercel**: n/a. · **Google Analytics**: set environment variable at `docker build` time. |
| [big-agi.com](https://big-agi.com) | Vercel + Google | The main website ([privacy policy](https://big-agi.com/privacy)) hosted for free for anyone. |
| [official Docker packages](https://github.com/enricoros/big-AGI/pkgs/container/big-agi) | Google Analytics | **Vercel**: n/a · **Google Analytics**: set to the big-agi.com Google Analytics for analytics and improvements. |
| Scope | Default | Description / Instructions |
|-------------------------------------------------------------------------------------------------------------------------|---------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Your **Source** builds of big-AGI | None | **Google Analytics**: set environment variable at build time · **PostHog**: set environment variable at build time · **Vercel**: enable Vercel Analytics from the dashboard |
| Your **Docker** builds of big-AGI | None | (**Vercel**: n/a) · **Google Analytics**: set environment variable at `docker build` time · **PostHog**: set environment variable at `docker build` time. |
| [get.big-agi.com](https://get.big-agi.com) (**Big-AGI 1.x Legacy**) | Vercel + Google + PostHog | The main website ([privacy policy](https://big-agi.com/privacy)) hosted for free for anyone. |
| [prebuilt Docker packages](https://github.com/enricoros/big-AGI/pkgs/container/big-agi) (**Big-AGI 1.x**, 'latest' tag) | Google Analytics | **Vercel**: n/a · **Google Analytics**: set to the big-agi.com Google Analytics for analytics and improvements · **PostHog**: n/a |
Note: this information is updated as of Feb 27, 2024 and can change at any time.
Note: this information is updated as of March 3, 2025 and can change at any time.
+11 -9
View File
@@ -1,18 +1,20 @@
---
unlisted: true
---
# Deploying a Next.js App on Cloudflare Pages
> WARNING: Cloudflare Pages does not support traditional NodeJS runtimes, but only Edge Runtime functions.
> WARNING: Cloudflare Pages only supports Edge Runtime functions, not the full Node.js runtime.
>
> In this project we use Prisma connected to serverless Postgres, which at the moment cannot run on
> edge functions, so we cannot deploy this project on Cloudflare Pages.
> The cloud router in this project requires a Node.js runtime for Supabase SDK, authentication,
> sync, and other server-side features that cannot run on Cloudflare's edge runtime.
>
> Workaround: Step 3.4. has been added below, to DELETE the NodeJS traditional runtime - which means that some
> Workaround: Step 3.4. has been added below, to DELETE the Node.js cloud router - which means that some
> parts of this application will not work.
> - [Side effects](https://github.com/enricoros/big-agi/blob/main/src/apps/chat/trade/server/trade.router.ts#L19):
> Sharing functionality to DB, and import from ChatGPT share, and post to Paste.GG will not work
> - [Side effects](https://github.com/enricoros/big-agi/blob/main/src/modules/trade/server/trade.router.ts):
> Sharing functionality, import from ChatGPT share, and post to Paste.GG will not work
> - Cloud features (sync, auth, payments) will not be available
> - See [Issue 174](https://github.com/enricoros/big-agi/issues/174).
>
> Longer term: follow [prisma/prisma: Support Edge Function deployments](https://github.com/prisma/prisma/issues/21394)
> and convert the Node runtime to Edge runtime once Prisma supports it.
This guide provides steps to deploy your Next.js app on Cloudflare Pages.
It is based on the [official Cloudflare developer documentation](https://developers.cloudflare.com/pages/framework-guides/deploy-a-nextjs-site/),
+6
View File
@@ -31,6 +31,12 @@ file.
### Official Images: [ghcr.io/enricoros/big-agi](https://github.com/enricoros/big-agi/pkgs/container/big-agi)
#### Available Tags
- **`:latest`** / **`:stable`** - Latest stable release (recommended)
- **`:development`** - Main branch (bleeding edge)
- **`:v2.0.0`** - Specific versions
#### Run using *docker* 🚀
```bash
@@ -19,7 +19,6 @@ services:
- .env
environment:
- PUPPETEER_WSS_ENDPOINT=ws://browserless:3000
command: [ "next", "start", "-p", "3000" ]
depends_on:
- browserless
-14
View File
@@ -1,14 +0,0 @@
# Why big-AGI?
Placeholder for a document that demonstrates the productivity and unique features of Big-AGI.
## Exclusive features
- [x] Call AGI
- [x] Continuous Voice mode
- [x] Diagram generation
- [ ] ...
## Productivity Features
- [x] Multi-window to never wait
- [x] Multi-Chat to explore different solutions
- [x] Rendering of graphs, charts, mindmaps
- [ ] ...
+35 -12
View File
@@ -3,7 +3,7 @@
This document provides an explanation of the environment variables used in the big-AGI application.
**All variables are optional**; and _UI options_ take precedence over _backend environment variables_,
which take place over _defaults_. This file is kept in sync with [`../src/server/env.mjs`](../src/server/env.mjs).
which take place over _defaults_. This file is kept in sync with [`../src/server/env.server.ts`](../src/server/env.server.ts).
### Setting Environment Variables
@@ -23,16 +23,24 @@ MDB_URI=
OPENAI_API_KEY=
OPENAI_API_HOST=
OPENAI_API_ORG_ID=
ALIBABA_API_HOST=
ALIBABA_API_KEY=
AZURE_OPENAI_API_ENDPOINT=
AZURE_OPENAI_API_KEY=
ANTHROPIC_API_KEY=
ANTHROPIC_API_HOST=
BEDROCK_BEARER_TOKEN=
BEDROCK_ACCESS_KEY_ID=
BEDROCK_SECRET_ACCESS_KEY=
BEDROCK_SESSION_TOKEN=
BEDROCK_REGION=
DEEPSEEK_API_KEY=
GEMINI_API_KEY=
GROQ_API_KEY=
LOCALAI_API_HOST=
LOCALAI_API_KEY=
MISTRAL_API_KEY=
MOONSHOT_API_KEY=
OLLAMA_API_HOST=
OPENPIPE_API_KEY=
OPENROUTER_API_KEY=
@@ -54,17 +62,18 @@ GOOGLE_CSE_ID=
ELEVENLABS_API_KEY=
ELEVENLABS_API_HOST=
ELEVENLABS_VOICE_ID=
# Text-To-Image: Prodia
PRODIA_API_KEY=
# Backend HTTP Basic Authentication (see `deploy-authentication.md` for turning on authentication)
HTTP_BASIC_AUTH_USERNAME=
HTTP_BASIC_AUTH_PASSWORD=
# Frontend variables
# Frontend variables
NEXT_PUBLIC_MOTD=
NEXT_PUBLIC_GA4_MEASUREMENT_ID=
NEXT_PUBLIC_GOOGLE_DRIVE_CLIENT_ID=
NEXT_PUBLIC_PLANTUML_SERVER_URL=
NEXT_PUBLIC_POSTHOG_KEY=
```
## Backend Variables
@@ -88,16 +97,27 @@ requiring the user to enter an API key
| `OPENAI_API_KEY` | API key for OpenAI | Recommended |
| `OPENAI_API_HOST` | Changes the backend host for the OpenAI vendor, to enable platforms such as Helicone and CloudFlare AI Gateway | Optional |
| `OPENAI_API_ORG_ID` | Sets the "OpenAI-Organization" header field to support organization users | Optional |
| `ALIBABA_API_HOST` | The Alibaba AI OpenAI-compatible endpoint | Optional |
| `ALIBABA_API_KEY` | The API key for Alibaba AI | Optional |
| `AZURE_OPENAI_API_ENDPOINT` | Azure OpenAI endpoint - host only, without the path | Optional, but if set `AZURE_OPENAI_API_KEY` must also be set |
| `AZURE_OPENAI_API_KEY` | Azure OpenAI API key, see [config-azure-openai.md](config-azure-openai.md) | Optional, but if set `AZURE_OPENAI_API_ENDPOINT` must also be set |
| `AZURE_OPENAI_DISABLE_V1` | Disables the next-generation v1 API for GPT-5-like models (set to 'true' to disable) | Optional, defaults to enabled |
| `AZURE_OPENAI_API_VERSION` | API version for traditional deployment-based endpoints | Optional, defaults to '2025-04-01-preview' |
| `AZURE_DEPLOYMENTS_API_VERSION` | API version for the deployments listing endpoint | Optional, defaults to '2023-03-15-preview' |
| `ANTHROPIC_API_KEY` | The API key for Anthropic | Optional |
| `ANTHROPIC_API_HOST` | Changes the backend host for the Anthropic vendor, to enable platforms such as AWS Bedrock | Optional |
| `ANTHROPIC_API_HOST` | Changes the backend host for the Anthropic vendor, for proxies or custom endpoints | Optional |
| `BEDROCK_BEARER_TOKEN` | Bedrock long-term API key (`ABSK...`). Takes priority over IAM credentials. Short-term keys only work for runtime, not model listing | Optional |
| `BEDROCK_ACCESS_KEY_ID` | AWS IAM Access Key ID for Bedrock (Claude models via AWS) | Optional, but if set `BEDROCK_SECRET_ACCESS_KEY` must also be set |
| `BEDROCK_SECRET_ACCESS_KEY` | AWS IAM Secret Access Key for Bedrock | Optional, but if set `BEDROCK_ACCESS_KEY_ID` must also be set |
| `BEDROCK_SESSION_TOKEN` | AWS Session Token for temporary/STS credentials | Optional |
| `BEDROCK_REGION` | AWS region for Bedrock (e.g., `us-east-1`, `us-west-2`, `eu-west-1`) | Optional, defaults to `us-east-1` |
| `DEEPSEEK_API_KEY` | The API key for Deepseek AI | Optional |
| `GEMINI_API_KEY` | The API key for Google AI's Gemini | Optional |
| `GROQ_API_KEY` | The API key for Groq Cloud | Optional |
| `LOCALAI_API_HOST` | Sets the URL of the LocalAI server, or defaults to http://127.0.0.1:8080 | Optional |
| `LOCALAI_API_KEY` | The (Optional) API key for LocalAI | Optional |
| `MISTRAL_API_KEY` | The API key for Mistral | Optional |
| `MOONSHOT_API_KEY` | The API key for Moonshot AI | Optional |
| `OLLAMA_API_HOST` | Changes the backend host for the Ollama vendor. See [config-local-ollama.md](config-local-ollama.md) | |
| `OPENPIPE_API_KEY` | The API key for OpenPipe | Optional |
| `OPENROUTER_API_KEY` | The API key for OpenRouter | Optional |
@@ -123,12 +143,11 @@ Enable the app to Talk, Draw, and Google things up.
| Variable | Description |
|:---------------------------|:------------------------------------------------------------------------------------------------------------------------|
| **Text-To-Speech** | [ElevenLabs](https://elevenlabs.io/) is a high quality speech synthesis service |
| **Text-To-Speech** | ElevenLabs, Inworld, OpenAI TTS, LocalAI, and browser Web Speech API are supported |
| `ELEVENLABS_API_KEY` | ElevenLabs API Key - used for calls, etc. |
| `ELEVENLABS_API_HOST` | Custom host for ElevenLabs |
| `ELEVENLABS_VOICE_ID` | Default voice ID for ElevenLabs |
| **Text-To-Image** | [Prodia](https://prodia.com/) is a reliable image generation service |
| `PRODIA_API_KEY` | Prodia API Key - used with '/imagine ...' |
| | *Note: OpenAI TTS and LocalAI TTS reuse credentials from your configured LLM services (no separate env vars needed)* |
| **Google Custom Search** | [Google Programmable Search Engine](https://programmablesearchengine.google.com/about/) produces links to pages |
| `GOOGLE_CLOUD_API_KEY` | Google Cloud API Key, used with the '/react' command - [Link to GCP](https://console.cloud.google.com/apis/credentials) |
| `GOOGLE_CSE_ID` | Google Custom/Programmable Search Engine ID - [Link to PSE](https://programmablesearchengine.google.com/) |
@@ -142,10 +161,14 @@ Enable the app to Talk, Draw, and Google things up.
The value of these variables are passed to the frontend (Web UI) - make sure they do not contain secrets.
| Variable | Description |
|:----------------------------------|:-----------------------------------------------------------------------------------------|
| `NEXT_PUBLIC_GA4_MEASUREMENT_ID` | The measurement ID for Google Analytics 4. (see [deploy-analytics](deploy-analytics.md)) |
| `NEXT_PUBLIC_PLANTUML_SERVER_URL` | The URL of the PlantUML server, used for rendering UML diagrams. (code in RederCode.tsx) |
| Variable | Description |
|:----------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| `NEXT_PUBLIC_DEBUG_BREAKS` | (optional, development) When set to 'true', enables automatic debugger breaks on DEV/error/critical logs in development builds |
| `NEXT_PUBLIC_MOTD` | Message of the Day - displays a dismissible banner at the top of the app (see [customizations](customizations.md) for the template variables). Example: 🔔 Welcome to our deployment! Version {{app_build_pkgver}} built on {{app_build_time}}. |
| `NEXT_PUBLIC_GA4_MEASUREMENT_ID` | (optional) The measurement ID for Google Analytics 4. (see [deploy-analytics](deploy-analytics.md)) |
| `NEXT_PUBLIC_GOOGLE_DRIVE_CLIENT_ID` | (optional) Google OAuth Client ID for Drive Picker. Can reuse `AUTH_GOOGLE_ID`. See [Google Drive](config-feature-google-drive.md) |
| `NEXT_PUBLIC_PLANTUML_SERVER_URL` | The URL of the PlantUML server, used for rendering UML diagrams. Allows using custom local servers. |
| `NEXT_PUBLIC_POSTHOG_KEY` | (optional) Key for PostHog analytics. (see [deploy-analytics](deploy-analytics.md)) |
> Important: these variables must be set at build time, which is required by Next.js to pass them to the frontend.
> This is in contrast to the backend variables, which can be set when starting the local server/container.
+4
View File
@@ -1,3 +1,7 @@
---
unlisted: true
---
# Big-AGI Advanced Tips & Tricks
> 🚨 This file is not meant for publication, and it's just been created as a handbook with tips
+126
View File
@@ -0,0 +1,126 @@
# Big-AGI Data Ownership Guide
Big-AGI is a **client-first** web application, which means it prioritizes speed and data ownership compared to cloud apps.
Your *API keys*, *chat history*, and *settings* live in your
browser's [local storage](https://developer.mozilla.org/en-US/docs/Web/API/Window/localStorage), not
on cloud servers.
You can use Big-AGI in two ways:
1. Run it yourself (open-source)
2. Use big-agi.com (hosted service)
This guide explains how the open-source version handles your data. You can verify everything in [the source code](https://github.com/enricoros/big-agi).
## Client-Side Storage
Within Big-AGI almost all chat/keys data is handled client-side in your browser using two
standard browser storage mechanisms:
- **Local Storage**: API keys, settings, and configurations ([learn more](https://developer.mozilla.org/en-US/docs/Web/API/Window/localStorage))
- **IndexedDB**: Chat history and larger files ([learn more](https://developer.mozilla.org/en-US/docs/Web/API/IndexedDB_API))
The Big-AGI backend mainly passes requests to AI services (OpenAI, Anthropic, etc.). It doesn't store your data, except for the chat-sharing function if used.
You can see your data in your browser's local storage and IndexedDB - try it yourself:
1. In Chrome: Open DevTools (press F12 on Windows, ⌘ + ⌥ + I on Mac)
2. Click 'Application' > 'Local Storage'
3. See your settings and API keys
![Browser local storage showing API keys and chat data](pixels/data_ownership_local_storage.png)
### Sync for Authenticated Users
Users with accounts on big-agi.com who opt into Sync (a Pro feature) have their entity data - such as conversations and personas - replicated to the server for multi-device access.
Server-side data is isolated per-user using Row Level Security (RLS), ensuring that no other user can access your synced data.
Sync is entirely optional; without it, all data remains local to your browser.
### What This Means For You
Storing data in your browser means:
- Your data stays on **one device/browser only**
- Clearing browser data **erases your chats** - make backups
- Anyone using your browser can see your chats and keys
- Running your own server needs technical skills
### Local Device Identifier
Big-AGI generates a _device identifier_ that combines timestamp and random components, stored only on your device. This identifier:
- Is used only for the **optional sync functionality** between your devices
- Helps maintain data consistency when using Big-AGI across multiple devices
- Remains completely local unless you explicitly enable sync
- Is not used for tracking, analytics, or telemetry
- Can be deleted anytime by clearing local storage
- Is fully transparent - see the implementation in `src/common/stores/store-client.ts`
## How Data Flows
AI interactions in Big-AGI, such as chats, AI titles, text to speech, browsing, flow through three components:
1. **Browser** (client/installed App) - Stores your keys & data locally
2. **Backend** (routing server) - Passes requests to AI services
3. **AI Services** - Where the actual AI processing happens
### Self-Deployed Version: Your Infrastructure
You run the server. Your data only leaves when making AI requests.
The keys and chats are under your control and pass through your code, and are sent to
the upstream AI services on a per-request basis.
![data_ownership_local.png](pixels/data_ownership_deployed.png)
### Web Version: Using big-agi.com
Your data passes through the hosted Big-AGI edge network to reach AI services. The keys
and chats pass through Big-AGI's edge network to reach the AI services on a per-request basis,
and then are send to the upstream AI services.
![data_ownership_hosted.png](pixels/data_ownership_hosted.png)
### Direct Connection (Browser → AI Service)
Most AI services offer a **Direct Connection** toggle (under a service's Advanced settings). When enabled, the browser calls the AI provider's API directly, skipping the Big-AGI server entirely.
Benefits:
- **No 4.5 MB upload limit** - the Vercel body-size cap does not apply, so larger attachments and long prompts go through.
- **No 300-second timeout** - the Vercel function timeout does not apply, so long-running generations keep streaming.
- **More privacy** - connection metadata (IP, timestamp, edge region, Vercel telemetry) is not observable by the Big-AGI edge server.
Tradeoff:
- **Slightly more downlink bandwidth**: when traffic passes through the Big-AGI edge, repetitive streaming frames are compacted; direct streams arrive verbatim from the provider.
Availability requires both:
1. The API key is set in your browser (client-side), not via server environment variables. Server-key deployments cannot use Direct Connection because the browser has no credential to send.
2. The AI service allows CORS (browser-origin requests). Most major providers do; Big-AGI sets any extra headers they require.
Direct Connection is a net win on speed, limits, and privacy whenever the provider permits it.
## Security Best Practices
**Basic Security**:
- **Never share API keys**
- **Don't use shared computers**
- Use private browsing for one-off sessions
- Use trusted networks
- Back up your data
**When Running Your Own Server**:
- Use [environment variables](environment-variables.md) for API keys
- Run on trusted infrastructure
- Keep your installation updated
## TL;DR
Your API keys and chats stay in your browser. The server only passes requests to AI services.
Use big-agi.com for convenience, or [run it yourself](installation.md) for full control.
Need help? Join our [Discord](https://discord.gg/MkH4qj2Jp9) or open a [GitHub issue](https://github.com/enricoros/big-agi/issues).
+48
View File
@@ -0,0 +1,48 @@
# Frequently Asked Questions
Quick answers to common questions about Big-AGI. For detailed documentation, see our [Website Docs](https://big-agi.com/docs).
### Connectivity
<details open>
<summary><b>What is "Direct Connection" and should I enable it?</b></summary>
Direct Connection lets the browser call the AI provider's API directly, skipping the Big-AGI edge server. It appears as a toggle in each AI service's Advanced settings when your API key is set client-side.
**When available, it is a net win**: faster, fewer restrictions, more privacy.
- **No 4.5 MB upload limit** (Vercel body-size cap does not apply).
- **No 300-second timeout** (Vercel function timeout does not apply; call length is bound only by the AI service).
- **More privacy** - connection metadata (IP, timestamp, edge region, Vercel telemetry) is not observable by the Big-AGI edge server.
- **Slightly more downlink bandwidth** - when passing through the edge, Big-AGI sheds repetitive streaming frames; direct streams arrive verbatim.
**When it is unavailable**:
1. **Server-side keys** - if the deployment stores API keys in server environment variables, the browser has no credential to send directly.
2. **Provider does not allow CORS** - browsers cannot call APIs that block cross-origin requests. Most major providers permit it; Big-AGI sets any required headers.
</details>
### Versions
<details open>
<summary><b>How do I check my Big-AGI version?</b></summary>
You can see the version in the _News_ section of the app, as per the image below.
![Version location in Big-AGI](https://github.com/user-attachments/assets/cd295094-0114-420f-a5b9-0d762e59b506)
</details>
<details open>
<summary><b>How do I verify my Vercel deployment version?</b></summary>
You can go in the **deployments** section of your Vercel project, and at a quick glance see
what is the latest deployment status, time, and link to the source code.
![Vercel deployments view](https://github.com/user-attachments/assets/664b8c3d-496e-4595-ad5e-898bdb82507c)
Each deployment links directly to its source code commit.
</details>
---
Missing something? [Open an issue](https://github.com/enricoros/big-agi/issues/new) or [join our Discord](https://discord.gg/MkH4qj2Jp9).
+4 -10
View File
@@ -7,7 +7,7 @@ process for your own instance of big-AGI and related products.
**Try big-AGI** - You don't need to install anything if you want to play with big-AGI
and have your API keys to various model services. You can access our free instance on [big-AGI.com](https://big-agi.com).
The free instance runs the latest `main-stable` branch from this repository.
The free instance runs the latest `main` branch from this repository.
## 🧩 Build-your-own
@@ -72,9 +72,8 @@ Create your GitHub fork, create a Vercel project over that fork, and deploy it.
### Deploy on Cloudflare
Deploy on Cloudflare's global network by installing big-AGI on
Cloudflare Pages. Check out the [Cloudflare Installation Guide](deploy-cloudflare.md)
for step-by-step instructions.
> Note: Cloudflare Pages deployment has limitations due to Edge Runtime constraints.
> See the [Cloudflare guide](deploy-cloudflare.md) for details and known issues.
### Docker Deployments
@@ -136,11 +135,6 @@ Deploy big-AGI on a Kubernetes cluster for enhanced scalability and management.
For more detailed instructions on Kubernetes deployment, including updating and troubleshooting, refer to our [Kubernetes Deployment Guide](deploy-k8s.md).
### Midori AI Subsystem for Docker Deployment
Follow the instructions found on [Midori AI Subsystem Site](https://io.midori-ai.xyz/subsystem/manager/)
for your host OS. After completing the setup process, install the Big-AGI docker backend to the Midori AI Subsystem.
## Enterprise-Grade Installation
For businesses seeking a fully-managed, scalable solution, consider our managed installations.
@@ -151,6 +145,6 @@ Enjoy all the features of big-AGI without the hassle of infrastructure managemen
Join our vibrant community of developers, researchers, and AI enthusiasts. Share your projects, get help, and collaborate with others.
- [Discord Community](https://discord.gg/MkH4qj2Jp9)
- [Twitter](https://twitter.com/yourusername)
- [X (Twitter)](https://x.com/enricoros)
For any questions or inquiries, please don't hesitate to [reach out to our team](mailto:hello@big-agi.com).
+3 -3
View File
@@ -16,6 +16,8 @@ stringData:
OPENAI_API_KEY: ""
OPENAI_API_HOST: ""
OPENAI_API_ORG_ID: ""
ALIBABA_API_HOST: ""
ALIBABA_API_KEY: ""
AZURE_OPENAI_API_ENDPOINT: ""
AZURE_OPENAI_API_KEY: ""
ANTHROPIC_API_KEY: ""
@@ -26,6 +28,7 @@ stringData:
LOCALAI_API_HOST: ""
LOCALAI_API_KEY: ""
MISTRAL_API_KEY: ""
MOONSHOT_API_KEY: ""
OLLAMA_API_HOST: ""
OPENPIPE_API_KEY: ""
OPENROUTER_API_KEY: ""
@@ -44,6 +47,3 @@ stringData:
ELEVENLABS_API_KEY: ""
ELEVENLABS_API_HOST: ""
ELEVENLABS_VOICE_ID: ""
# Text-To-Image: Prodia
PRODIA_API_KEY: ""
Binary file not shown.

After

Width:  |  Height:  |  Size: 55 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 62 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 234 KiB

+4
View File
@@ -1,3 +1,7 @@
---
unlisted: true
---
# ReAct: question answering with Reasoning and Actions
## What is ReAct?
+23
View File
@@ -0,0 +1,23 @@
import { defineConfig } from "eslint/config";
import path from "node:path";
import { fileURLToPath } from "node:url";
import js from "@eslint/js";
import { FlatCompat } from "@eslint/eslintrc";
const __filename = fileURLToPath(import.meta.url);
const __dirname = path.dirname(__filename);
const compat = new FlatCompat({
baseDirectory: __dirname,
recommendedConfig: js.configs.recommended,
allConfig: js.configs.all
});
export default defineConfig([{
extends: compat.extends("next/core-web-vitals"),
rules: {
//
"react-hooks/exhaustive-deps": ["warn", {
additionalHooks: "(useMemoShallowStable)",
}],
},
}]);
+39
View File
@@ -0,0 +1,39 @@
## Knowledge Base
Architecture and system documentation is available in the `/kb/` knowledge base, for use by AI agents and developers.
**Structure:**
- `/kb/KB.md` - Already in context: this text
- `/kb/vision-inlined.md` - Already in context (next section): long-term vision and north stars
- `/kb/modules/` - Core business logic (e.g. AIX)
- `/kb/systems/` - Infrastructure (routing, startup)
### Modules Documentation
#### AIX - AI Communication Framework
- **[AIX.md](modules/AIX.md)** - AIX streaming architecture documentation
- **[AIX-callers-analysis.md](modules/AIX-callers-analysis.md)** - Analysis of AIX entry points, call chains, common and different rendering, error handling, etc.
#### CSF - Client-Side Fetch
- **[CSF.md](systems/client-side-fetch.md)** - Direct browser-to-API communication for LLM requests
### Systems Documentation
#### Core Platform Systems
- **[app-routing.md](systems/app-routing.md)** - Next.js routing, provider stack, and display state hierarchy
- **[LLM-parameters-system.md](systems/LLM-parameters-system.md)** - Language model parameter flow across the system
- **[LLM-vendor-integration.md](modules/LLM-vendor-integration.md)** - Adding new LLM providers
### KB Guidelines
#### Writing Style
- **Direct and factual** - No marketing language
- **Present tense** - "AIX handles streaming" not "AIX will handle"
- **Active voice** - "The system processes" not "Processing is done by"
- **Concrete examples** - Show actual code/config when helpful, briefly
#### Maintenance
- Remove outdated knowledge base information when detected
- Keep cross-references current when files move
+145
View File
@@ -0,0 +1,145 @@
# AIX Chat Generation Calls Analysis
This document analyzes all AIX function callers and their patterns for message removal, placeholder handling, and error management.
## AIX Function Architecture
### Three-Tier Call Hierarchy
**Core AIX Functions** (Direct tRPC API callers):
- `aixChatGenerateContent_DMessage_FromConversation` - 9 callers (conversation streaming)
- `aixChatGenerateContent_DMessage_orThrow` - 6 callers (direct request/response)
- `aixChatGenerateText_Simple` - 12 callers (text-only utilities)
**Utility Layer** (Hooks & Functions):
- Conversation management, persona processing, content generation utilities
**UI Layer** (React Components):
- User-facing interfaces with rich error states and fallback mechanisms
## Core Function Callers Analysis
### Conversation-Based Callers (`_FromConversation`)
| **Caller** | **Context** | **Message Removal** | **Placeholder** | **Error Handling** |
|------------|-------------|-------------------|----------------|-------------------|
| **Chat Persona** | `'conversation'` | `messageWasInterruptedAtStart()``removeMessage()` | None | Error fragments |
| **XE Chat Generate** | `'conversation'` | `messageWasInterruptedAtStart()``removeMessage()` | `'...'` placeholder | Error fragments via messageEditor |
| **Beam Scatter** | `'beam-scatter'` | `messageWasInterruptedAtStart()` → empty message | `SCATTER_PLACEHOLDER` | Ray status update |
| **Beam Gather** | `'beam-gather'` | `messageWasInterruptedAtStart()` → clear fragments | `GATHER_PLACEHOLDER` | Re-throw errors |
| **Beam Follow-up** | `'beam-followup'` | `messageWasInterruptedAtStart()` → remove message | `FOLLOWUP_PLACEHOLDER` | Status updates |
| **ScratchChat** | `'scratch-chat'` | `aborted && !fragments` → array removal | `SCRATCH_CHAT_PLACEHOLDER` | Error fragments |
| **Telephone** | `'call'` | None | None | Basic handling |
| **ReAct Agent** | `'chat-react-turn'` | None | None | Append errors |
| **Variform** | `'_DEV_'` | None | None | Throw errors |
### Direct Request Callers (`aixChatGenerateContent_DMessage`)
| **Caller** | **Context** | **Message Removal** | **Error Handling** |
|------------|-------------|-------------------|-------------------|
| **Auto Follow-ups** | `'chat-followup-*'` | `fragmentDelete()` on failure | `fragmentReplace()` with error |
| **Gen CR Diffs** | `'aifn-gen-cr-diffs'` | None | State-based handling |
| **Code Fixup** | `'fixup-code'` | None | Throw errors |
| **Attachment Prompts** | `'chat-attachment-prompts'` | None | Throw errors |
### Text-Only Utilities (`aixChatGenerateText_Simple`)
| **Utility** | **Purpose** | **Error Strategy** | **Called By** |
|-------------|-------------|-------------------|---------------|
| **conversationTitle** | Auto-generate chat titles | Try/catch with fallback | UI components |
| **conversationSummary** | Generate summaries | Try/catch with fallback | Chat drawer |
| **useStreamChatText** | Generic text streaming | Error state management | FlattenerModal |
| **useLLMChain** | Multi-step processing | Step-by-step handling | Persona creation |
| **imaginePromptFromText** | Text → image prompts | Simple propagation | Image generation |
| **aifnBeamGenerateBriefing** | Beam summaries | Null return on error | Beam completion |
| **useAifnPersonaGenIdentity** | Extract persona identity | Query error handling | Persona flows |
| **DiagramsModal** | Generate diagrams | Component error state | Manual generation |
## Message Removal Patterns
### 1. Complete Message Removal
- **Chat Persona**: `messageWasInterruptedAtStart()``messageEditor.removeMessage()`
- **ScratchChat**: `outcome === 'aborted' && !fragments?.length` → array removal
- **Trigger**: Message aborted before any content generated
### 2. Fragment-Level Management
- **Beam Gather**: Clear fragments array but keep message structure
- **Auto Follow-ups**: Delete specific placeholder fragments on failure
- **Purpose**: Maintain message structure while removing failed content
### 3. Empty Message Replacement
- **Beam Scatter**: Replace with `createDMessageEmpty()` but preserve ray structure
- **Purpose**: Keep UI structure intact while indicating failure
### 4. No Removal Strategy
- **Text-only functions**: Use fallback values, error states, or null returns
- **Simple callers**: Propagate errors upstream for handling
## Error Handling by Layer
### UI Layer (Components)
- **Pattern**: Rich error states with user-facing messages
- **Examples**: DiagramsModal, FlattenerModal
- **Features**: Retry mechanisms, fallback UI, loading states
### Utility Layer (Hooks/Functions)
- **Pattern**: Graceful degradation with fallbacks
- **Examples**: conversationTitle, conversationSummary
- **Features**: Silent failures, default values, try/catch blocks
### Core Layer (Direct API)
- **Pattern**: Minimal handling, error propagation
- **Examples**: Code Fixup, Attachment Prompts
- **Features**: Assumes upstream error handling
## Key Implementation Details
### Message Removal Detection
```typescript
// Core detection logic
function messageWasInterruptedAtStart(message: Pick<DMessage, 'generator' | 'fragments'>): boolean {
return message.generator?.tokenStopReason === 'client-abort' && message.fragments.length === 0;
}
```
### Placeholder Management
- **Initialization**: `createPlaceholderVoidFragment(placeholderText)`
- **Replacement**: During streaming updates or on completion
- **Cleanup**: Delete on error to avoid stale content
### Context Patterns
- **Production**: `'conversation'`, `'beam-scatter'`, `'scratch-chat'`
- **Features**: `'chat-followup-*'`, `'fixup-code'`, `'ai-diagram'`
- **Development**: `'_DEV_'`
## Best Practices
### Message Removal
- Use `messageWasInterruptedAtStart()` for consistent detection
- Only remove messages with no content that were client-aborted
- Consider UI context when choosing removal vs. clearing strategy
### Error Handling
- **Fragment-level**: Use `messageEditor.fragmentReplace()` with error fragments
- **Message-level**: Use `messageEditor.removeMessage()` or array removal
- **Status-level**: Update component state for UI feedback
### Placeholder Management
- Initialize with descriptive placeholders using `createPlaceholderVoidFragment()`
- Replace during streaming updates
- Clean up on error to prevent stale content
## Architectural Insights
1. **Layered Error Handling**: Sophistication increases closer to UI
2. **Context Specialization**: Different contexts for different use cases
3. **Streaming vs Non-Streaming**: Conversation functions stream, utilities typically don't
4. **Message vs Fragment Management**: Different strategies for different UI needs
The most sophisticated handling is in **Beam modules** and **Chat Persona** with comprehensive removal logic, while simpler callers rely on upstream error handling.
## Code References
- **Core function**: `src/modules/aix/client/aix.client.ts:aixChatGenerateContent_DMessage_FromConversation`
- **Removal check**: `src/common/stores/chat/chat.message.ts:388:messageWasInterruptedAtStart()`
- **Placeholder creation**: `src/common/stores/chat/chat.fragments.ts:createPlaceholderVoidFragment()`
+190
View File
@@ -0,0 +1,190 @@
# AIX
AIX is a client/server library for integrating advanced AI capabilities into web applications.
## Overview
AIX provides real-time, type-safe communication between a Typescript application and AI providers.
Built with tRPC, it manages the lifecycle of AI-generated content from request to rendering, supporting both streaming and non-streaming AI providers.
## Features
- Content Generation
- Multi-Modal streaming/non-streaming
- Throttled batching and error handling
- Server-side timeout/retry
- Function Calling and Code Execution
- Complex AI Workflows (future)
- Embeddings / Information Retrieval / Image Manipulation (future)
## AIX Providers support
| Service | Chat | Function Calling | Multi-Modal Input | Cont. (1) | Streaming | Idiosyncratic |
|------------|------------|------------------|-------------------|-----------|-----------|---------------|
| Alibaba | ✅ | ✅ | | ✅ | Yes + 📦 | |
| Anthropic | ✅ | ✅ + Parallel | Img: ✅ | ✅ | Yes + 📦 | |
| Azure | ✅ | ✅ | | ✅ | Yes + 📦 | |
| Deepseek | ✅ | ❌ (rejected) | | ✅ | Yes + 📦 | |
| Gemini | ✅ | ✅ + Parallel | Img: ✅ | ✅ | Yes + 📦 | Code ex.: ✅ |
| Groq | ✅ | ✅ + Parallel | | ✅ | Yes + 📦 | |
| LM Studio | ✅ | ❌ (not working) | | ❌ | Yes + 📦 | |
| Local AI | ✅ | ✅ | | ❌ | Yes + 📦 | |
| Mistral | ✅ | ✅ | | ✅ | Yes + 📦 | |
| OpenAI | ✅ | ✅ + Parallel | Img: ✅ | ✅ | Yes + 📦 | |
| OpenPipe | ✅ | ✅ | Img: ✅ | ✅ | Yes + 📦 | |
| OpenRouter | ✅ | ❌ (inconsistent) | | ✅ | Yes + 📦 | |
| Perplexity | ✅ | ❌ (rejected) | | ✅ | Yes + 📦 | |
| TogetherAI | ✅ | ✅ | | ✅ | Yes + 📦 | |
| xAI | | | | | | |
| Z.ai | ✅ | ✅ | Img: ✅ | ✅ | Yes + 📦 | Thinking mode |
| Ollama (2) | ❌ (broken) | ? | | | | |
Notes:
- 1: Continuation marks: a. sends reason=max-tokens (streaming/non-streaming), b. TBA
- 2: Ollama has not been ported to AIX yet due to the custom APIs.
## 1. System Architecture
The subsystem comprises three main components:
1. **Client (e.g. Next.js Frontend)**
- Initiates requests
- Renders AI-generated content in real-time
- Reconstructs streamed data
2. **Server (e.g. Next.js Backend)**
- Acts as an intermediary between client and AI providers
- Handles request preparation, dispatching, and response processing
- Streams responses back to the client
3. **Upstream AI Providers**
- Generate AI content based on requests
### ChatGenerate workflow:
1. Request Initialization: AIX Client prepares and sends request (systemInstruction, messages=AixWire_Parts[], etc.) to AIX Server
2. Dispatch Preparation: AIX Server prepares for upstream communication
3. AI Provider Interaction: AIX Server communicates with AI Provider (streaming or non-streaming)
4. Data Decoding, Transformation and Transmission: AIX Server sends AixWire_Particles to AIX Client
5. Client-side Processing: Client's ContentReassembler processes AixWire_Particles into a list (likely a single) of multi-fragment (DMessageContentFragment[]) messages
6. Completion: AIX Server sends 'done' control message, AIX Client finalizes data update
7. Error Handling: AIX Server sends specific error messages when necessary
## 2. Files and Folders
AIX is organized into the following files and folders:
1. Client-Side (`/client/`):
- `aix.client.ts`: Main client-side entry point for AIX operations.
- `aix.client.chatGenerateRequest.ts`: Handles conversion of chat messages to AIX-compatible format (AixWire_Content, AixWire_Parts, etc.).
2. Server-Side (`/server/`):
- API (`/server/api/`) - Client to Server communication:
- `aix.router.ts`: Defines the tRPC router for AIX operations.
- `aix.wiretypes.ts`: Contains Zod schemas for types and calls incoming from the client (AixWire_Parts, AixWire_Content, AixWire_Tooling, AixWire_API, ...), and outgoing (AixWire_Particles)
- Dispatch (`/server/dispatch/`) - Server to AI Provider communication:
- `/server/dispatch/chatGenerate/`: Content Generation with chat-style inputs:
- `./adapters/`: Adapters for creating API requests for different AI protocols (Anthropic, Bedrock, Gemini, OpenAI Chat Completions, OpenAI Responses, xAI Responses).
- `./parsers/`: Parsers for parsing streaming/non-streaming responses from different AI protocols (Anthropic, Bedrock Converse, Gemini, OpenAI, OpenAI Responses).
- `chatGenerate.dispatch.ts`: Creates a pipeline to execute Chat Generation to a specific provider.
- `ChatGenerateTransmitter.ts`: Used to serialize and transmit AixWire_Particles to the client.
- `/server/dispatch/wiretypes/`: AI provider Wire Types:
- Type definitions for different AI providers/protocols (Anthropic, Bedrock Converse, Gemini, OpenAI, xAI).
- `stream.demuxers.ts`: Handles demuxing of different stream formats.
## 3. Architecture Diagram
```mermaid
sequenceDiagram
participant AIX Client
participant AIX Server
participant PartTransmitter
participant AI Provider
AIX Client ->> AIX Client: Initialize ContentReassembler
AIX Client ->> AIX Client: Convert DMessage*Part to AixWire_Parts
AIX Client ->> AIX Server: Send messages (arrays of AixWire_Parts)
AIX Server ->> AIX Server: Prepare Dispatch (Upstream request, demux, parsing)
alt Dispatch Preparation Error
AIX Server ->> AIX Client: Send `dispatch-prepare` error message
else Dispatch Fetch
AIX Server ->> AI Provider: Send AI-provider specific stream/non-stream request
AIX Server ->> AIX Client: Send 'start' control message
AIX Server ->> PartTransmitter: Initialize part particle serialization
alt Streaming AI Provider
loop Until stream end or error
AI Provider ->> AIX Server: Stream response chunk
AIX Server ->> AIX Server: Demux chunk into DispatchEvents
loop For each AI-provider specific DispatchEvent
AIX Server ->> AIX Server: Parse DispatchEvent
AIX Server ->> PartTransmitter: (Parser) Calls serialization functions
PartTransmitter ->> PartTransmitter: Generate and throttle AixWire_PartParticles
PartTransmitter -->> AIX Server: Yield AixWire_PartParticle
end
AIX Server ->> AIX Client: Send accumulated AixWire_PartParticles
end
AIX Server ->> PartTransmitter: Request any remaining particles
PartTransmitter -->> AIX Server: Yield any final AixWire_PartParticles
AIX Server ->> AIX Client: Send final AixWire_PartParticles (if any)
else Non-Streaming AI Provider
AI Provider ->> AIX Server: Send AI-provider specific complete response
alt AI-provider specific full-response parser
AIX Server ->> AIX Server: Parse full response
AIX Server ->> PartTransmitter: Call particle serialization functions
PartTransmitter ->> PartTransmitter: Generate AixWire_PartParticle
PartTransmitter -->> AIX Server: Yield ALL AixWire_PartParticle
end
AIX Server ->> AIX Client: Send all AixWire_PartParticles
end
AIX Server ->> AIX Client: Send 'done' control message
loop For each received batch of particles
AIX Client ->> AIX Client: ContentReassembler processes particles into DMessage*Part
alt DMessageTextPart
AIX Client ->> AIX Client: Update UI with text content
else DMessageImageRefPart
AIX Client ->> AIX Client: Load and display image
else DMessageToolInvocationPart
AIX Client ->> AIX Client: Process tool invocation (dev only)
else DMessageToolResponsePart
AIX Client ->> AIX Client: Process tool response (dev only)
else DMessageErrorPart
AIX Client ->> AIX Client: Display error message
else DMessageDocPart
AIX Client ->> AIX Client: Process and display document
else DVoidPlaceholderPart
AIX Client ->> AIX Client: Handle placeholder (non-submitted)
end
end
AIX Client ->> AIX Client: Finalize data update
end
alt Error Handling
AIX Server ->> AIX Client: Send 'error' specific control messages
end
note over AIX Server, AI Provider: Server-side Timeout/Retry mechanism
loop Retry on timeout (server-side)
AIX Server ->> AI Provider: Retry request
end
note over AIX Client: Client-side Timeout mechanism
AIX Client ->> AIX Client: Timeout if no response received within set time
```
---
### 2025-03-14 Update
AIX is used in production in Big-AGI and is stable and performant.
The code is tightly coupled with the tRPC framework and the rest of our codebase,
so it is not recommended to use it outside of our ecosystem.
For a great Typescript alternative we recommend the Vercel AI SDK.
+126
View File
@@ -0,0 +1,126 @@
# LLM Vendor Integration Guide
How to add support for new LLM providers in Big-AGI. There are two integration paths, and
the dynamic backend path is strongly preferred for new vendors.
## Integration Paths
### Path 1: Dynamic Backend (preferred)
For any provider with an **OpenAI-compatible API** (which is nearly all new providers).
**Surface area**: 1-2 files, no UI changes, no registry changes.
A dynamic backend provides:
- Hostname-based auto-detection when the user adds the provider's API URL
- Automatic model list parsing with vendor-specific metadata (pricing, context windows, capabilities)
- Zero UI code - uses the existing "Custom OpenAI-compatible" service setup
**Files touched**:
- `src/modules/llms/server/openai/models/{vendor}.models.ts` (required) - model definitions + hostname heuristic
- `src/modules/llms/server/openai/wiretypes/{vendor}.wiretypes.ts` (optional) - Zod schemas for vendor-specific wire format
- `src/modules/llms/server/listModels.dispatch.ts` - add heuristic to the detection chain (2 lines)
**What the model file must export**:
```typescript
// 1. Hostname heuristic - returns true when the user's API URL matches this vendor
export function vendorHeuristic(hostname: string): boolean {
return hostname.includes('.vendor-domain.com');
}
// 2. Model converter - transforms vendor's /v1/models response to ModelDescriptionSchema[]
export function vendorModelsToModelDescriptions(wireModels: unknown): ModelDescriptionSchema[] {
// Parse wire format, map to ModelDescriptionSchema with:
// - id, label, description
// - contextWindow, maxCompletionTokens
// - interfaces (Chat, Vision, Fn, Reasoning, etc.)
// - chatPrice (input/output per token)
// - parameterSpecs (temperature, etc.)
}
```
**Existing examples**: `novita.models.ts`, `chutesai.models.ts`, `fireworksai.models.ts`
MUST also provide the updated vendor icon like other icons in `src/common/components/icons/vendors/`.
Make sure all the information is available if in the future we want to promote those to full registered vendors.
### Path 2: Registered Vendor (heavyweight, discouraged for new providers)
Full first-class integration with dedicated UI, own dialect, and registry entry. Reserved for
providers with **non-OpenAI protocols** (Anthropic, Gemini, Ollama) or providers with enough
user demand to warrant a dedicated setup flow.
**Surface area**: 5+ files across 3 directories.
**Files touched**:
- `src/modules/llms/vendors/{vendor}/{vendor}.vendor.ts` - IModelVendor implementation
- `src/modules/llms/vendors/{vendor}/{VendorName}ServiceSetup.tsx` - React UI setup component
- `src/modules/llms/vendors/vendors.registry.ts` - registry entry + ModelVendorId union
- `src/modules/llms/server/openai/models/{vendor}.models.ts` - model definitions
- `src/modules/llms/server/listModels.dispatch.ts` - dispatch case
- Possibly server protocol adapter if not OpenAI-compatible
- Possibly more files, e.g. wires, etc.
- See existing providers and commits that added them for full scope
**When to use this path**: Only when the provider has a meaningfully different API protocol
(not OpenAI-compatible), or when there is significant user demand AND the provider offers
unique capabilities that benefit from dedicated UI (e.g., Ollama's local model management).
When using this path, please add links to upstream documentation. Make sure all constants
are correctly handled everywhere, especially for provider-based switches.
## Decision Criteria
| Question | Dynamic | Registered |
|----------|---------|------------|
| OpenAI-compatible API? | Yes - use dynamic | Only if not OAI-compatible |
| Needs custom auth UI? | No - uses generic fields | Yes - custom setup form |
| Unique protocol? | No | Yes (Anthropic, Gemini, Ollama) |
| User demand level | Any | High + sustained |
| Maintenance burden | Minimal | Significant (5+ files) |
## For External Contributors / Vendor Requests
When vendors or community members request integration via GitHub issues:
1. **Point them to the dynamic backend path** - it's faster to implement, review, and maintain
2. **Requirements for a dynamic backend PR**:
- Model file with heuristic + converter exporting `ModelDescriptionSchema[]`
- Wire types if the vendor's `/v1/models` response has non-standard fields
- Vendor icon (SVG preferred) in `src/common/components/icons/vendors/`
- Two-line addition to the heuristic chain in `listModels.dispatch.ts`
3. **Do not accept**: New registered vendors for OpenAI-compatible providers. The maintenance
cost of a full vendor (UI component, registry entry, dispatch case) is not justified when
dynamic detection achieves the same result with a fraction of the code.
## Architecture Notes
### How Dynamic Detection Works
In `listModels.dispatch.ts`, the `case 'openai':` handler:
1. Fetches `/v1/models` from the user-provided API host
2. Runs the hostname through a chain of heuristics (in order)
3. First matching heuristic's converter is used to parse models
4. Falls back to stock OpenAI parsing if no heuristic matches
### Hostname Security
Hostname matching uses `llmsHostnameMatches()` from `openai.access.ts` which parses the
URL properly to prevent DNS spoofing. Always use `.includes()` on the parsed hostname,
never on the raw URL string.
### Key Types
- `ModelDescriptionSchema` (`llm.server.types.ts`) - output type for all model converters
- `DModelInterfaceV1` (`llms.types.ts`) - capability flags (Chat, Vision, Fn, Reasoning, etc.)
- `IModelVendor` (`vendors/IModelVendor.ts`) - interface for registered vendors only
- `ManualMappings` / `KnownModel` (`models.mappings.ts`) - server-side model patches
### File Locations
- Dynamic backends: `src/modules/llms/server/openai/models/`
- Wire types: `src/modules/llms/server/openai/wiretypes/`
- Dispatch: `src/modules/llms/server/listModels.dispatch.ts`
- Registered vendors: `src/modules/llms/vendors/*/`
- Vendor icons: `src/common/components/icons/vendors/`
- Type definitions: `src/modules/llms/server/llm.server.types.ts`
+120
View File
@@ -0,0 +1,120 @@
# LLM Parameters System
This document describes how parameters flow through Big-AGI's LLM parameters system, from definition to API invocation.
## System Overview
The LLM parameters system operates across five layers that transform parameters from global definitions to vendor-specific API calls. Each layer serves a specific purpose in the parameter resolution pipeline.
## Parameter Flow Architecture
### Layer 1: Parameter Registry
**File**: `src/common/stores/llms/llms.parameters.ts`
The `DModelParameterRegistry` defines all available parameters with their constraints and metadata. Each parameter includes type information, validation rules, and default behavior.
**Default Value System**: The registry supports multiple default mechanisms:
- `nullable` - Parameters that can be explicitly null to skip API transmission
- `initialValue` - Parameter's base default (e.g., `llmVndOaiRestoreMarkdown: true`)
### Layer 2: Model Specifications
**File**: `src/modules/llms/server/llm.server.types.ts`
Models declare which parameters they support through `parameterSpecs` arrays. Each spec can override registry defaults:
```typescript
parameterSpecs: [
{ paramId: 'llmVndAntThinkingBudget', initialValue: 1024 }, // Override default
{ paramId: 'llmVndGeminiThinkingBudget', rangeOverride: [0, 8192] }, // Custom range
]
```
**Parameter Visibility**: The `hidden` flag removes parameters from the UI while keeping them functional. Models can also mark parameters as `required`.
### Layer 3: Client Configuration
The system provides two UI configurators with different scopes:
#### Full Model Configuration Dialog
**File**: `src/modules/llms/models-modal/LLMParametersEditor.tsx`
Shows all non-hidden parameters from model's `parameterSpecs`. Used in the models modal for complete configuration.
#### ChatPanel Quick Controls
**File**: `src/apps/chat/components/layout-panel/ChatPanelModelParameters.tsx`
Shows only parameters that are:
- In model's `parameterSpecs`
- Listed in `_interestingParameters` array
- Not marked as `hidden`
**Value Resolution**: Both UIs use `getAllModelParameterValues()` to merge:
1. **Fallback values** - Implicit parameters get their `LLMImplicitParametersRuntimeFallback` values
2. **Initial values** - Model's `initialParameters` (populated during model creation)
3. **User values** - User's `userParameters` (highest priority)
### Layer 4: AIX Translation
**File**: `src/modules/aix/client/aix.client.ts`
The AIX client transforms DLLM parameters to wire protocol format. This layer handles parameter precedence rules and name transformations.
**Client Options**: The system supports parameter overrides through `llmOptionsOverride` and complete replacement via `llmUserParametersReplacement`.
### Layer 5: Vendor Adaptation
**Files**: `src/modules/aix/server/dispatch/chatGenerate/adapters/*.ts`
Server-side adapters translate AIX parameters to vendor APIs. Each vendor may interpret parameters differently:
- **OpenAI**: `vndEffort` -> `reasoning_effort`
- **Perplexity**: Reuses OpenAI parameter format
- **OpenAI Responses API**: Maps to structured reasoning config with additional logic
## Parameter Initialization Process
When a model is loaded:
1. **Model Creation**: `_createDLLMFromModelDescription()` creates the DLLM with empty `initialParameters`
2. **Initial Value Application**: `applyModelParameterSpecsInitialValues()` populates initial values from:
- Model spec `initialValue` (highest priority)
- Registry `initialValue` (fallback)
3. **Runtime Resolution**: `getAllModelParameterValues()` creates final parameter set:
- Required fallbacks (for missing required parameters)
- Initial parameters (model defaults)
- User parameters (user overrides)
## Special Parameter Behaviors
**Hidden Parameters**: Parameters like `llmRef` are marked `hidden: true` in the registry and never appear in the UI, but remain functional for system use.
**Nullable Parameters**: Parameters with `nullable` configuration can be explicitly set to `null` to prevent transmission to the API, distinct from being undefined.
**Range Overrides**: Models can override parameter ranges (e.g., different Gemini models support different thinking budget ranges).
**Parameter Interactions**: The UI implements business logic like disabling web search when reasoning effort is 'minimal'.
## Type Safety Mechanisms
The system maintains type safety through:
- `DModelParameterId` union from registry keys
- `DModelParameterValue<T>` conditional types for values
- `DModelParameterSpecAny` interfaces for specifications
- Runtime validation via Zod schemas at API boundaries
## Model Variant Pattern
Some vendors use model variants to enable features, for instance:
- **Anthropic**: Creates separate `idVariant: 'thinking'` entries forcing value of hidden parameters
- **Google/OpenAI**: Parameters directly on base models
## Migration and Compatibility
The architecture supports parameter evolution:
- **Precedence Rules**: Newer parameters take priority during AIX translation
- **Graceful Degradation**: Unknown parameters log warnings but don't break functionality
## Key Implementation Files
- **Registry**: `src/common/stores/llms/llms.parameters.ts`
- **Specifications**: `src/modules/llms/server/llm.server.types.ts`
- **UI Controls**: `src/modules/llms/models-modal/LLMParametersEditor.tsx`
- **AIX Translation**: `src/modules/aix/client/aix.client.ts`
- **Wire Types**: `src/modules/aix/server/api/aix.wiretypes.ts`
- **Vendor Adapters**: `src/modules/aix/server/dispatch/chatGenerate/adapters/*.ts`
+151
View File
@@ -0,0 +1,151 @@
# Big-AGI Routing & Display States
This document describes the routing architecture and display state hierarchy in Big-AGI, from top-level providers down to component-level states.
## Overview
Big-AGI uses Next.js Pages Router with a provider stack that determines what users see based on application state and configuration.
## Quick Reference: Route Configurations
| Route | Purpose | Key Features |
|-------|---------|--------------|
| `/` | Main chat app | Default application |
| `/call` | Voice interface | Voice-to-voice AI conversations |
| `/personas` | Persona management | Create and manage AI personas |
| ... | | |
## Decision Flow Diagram
The routing decisions follow a hierarchy from system-level provider configuration down to component-level states.
```mermaid
flowchart TD
Start([Navigate to Route]) --> Root[_app.tsx]
Root --> Theme[ProviderTheming]
Theme --> Error[ErrorBoundary]
Error --> Bootstrap[ProviderBootstrapLogic]
Bootstrap --> BootCheck{Bootstrap Checks}
BootCheck -->|News| News[↗️ /news]
BootCheck -->|Continue| Router{Router}
Router -->|/| Chat[Chat App]
Router -->|/personas,/call,/beam...| OtherApps[Other Apps]
Router -->|/news| NewsApp[News App]
Chat --> ChatStates{Chat States}
ChatStates -->|No Models| ZeroModels[🟡 Setup Models]
ChatStates -->|No Conv| ZeroConv[🟡 Select Chat]
ChatStates -->|No Msgs| PersonaGrid[Choose Persona]
ChatStates -->|Ready| Active[🟢 Active Chat]
Active --> Features[Features:<br/>• Chat Bar<br/>• Beam Mode<br/>• Attachments]
style ZeroModels fill:#fff4cc
style ZeroConv fill:#fff4cc
style Active fill:#ccffcc
style Chat fill:#f0f8ff
style OtherApps fill:#f0f8ff
style NewsApp fill:#f0f8ff
```
## Display State Hierarchy
```
_app.tsx (Root)
├── ProviderTheming ← Always Applied
├── ErrorBoundary ← Always Applied
├── ProviderBootstrapLogic ← Always Applied
│ ├── Tiktoken preload & Model auto-config
│ ├── Storage maintenance & cleanup
│ └── News Redirect (if conditions met)
└── Page Component
├── AppChat (/) → Default app
│ ├── CMLZeroModels → If no models configured
│ ├── CMLZeroConversation → If no conversation selected
│ └── PersonaGrid → If conversation empty
└── Other Apps → Personas, Call, Draw, News, Beam
```
## Provider Stack
| Provider | Purpose | Key Functions |
|----------|---------|---------------|
| **ProviderTheming** | UI theme management | Theme switching, CSS variables |
| **ErrorBoundary** | Error handling | Catches and displays errors gracefully |
| **ProviderBootstrapLogic** | App initialization | • Tiktoken preload<br>• Model auto-config<br>• Storage cleanup<br>• News redirect logic |
For detailed initialization sequence and provider functions, see [app-startup-sequence.md](app-startup-sequence.md), if present.
## Application Routes
### Primary Apps
- `/` → AppChat (default)
- `/call` → Voice call interface
- `/beam` → Multi-model reasoning
- `/draw` → Image generation
- `/personas` → Personas app
- `/news` → News/updates
### Zero States
#### Chat App Zero States
**CMLZeroModels**
- **Location**: `/src/apps/chat/components/messages-list/CMLZeroModels.tsx`
- **Triggered**: No LLM sources configured
- **Shows**: Welcome screen with "Setup Models" button
**CMLZeroConversation**
- **Location**: `/src/apps/chat/components/messages-list/CMLZeroConversation.tsx`
- **Triggered**: No conversation selected
- **Shows**: "Select/create conversation" prompt
**PersonaGrid**
- **App**: Chat (when conversation is empty)
- **Triggered**: Conversation exists but has no messages
- **Shows**: Persona selector interface
#### Feature-Specific Zero States
**Beam Tutorial**
- **Feature**: Beam (multi-model reasoning)
- **Component**: `ExplainerCarousel`
- **Triggered**: First-time Beam usage
- **Shows**: Interactive feature walkthrough
## Common Scenarios
### New User First Visit
1. Navigates to `/` → Provider stack loads
2. Bootstrap runs → No news redirect (first visit)
3. Chat loads → **CMLZeroModels** (no models configured)
4. User clicks "Setup Models" → Configuration flow
### Returning User with Saved State
1. Navigates to `/` → Provider stack loads
2. IndexedDB restores state → Previous conversation loaded
3. Chat loads → **Active chat interface** (bypasses all zero states)
4. All messages and context preserved from last session
### Shared Chat Viewer
1. Navigates to `/link/chat/[id]` → Full provider stack
2. Views read-only chat → May see "Import" option
3. If importing → Checks for duplicates, creates new local conversation
## Storage System
Big-AGI uses a local-first architecture:
- **Zustand** for reactive state management
- **IndexedDB** for persistent storage via Zustand persist middleware
- **Version-based migrations** for data structure upgrades
Key stores:
- `app-chats`: Conversations and messages (IndexedDB)
- `app-llms`: Model configurations (IndexedDB)
- `app-ui`: UI preferences (localStorage)
+29
View File
@@ -0,0 +1,29 @@
# CSF - Client-Side Fetch
Client-Side Fetch (CSF), surfaced to users as **"Direct Connection"**, enables direct browser-to-API communication, bypassing the server for LLM requests. When enabled, the browser makes requests directly to vendor APIs (e.g., `api.openai.com`, `api.groq.com`) instead of routing through the Next.js server. This reduces latency, decreases server load, and is particularly useful for local models where the browser can communicate directly with Ollama or LM Studio.
## User-facing tradeoffs (Direct Connection vs via-server)
Wins when Direct Connection is on:
- **No 4.5MB upload limit** (Vercel body-size cap does not apply to direct browser-to-API requests).
- **No 300s function timeout** (Vercel serverless/edge timeout does not apply; call duration is bound only by the AI service).
- **More privacy**: connection metadata (IP, timestamp, edge region, Vercel telemetry) is not observable by the Big-AGI edge server.
Costs:
- **Slightly more downlink bandwidth**: when traffic passes through the Big-AGI server, repetitive streaming frames are shed/compacted; direct streams arrive verbatim.
Availability requires both:
1. The API key is on the **client** (localStorage), not a server-side env var. Server-key deployments cannot use CSF because the browser has no credential to send.
2. The AI service **allows CORS** from browsers. Most major providers do; some require specific headers which Big-AGI sets.
Net: Direct Connection is a win on speed, limits, and privacy whenever the provider permits it. It is unavailable when keys are server-side or the provider blocks browser-origin requests.
## Implementation
CSF is implemented as an opt-in setting stored as `csf: boolean` in each vendor's service settings. The vendor interface exposes `csfAvailable?: (setup) => boolean` to determine if CSF can be enabled (typically checking if an API key or host is configured). The actual execution happens in `aix.client.direct-chatGenerate.ts` which dynamically imports when CSF is active, making direct fetch calls using the same wire protocols as the server.
All 20+ supported vendors (OpenAI, Anthropic, Gemini, Ollama, LocalAI, Deepseek, Groq, Mistral, xAI, OpenRouter, Perplexity, Together AI, Alibaba, Moonshot, OpenPipe, LM Studio, Z.ai, Azure, Bedrock) support CSF. Cloud vendors require CORS support from the API provider (all tested vendors return `access-control-allow-origin: *`). Local vendors (Ollama, LocalAI, LM Studio) require CORS to be enabled on the local server.
## UI
The CSF toggle appears in each vendor's setup panel under "Advanced" settings, labeled "Direct Connection". It becomes visible when the prerequisites are met (API key present for cloud vendors, host configured for local vendors). The setting is managed through `useModelServiceClientSideFetch` hook which provides `csfAvailable`, `csfActive`, `csfToggle`, and `csfReset` for UI consumption.
+3
View File
@@ -0,0 +1,3 @@
## Strategic Vision
If provided, the following influences the long-term vision, product and architectural goals/north stars for Big-AGI.
-85
View File
@@ -1,85 +0,0 @@
import { readFile } from 'node:fs/promises';
// Build information
process.env.NEXT_PUBLIC_BUILD_HASH = 'big-agi-2-dev';
process.env.NEXT_PUBLIC_BUILD_PKGVER = JSON.parse('' + await readFile(new URL('./package.json', import.meta.url))).version;
process.env.NEXT_PUBLIC_BUILD_TIMESTAMP = new Date().toISOString();
console.log(` 🧠 \x1b[1mbig-AGI\x1b[0m v${process.env.NEXT_PUBLIC_BUILD_PKGVER} (@${process.env.NEXT_PUBLIC_BUILD_HASH})`);
// Non-default build types
const buildType =
process.env.BIG_AGI_BUILD === 'standalone' ? 'standalone'
: process.env.BIG_AGI_BUILD === 'static' ? 'export'
: undefined;
buildType && console.log(` 🧠 big-AGI: building for ${buildType}...\n`);
/** @type {import('next').NextConfig} */
let nextConfig = {
reactStrictMode: true,
// [exports] https://nextjs.org/docs/advanced-features/static-html-export
...buildType && {
output: buildType,
distDir: 'dist',
// disable image optimization for exports
images: { unoptimized: true },
// Optional: Change links `/me` -> `/me/` and emit `/me.html` -> `/me/index.html`
// trailingSlash: true,
},
// [puppeteer] https://github.com/puppeteer/puppeteer/issues/11052
// NOTE: we may not be needing this anymore, as we use '@cloudflare/puppeteer'
serverExternalPackages: ['puppeteer-core'],
webpack: (config, { isServer }) => {
// @mui/joy: anything material gets redirected to Joy
config.resolve.alias['@mui/material'] = '@mui/joy';
// @dqbd/tiktoken: enable asynchronous WebAssembly
config.experiments = {
asyncWebAssembly: true,
layers: true,
};
// fix warnings for async functions in the browser (https://github.com/vercel/next.js/issues/64792)
if (!isServer) {
config.output.environment = { ...config.output.environment, asyncFunction: true };
}
// prevent too many small chunks (40kb min) on 'client' packs (not 'server' or 'edge-server')
// noinspection JSUnresolvedReference
if (typeof config.optimization.splitChunks === 'object' && config.optimization.splitChunks.minSize) {
// noinspection JSUnresolvedReference
config.optimization.splitChunks.minSize = 40 * 1024;
}
return config;
},
// Note: disabled to check whether the project becomes slower with this
// modularizeImports: {
// '@mui/icons-material': {
// transform: '@mui/icons-material/{{member}}',
// },
// },
// Uncomment the following leave console messages in production
// compiler: {
// removeConsole: false,
// },
};
// Validate environment variables, if set at build time. Will be actually read and used at runtime.
// This is the reason both this file and the servr/env.mjs files have this extension.
await import('./src/server/env.mjs');
// conditionally enable the nextjs bundle analyzer
if (process.env.ANALYZE_BUNDLE) {
const { default: withBundleAnalyzer } = await import('@next/bundle-analyzer');
nextConfig = withBundleAnalyzer({ openAnalyzer: true })(nextConfig);
}
export default nextConfig;
+160
View File
@@ -0,0 +1,160 @@
import type { NextConfig } from 'next';
import type { WebpackConfigContext } from 'next/dist/server/config-shared';
import { execSync } from 'node:child_process';
import { readFileSync } from 'node:fs';
// Build information: from CI, or git commit hash
let buildHash = process.env.NEXT_PUBLIC_BUILD_HASH || process.env.GITHUB_SHA || process.env.VERCEL_GIT_COMMIT_SHA; // Docker or custom, GitHub Actions, Vercel
try {
// fallback to local git commit hash
if (!buildHash)
buildHash = execSync('git rev-parse --short HEAD').toString().trim();
} catch {
// final fallback
buildHash = '2-dev';
}
// The following are used by/available to Release.buildInfo(...)
process.env.NEXT_PUBLIC_BUILD_HASH = (buildHash || '').slice(0, 10);
process.env.NEXT_PUBLIC_BUILD_PKGVER = JSON.parse('' + readFileSync(new URL('./package.json', import.meta.url))).version;
process.env.NEXT_PUBLIC_BUILD_TIMESTAMP = new Date().toISOString();
process.env.NEXT_PUBLIC_DEPLOYMENT_TYPE = process.env.NEXT_PUBLIC_DEPLOYMENT_TYPE || (process.env.VERCEL_ENV ? `vercel-${process.env.VERCEL_ENV}` : 'local'); // Docker or custom, Vercel
console.log(` 🧠 \x1b[1mbig-AGI\x1b[0m v${process.env.NEXT_PUBLIC_BUILD_PKGVER} (@${process.env.NEXT_PUBLIC_BUILD_HASH}${process.env.VERCEL_ENV ? `, \x1b[2mV:\x1b[0m${process.env.VERCEL_ENV}` : ''}, \x1b[2mN:\x1b[0m${process.env.NODE_ENV})`);
// Non-default build types
const buildType =
process.env.BIG_AGI_BUILD === 'standalone' ? 'standalone' as const
: process.env.BIG_AGI_BUILD === 'static' ? 'export' as const
: undefined;
buildType && console.log(` 🧠 big-AGI: building for ${buildType}...\n`);
/** @type {import('next').NextConfig} */
let nextConfig: NextConfig = {
reactStrictMode: !process.env.NO_STRICT_MODE, // default: enabled
// [exports] https://nextjs.org/docs/advanced-features/static-html-export
...(buildType && {
output: buildType,
distDir: 'dist',
// disable image optimization for exports
images: { unoptimized: true },
// Optional: Change links `/me` -> `/me/` and emit `/me.html` -> `/me/index.html`
// trailingSlash: true,
}),
// [puppeteer] https://github.com/puppeteer/puppeteer/issues/11052
// NOTE: we may not be needing this anymore, as we use '@cloudflare/puppeteer'
serverExternalPackages: ['puppeteer-core'],
webpack: (config: any, { isServer, webpack /*, dev, nextRuntime*/ }: WebpackConfigContext) => {
// @mui/joy: anything material gets redirected to Joy
config.resolve.alias['@mui/material'] = '@mui/joy';
// @dqbd/tiktoken: enable asynchronous WebAssembly
config.experiments = {
asyncWebAssembly: true,
layers: true,
};
// client-side bundling
if (!isServer) {
/**
* AIX client-side
* We replace certain server-only modules with client-side mocks, to reuse the exact same imports
* while avoiding importing server-only code which would break the build or break at runtime.
*/
const serverToClientMocks: ReadonlyArray<[RegExp, string]> = [
[/\/posthog\.server/, '/posthog.client-mock'],
[/\/env\.server/, '/env.client-mock'],
];
config.plugins = [
...config.plugins,
...serverToClientMocks.map(([pattern, replacement]) =>
new webpack.NormalModuleReplacementPlugin(pattern, (resource: any) => {
// console.log(' 🧠 [WEBPACK REPLACEMENT]:', resource.request, '->', resource.request.replace(pattern, replacement));
resource.request = resource.request.replace(pattern, replacement);
}),
),
];
// cosmetic: fix warnings for (absent!) top-level awaits in the browser (https://github.com/vercel/next.js/issues/64792)
config.output.environment = { ...config.output.environment, asyncFunction: true };
}
// prevent too many small chunks (40kb min) on 'client' packs (not 'server' or 'edge-server')
// noinspection JSUnresolvedReference
if (typeof config.optimization.splitChunks === 'object' && config.optimization.splitChunks.minSize) {
// noinspection JSUnresolvedReference
config.optimization.splitChunks.minSize = 40 * 1024;
}
return config;
},
// Optional Analytics > PostHog
skipTrailingSlashRedirect: true, // required to support PostHog trailing slash API requests
async rewrites() {
return [
{
source: '/a/ph/static/:path*',
destination: 'https://us-assets.i.posthog.com/static/:path*',
},
{
source: '/a/ph/:path*',
destination: 'https://us.i.posthog.com/:path*',
},
{
source: '/a/ph/decide',
destination: 'https://us.i.posthog.com/decide',
},
{
source: '/a/ph/flags',
destination: 'https://us.i.posthog.com/flags',
},
];
},
// Note: disabled to check whether the project becomes slower with this
// modularizeImports: {
// '@mui/icons-material': {
// transform: '@mui/icons-material/{{member}}',
// },
// },
// Uncomment the following leave console messages in production
// compiler: {
// removeConsole: false,
// },
};
// Validate environment variables at build time, if required. Server env vars will be actually read and used at runtime (cloud/edge).
import { env as validateEnv } from '~/server/env.server';
void validateEnv; // Triggers env validation - throws if required vars are missing
// PostHog error reporting with source maps for production builds
import { withPostHogConfig } from '@posthog/nextjs-config';
if (process.env.POSTHOG_API_KEY && process.env.POSTHOG_ENV_ID) {
console.log(' 🧠 \x1b[1mbig-AGI\x1b[0m: building with PostHog issue reporting and source maps...');
nextConfig = withPostHogConfig(nextConfig, {
personalApiKey: process.env.POSTHOG_API_KEY,
envId: process.env.POSTHOG_ENV_ID,
host: 'https://us.i.posthog.com', // backtrace upload host
logLevel: 'error', // lowered, too noisy
sourcemaps: {
enabled: process.env.NODE_ENV === 'production',
project: 'big-agi',
version: process.env.NEXT_PUBLIC_BUILD_HASH,
deleteAfterUpload: false, // false: leave them in the tree, which would also help debugging of open-source installs
},
});
}
// conditionally enable the nextjs bundle analyzer
import withBundleAnalyzer from '@next/bundle-analyzer';
if (process.env.ANALYZE_BUNDLE) {
nextConfig = withBundleAnalyzer({ openAnalyzer: true })(nextConfig) as NextConfig;
}
export default nextConfig;
+4565 -1593
View File
File diff suppressed because it is too large Load Diff
+63 -68
View File
@@ -1,8 +1,9 @@
{
"name": "big-agi",
"version": "1.91.0",
"version": "2.0.4",
"private": true,
"author": "Enrico Ros <enrico.ros@gmail.com>",
"author": "Enrico Ros <enrico@big-agi.com> (https://www.enricoros.com)",
"homepage": "https://big-agi.com",
"repository": "https://github.com/enricoros/big-agi",
"scripts": {
"dev": "next dev --turbopack",
@@ -11,10 +12,13 @@
"build": "next build",
"start": "next start",
"lint": "next lint",
"tsclint": "tsc --noEmit --pretty",
"postinstall": "prisma generate --no-hints",
"gen:icon-sprites": "node tools/develop/gen-icon-sprites/generate-llm-sprites.ts",
"db:push": "prisma db push",
"db:studio": "prisma studio",
"vercel:env:pull": "npx vercel env pull .env.development.local"
"vercel:env:pull": "npx vercel env pull .env.development.local",
"sharp:win32_x64": "npm install --os=win32 --cpu=x64 sharp"
},
"prisma": {
"schema": "src/server/prisma/schema.prisma"
@@ -27,84 +31,75 @@
"@emotion/cache": "^11.14.0",
"@emotion/react": "^11.14.0",
"@emotion/server": "^11.11.0",
"@emotion/styled": "^11.14.0",
"@mui/icons-material": "^5.16.14",
"@mui/joy": "^5.0.0-beta.51",
"@mui/material": "^5.16.14",
"@next/bundle-analyzer": "^15.1.4",
"@next/third-parties": "^15.1.4",
"@emotion/styled": "^11.14.1",
"@googleworkspace/drive-picker-react": "^0.2.0",
"@mui/icons-material": "^5.18.0",
"@mui/joy": "^5.0.0-beta.52",
"@next/bundle-analyzer": "~15.1.12",
"@prisma/client": "~5.22.0",
"@t3-oss/env-nextjs": "^0.11.1",
"@tanstack/react-query": "^5.63.0",
"@trpc/client": "11.0.0-rc.688",
"@trpc/next": "11.0.0-rc.688",
"@trpc/react-query": "11.0.0-rc.688",
"@trpc/server": "11.0.0-rc.688",
"@vercel/analytics": "^1.4.1",
"@vercel/speed-insights": "^1.1.0",
"browser-fs-access": "^0.35.0",
"cheerio": "^1.0.0",
"dexie": "^4.0.10",
"dexie-react-hooks": "^1.1.7",
"diff": "^7.0.0",
"eventsource-parser": "^3.0.0",
"idb-keyval": "^6.2.1",
"mammoth": "^1.9.0",
"nanoid": "^5.0.9",
"next": "^15.1.4",
"@tanstack/react-query": "5.90.21",
"@tanstack/react-virtual": "^3.13.22",
"@trpc/client": "11.5.1",
"@trpc/next": "11.5.1",
"@trpc/react-query": "11.5.1",
"@trpc/server": "11.5.1",
"@vercel/analytics": "^1.6.1",
"@vercel/speed-insights": "^1.3.1",
"aws4fetch": "^1.0.20",
"browser-fs-access": "^0.38.0",
"cheerio": "^1.1.2",
"csv-stringify": "^6.6.0",
"dexie": "~4.0.11",
"dexie-react-hooks": "~1.1.7",
"diff": "^8.0.3",
"eventemitter3": "^5.0.4",
"idb-keyval": "^6.2.2",
"mammoth": "^1.11.0",
"nanoid": "^5.1.6",
"next": "~15.1.12",
"nprogress": "^0.2.0",
"pdfjs-dist": "4.10.38",
"plantuml-encoder": "^1.4.0",
"prismjs": "^1.29.0",
"pdfjs-dist": "5.4.54",
"posthog-js": "^1.369.0",
"posthog-node": "^5.29.2",
"prismjs": "^1.30.0",
"puppeteer-core": "^24.40.0",
"react": "^18.3.1",
"react-csv": "^2.2.2",
"react-dom": "^18.3.1",
"react-hook-form": "^7.54.2",
"react-katex": "^3.0.1",
"react-markdown": "^9.0.3",
"react-player": "^2.16.0",
"react-resizable-panels": "^2.1.7",
"react-timeago": "^7.2.0",
"react-hook-form": "^7.71.2",
"react-markdown": "^10.1.0",
"react-player": "^3.4.0",
"react-resizable-panels": "^3.0.6",
"react-timeago": "^8.3.0",
"rehype-katex": "^7.0.1",
"remark-gfm": "^4.0.0",
"remark-gfm": "^4.0.1",
"remark-mark-highlight": "^0.1.1",
"remark-math": "^6.0.0",
"sharp": "^0.33.5",
"superjson": "^2.2.2",
"tesseract.js": "^6.0.0",
"tiktoken": "^1.0.18",
"turndown": "^7.2.0",
"zod": "^3.24.1",
"zod-to-json-schema": "^3.24.1",
"zustand": "^5.0.3"
"sharp": "^0.34.5",
"superjson": "^2.2.6",
"tesseract.js": "^7.0.0",
"tiktoken": "^1.0.22",
"turndown": "^7.2.2",
"zod": "^4.3.6",
"zustand": "5.0.7"
},
"devDependencies": {
"@types/diff": "^7.0.0",
"@types/node": "^22.10.5",
"@posthog/nextjs-config": "~1.6.4",
"@types/node": "^25.6.0",
"@types/nprogress": "^0.2.3",
"@types/plantuml-encoder": "^1.4.2",
"@types/prismjs": "^1.26.5",
"@types/react": "^18.3.18",
"@types/react-beautiful-dnd": "^13.1.8",
"@types/prismjs": "^1.26.6",
"@types/react": "^19.2.14",
"@types/react-csv": "^1.1.10",
"@types/react-dom": "^18.3.5",
"@types/react-katex": "^3.0.4",
"@types/react-timeago": "^4.1.7",
"@types/turndown": "^5.0.5",
"cross-env": "^7.0.3",
"eslint": "^9.17.0",
"eslint-config-next": "^15.1.4",
"prettier": "^3.4.2",
"@types/react-dom": "^19.2.3",
"@types/turndown": "^5.0.6",
"cross-env": "^10.1.0",
"eslint": "^9.39.4",
"eslint-config-next": "~15.1.12",
"prettier": "^3.8.2",
"prisma": "~5.22.0",
"puppeteer-core": "^23.11.1",
"typescript": "^5.7.3"
"tsx": "^4.21.0",
"typescript": "^6.0.2"
},
"engines": {
"node": "^22.0.0 || ^20.0.0"
},
"overrides": {
"@types/react": "^18.3.18",
"@types/react-dom": "^18.3.5",
"uri-js": "npm:uri-js-replace"
"node": "^24.0.0 || ^22.0.0 || ^20.0.0"
}
}
+19 -9
View File
@@ -1,12 +1,17 @@
import * as React from 'react';
import Head from 'next/head';
import dynamic from 'next/dynamic';
import { MyAppProps } from 'next/app';
import { Analytics as VercelAnalytics } from '@vercel/analytics/next';
import { SpeedInsights as VercelSpeedInsights } from '@vercel/speed-insights/next';
import { Brand } from '~/common/app.config';
import { apiQuery } from '~/common/util/trpc.client';
// [server-client-safe] dynamic imports to avoid webpack bundling issues with next/navigation
const VercelAnalytics = dynamic(() => import('@vercel/analytics/next').then(mod => mod.Analytics), { ssr: false });
const VercelSpeedInsights = dynamic(() => import('@vercel/speed-insights/next').then(mod => mod.SpeedInsights), { ssr: false });
import 'katex/dist/katex.min.css';
import '~/common/styles/CodePrism.css';
import '~/common/styles/GithubMarkdown.css';
@@ -14,6 +19,7 @@ import '~/common/styles/NProgress.css';
import '~/common/styles/agi.effects.css';
import '~/common/styles/app.styles.css';
import { ErrorBoundary } from '~/common/components/ErrorBoundary';
import { Is } from '~/common/util/pwaUtils';
import { OverlaysInsert } from '~/common/layout/overlays/OverlaysInsert';
import { ProviderBackendCapabilities } from '~/common/providers/ProviderBackendCapabilities';
@@ -21,7 +27,8 @@ import { ProviderBootstrapLogic } from '~/common/providers/ProviderBootstrapLogi
import { ProviderSingleTab } from '~/common/providers/ProviderSingleTab';
import { ProviderTheming } from '~/common/providers/ProviderTheming';
import { SnackbarInsert } from '~/common/components/snackbar/SnackbarInsert';
import { hasGoogleAnalytics, OptionalGoogleAnalytics } from '~/common/components/GoogleAnalytics';
import { hasGoogleAnalytics, OptionalGoogleAnalytics } from '~/common/components/3rdparty/GoogleAnalytics';
import { hasPostHogAnalytics, OptionalPostHogAnalytics } from '~/common/components/3rdparty/PostHogAnalytics';
const Big_AGI_App = ({ Component, emotionCache, pageProps }: MyAppProps) => {
@@ -42,18 +49,21 @@ const Big_AGI_App = ({ Component, emotionCache, pageProps }: MyAppProps) => {
<ProviderSingleTab>
<ProviderBackendCapabilities>
{/* ^ Backend capabilities & SSR boundary */}
<ProviderBootstrapLogic>
<SnackbarInsert />
{getLayout(<Component {...pageProps} />)}
<OverlaysInsert />
</ProviderBootstrapLogic>
<ErrorBoundary outer>
<ProviderBootstrapLogic>
<SnackbarInsert />
{getLayout(<Component {...pageProps} />)}
<OverlaysInsert />
</ProviderBootstrapLogic>
</ErrorBoundary>
</ProviderBackendCapabilities>
</ProviderSingleTab>
</ProviderTheming>
{hasGoogleAnalytics && <OptionalGoogleAnalytics />}
{hasPostHogAnalytics && <OptionalPostHogAnalytics />}
{Is.Deployment.VercelFromFrontend && <VercelAnalytics debug={false} />}
{Is.Deployment.VercelFromFrontend && <VercelSpeedInsights debug={false} sampleRate={1 / 2} />}
{hasGoogleAnalytics && <OptionalGoogleAnalytics />}
</>;
};
+24 -4
View File
@@ -37,14 +37,31 @@ export default function MyDocument({ emotionStyleTags }: MyDocumentProps) {
<meta property='og:site_name' content={Brand.Meta.SiteName} />
<meta property='og:type' content='website' />
{/* Twitter */}
<meta property='twitter:card' content='summary_large_image' />
{/* Twitter / X */}
<meta name='twitter:card' content='summary_large_image' />
<meta property='twitter:url' content={Brand.URIs.Home} />
<meta property='twitter:title' content={Brand.Title.Common} />
<meta property='twitter:description' content={Brand.Meta.Description} />
{Brand.URIs.CardImage && <meta property='twitter:image' content={Brand.URIs.CardImage} />}
<meta name='twitter:site' content={Brand.Meta.TwitterSite} />
<meta name='twitter:card' content='summary_large_image' />
<meta name='twitter:creator' content='@enricoros' />
<link rel='canonical' href={Brand.URIs.Home} />
{/* Author & Structured Data */}
<meta name='author' content='Enrico Ros' />
<link rel='author' href='https://www.enricoros.com' />
<script type='application/ld+json' dangerouslySetInnerHTML={{ __html: JSON.stringify({
'@context': 'https://schema.org',
'@type': 'SoftwareApplication',
'name': 'Big-AGI',
'url': 'https://big-agi.com',
'applicationCategory': 'ProductivityApplication',
'operatingSystem': 'All, Web',
'description': Brand.Meta.Description,
'sameAs': ['https://github.com/enricoros/big-agi', 'https://discord.gg/MkH4qj2Jp9',],
'author': { '@type': 'Person', 'name': 'Enrico Ros', 'url': 'https://www.enricoros.com' },
'publisher': { '@type': 'Organization', 'name': 'Token Fabrics LLC', 'url': 'https://www.tokenfabrics.com' },
}) }} />
{/* Style Sheets (injected and server-side) */}
<meta name='emotion-insertion-point' content='' />
@@ -100,6 +117,10 @@ MyDocument.getInitialProps = async (ctx: DocumentContext) => {
});
const initialProps = await Document.getInitialProps(ctx);
// Inject the comment before the HTML tag
initialProps.html = `<!-- ❤ Built with Big-AGI -->\n${initialProps.html}`;
// This is important. It prevents Emotion to render invalid HTML.
// See https://github.com/mui/material-ui/issues/26561#issuecomment-855286153
const emotionStyles = extractCriticalToChunks(initialProps.html);
@@ -107,7 +128,6 @@ MyDocument.getInitialProps = async (ctx: DocumentContext) => {
<style
data-emotion={`${style.key} ${style.ids.join(' ')}`}
key={style.key}
// eslint-disable-next-line react/no-danger
dangerouslySetInnerHTML={{ __html: style.css }}
/>
));
+3 -5
View File
@@ -18,18 +18,18 @@ import { ROUTE_APP_CHAT, ROUTE_INDEX } from '~/common/app.routes';
import { Release } from '~/common/app.release';
// capabilities access
import { useCapabilityBrowserSpeechRecognition, useCapabilityElevenLabs, useCapabilityTextToImage } from '~/common/components/useCapabilities';
import { useCapabilityBrowserSpeechRecognition, useCapabilityTextToImage } from '~/common/components/useCapabilities';
// stores access
import { getLLMsDebugInfo } from '~/common/stores/llms/store-llms';
import { useChatStore } from '~/common/stores/chat/store-chats';
import { useFolderStore } from '~/common/stores/folders/store-chat-folders';
import { useLogicSherpaStore } from '~/common/logic/store-logic-sherpa';
import { useUXLabsStore } from '~/common/state/store-ux-labs';
import { useUXLabsStore } from '~/common/stores/store-ux-labs';
// utils access
import { BrowserLang, clientHostName, Is, isPwa } from '~/common/util/pwaUtils';
import { getGA4MeasurementId } from '~/common/components/GoogleAnalytics';
import { getGA4MeasurementId } from '~/common/components/3rdparty/GoogleAnalytics';
import { prettyTimestampForFilenames } from '~/common/util/timeUtils';
import { supportsClipboardRead } from '~/common/util/clipboardUtils';
import { supportsScreenCapture } from '~/common/util/screenCaptureUtils';
@@ -95,7 +95,6 @@ function AppDebug() {
const cProduct = {
capabilities: {
mic: useCapabilityBrowserSpeechRecognition(),
elevenLabs: useCapabilityElevenLabs(),
textToImage: useCapabilityTextToImage(),
},
models: getLLMsDebugInfo(),
@@ -109,7 +108,6 @@ function AppDebug() {
reloads: usageCount,
},
release: {
app: Release.App,
build: frontendBuild,
},
};
Binary file not shown.

After

Width:  |  Height:  |  Size: 2.3 MiB

+1 -1
View File
@@ -3,7 +3,7 @@
"short_name": "big-AGI",
"theme_color": "#32383E",
"background_color": "#9FA6AD",
"description": "Your Generative AI Suite",
"description": "Open-source AI workspace. Multi-model reasoning and personas for maximum control.",
"categories": [
"productivity",
"AI",
File diff suppressed because one or more lines are too long
+2 -3
View File
@@ -10,7 +10,6 @@ import { createBeamVanillaStore } from '~/modules/beam/store-beam_vanilla';
import { OptimaToolbarIn } from '~/common/layout/optima/portals/OptimaPortalsIn';
import { createDConversation, DConversation } from '~/common/stores/chat/chat.conversation';
import { createDMessageTextContent, DMessage } from '~/common/stores/chat/chat.message';
import { getChatLLMId } from '~/common/stores/llms/store-llms';
import { useIsMobile } from '~/common/components/useMatchMedia';
@@ -21,8 +20,8 @@ function initTestConversation(): DConversation {
return conversation;
}
function initTestBeamStore(messages: DMessage[], beamStore: BeamStoreApi = createBeamVanillaStore()): BeamStoreApi {
beamStore.getState().open(messages, getChatLLMId(), false, (content) => alert(content));
function initTestBeamStore(messages: DMessage[], beamStore: BeamStoreApi): BeamStoreApi {
beamStore.getState().open(messages, null, false, (content) => alert(content));
return beamStore;
}
+17 -14
View File
@@ -6,15 +6,17 @@ import ChatIcon from '@mui/icons-material/Chat';
import CheckRoundedIcon from '@mui/icons-material/CheckRounded';
import CloseRoundedIcon from '@mui/icons-material/CloseRounded';
import MicIcon from '@mui/icons-material/Mic';
import RecordVoiceOverTwoToneIcon from '@mui/icons-material/RecordVoiceOverTwoTone';
import WarningRoundedIcon from '@mui/icons-material/WarningRounded';
import { useSpeexGlobalEngine } from '~/modules/speex/store-module-speex';
import { PhVoice } from '~/common/components/icons/phosphor/PhVoice';
import { animationColorRainbow } from '~/common/util/animUtils';
import { navigateBack } from '~/common/app.routes';
import { optimaOpenPreferences } from '~/common/layout/optima/useOptima';
import { useCapabilityBrowserSpeechRecognition, useCapabilityElevenLabs } from '~/common/components/useCapabilities';
import { useCapabilityBrowserSpeechRecognition } from '~/common/components/useCapabilities';
import { useChatStore } from '~/common/stores/chat/store-chats';
import { useUICounter } from '~/common/state/store-ui';
import { useUICounter } from '~/common/stores/store-ui';
function StatusCard(props: { icon: React.JSX.Element, hasIssue: boolean, text: string, button?: React.JSX.Element }) {
@@ -45,7 +47,7 @@ export function CallWizard(props: { strict?: boolean, conversationId: string | n
// external state
const recognition = useCapabilityBrowserSpeechRecognition();
const synthesis = useCapabilityElevenLabs();
const speexGlobalEngine = useSpeexGlobalEngine();
const chatIsEmpty = useChatStore(state => {
if (!props.conversationId)
return false;
@@ -56,17 +58,18 @@ export function CallWizard(props: { strict?: boolean, conversationId: string | n
// derived state
const outOfTheBlue = !props.conversationId;
const overriddenEmptyChat = chatEmptyOverride || !chatIsEmpty;
const overriddenEmptyChat = outOfTheBlue || chatEmptyOverride || !chatIsEmpty;
const overriddenRecognition = recognitionOverride || recognition.mayWork;
const allGood = overriddenEmptyChat && overriddenRecognition && synthesis.mayWork;
const fatalGood = overriddenRecognition && synthesis.mayWork;
const synthesisShallWork = !!speexGlobalEngine;
const allGood = overriddenEmptyChat && overriddenRecognition && synthesisShallWork;
const fatalGood = overriddenRecognition && synthesisShallWork;
const handleOverrideChatEmpty = React.useCallback(() => setChatEmptyOverride(true), []);
const handleOverrideRecognition = React.useCallback(() => setRecognitionOverride(true), []);
const handleConfigureElevenLabs = React.useCallback(() => optimaOpenPreferences('voice'), []);
const handleConfigureVoice = React.useCallback(() => optimaOpenPreferences('voice'), []);
const handleFinishButton = React.useCallback(() => {
if (!allGood)
@@ -128,17 +131,17 @@ export function CallWizard(props: { strict?: boolean, conversationId: string | n
{/* Text to Speech status */}
<StatusCard
icon={<RecordVoiceOverTwoToneIcon />}
icon={<PhVoice />}
text={
(synthesis.mayWork ? 'Voice synthesis should be ready.' : 'There might be an issue with ElevenLabs voice synthesis.')
+ (synthesis.isConfiguredServerSide ? '' : (synthesis.isConfiguredClientSide ? '' : ' Please add your API key in the settings.'))
(synthesisShallWork ? 'Voice synthesis should be ready.' : 'There might be an issue with voice synthesis.')
// + (synthesis.isConfiguredServerSide ? '' : (synthesis.isConfiguredClientSide ? '' : ' Please add your API key in the settings.'))
}
button={synthesis.mayWork ? undefined : (
<Button variant='outlined' onClick={handleConfigureElevenLabs} sx={{ mx: 1 }}>
button={synthesisShallWork ? undefined : (
<Button variant='outlined' onClick={handleConfigureVoice} sx={{ mx: 1 }}>
Configure
</Button>
)}
hasIssue={!synthesis.mayWork}
hasIssue={!synthesisShallWork}
/>
{/*<Typography>*/}
+35 -28
View File
@@ -5,11 +5,11 @@ import { Avatar, Box, Card, CardContent, Chip, IconButton, Link as MuiLink, List
import CallIcon from '@mui/icons-material/Call';
import { GitHubProjectIssueCard } from '~/common/components/GitHubProjectIssueCard';
import { OptimaPanelGroup } from '~/common/layout/optima/panel/OptimaPanelGroup';
import { OptimaPanelGroupedList } from '~/common/layout/optima/panel/OptimaPanelGroupedList';
import { OptimaPanelIn } from '~/common/layout/optima/portals/OptimaPortalsIn';
import { animationShadowRingLimey } from '~/common/util/animUtils';
import { conversationTitle, DConversation, DConversationId } from '~/common/stores/chat/chat.conversation';
import { useChatStore } from '~/common/stores/chat/store-chats';
import { useSetOptimaAppMenu } from '~/common/layout/optima/useOptima';
import type { AppCallIntent } from './AppCall';
import { MockPersona, useMockPersonas } from './state/useMockPersonas';
@@ -210,7 +210,7 @@ function useConversationsByPersona() {
}
export function Contacts(props: { setCallIntent: (intent: AppCallIntent) => void }) {
function ContactsMenuItems() {
// external state
const {
@@ -218,36 +218,43 @@ export function Contacts(props: { setCallIntent: (intent: AppCallIntent) => void
showConversations, toggleShowConversations,
showSupport, toggleShowSupport,
} = useAppCallStore();
return (
<OptimaPanelGroupedList title='Contacts Settings'>
<MenuItem onClick={toggleGrayUI}>
Grayed UI
<Switch checked={grayUI} sx={{ ml: 'auto' }} />
</MenuItem>
<MenuItem onClick={toggleShowConversations}>
Conversations
<Switch checked={showConversations} sx={{ ml: 'auto' }} />
</MenuItem>
<MenuItem onClick={toggleShowSupport}>
Show Support
<Switch checked={showSupport} sx={{ ml: 'auto' }} />
</MenuItem>
</OptimaPanelGroupedList>
);
}
export function Contacts(props: { setCallIntent: (intent: AppCallIntent) => void }) {
// external state
const { personas } = useMockPersonas();
const { grayUI, showConversations, showSupport } = useAppCallStore();
const conversationsByPersona = useConversationsByPersona();
// pluggable UI
const menuItems = React.useMemo(() => <OptimaPanelGroup title='Contacts Settings'>
<MenuItem onClick={toggleGrayUI}>
Grayed UI
<Switch checked={grayUI} sx={{ ml: 'auto' }} />
</MenuItem>
<MenuItem onClick={toggleShowConversations}>
Conversations
<Switch checked={showConversations} sx={{ ml: 'auto' }} />
</MenuItem>
<MenuItem onClick={toggleShowSupport}>
Show Support
<Switch checked={showSupport} sx={{ ml: 'auto' }} />
</MenuItem>
</OptimaPanelGroup>, [grayUI, showConversations, showSupport, toggleGrayUI, toggleShowConversations, toggleShowSupport]);
useSetOptimaAppMenu(menuItems, 'CallUI-Contacts');
return <>
{/* -> Panel */}
<OptimaPanelIn><ContactsMenuItems /></OptimaPanelIn>
{/* Header "Call AGI" */}
<Box sx={{
my: 6,
@@ -310,7 +317,7 @@ export function Contacts(props: { setCallIntent: (intent: AppCallIntent) => void
issue={354}
text='Call App: Support thread and compatibility matrix'
note={<>
Voice input uses the HTML Web Speech API, and speech output requires an ElevenLabs API Key.
Voice input uses the HTML Web Speech API.
</>}
// note2='Please report any issues you encounter'
sx={{
+50 -62
View File
@@ -7,31 +7,30 @@ import CallEndIcon from '@mui/icons-material/CallEnd';
import CallIcon from '@mui/icons-material/Call';
import MicIcon from '@mui/icons-material/Mic';
import MicNoneIcon from '@mui/icons-material/MicNone';
import RecordVoiceOverTwoToneIcon from '@mui/icons-material/RecordVoiceOverTwoTone';
import { ScrollToBottom } from '~/common/scroll-to-bottom/ScrollToBottom';
import { ScrollToBottomButton } from '~/common/scroll-to-bottom/ScrollToBottomButton';
import { useChatLLMDropdown } from '../chat/components/layout-bar/useLLMDropdown';
import { SystemPurposeId, SystemPurposes } from '../../data';
import { elevenLabsSpeakText } from '~/modules/elevenlabs/elevenlabs.client';
import { AixChatGenerateContent_DMessage, aixChatGenerateContent_DMessage_FromConversation } from '~/modules/aix/client/aix.client';
import { useElevenLabsVoiceDropdown } from '~/modules/elevenlabs/useElevenLabsVoiceDropdown';
import { aixChatGenerateContent_DMessage_FromConversation, AixChatGenerateContent_DMessageGuts } from '~/modules/aix/client/aix.client';
import { speakText } from '~/modules/speex/speex.client';
import type { OptimaBarControlMethods } from '~/common/layout/optima/bar/OptimaBarDropdown';
import { AudioPlayer } from '~/common/util/audio/AudioPlayer';
import { Link } from '~/common/components/Link';
import { OptimaPanelGroup } from '~/common/layout/optima/panel/OptimaPanelGroup';
import { OptimaToolbarIn } from '~/common/layout/optima/portals/OptimaPortalsIn';
import { OptimaPanelGroupedList } from '~/common/layout/optima/panel/OptimaPanelGroupedList';
import { OptimaPanelIn, OptimaToolbarIn } from '~/common/layout/optima/portals/OptimaPortalsIn';
import { SpeechResult, useSpeechRecognition } from '~/common/components/speechrecognition/useSpeechRecognition';
import { clipboardInterceptCtrlCForCleanup } from '~/common/util/clipboardUtils';
import { conversationTitle, remapMessagesSysToUsr } from '~/common/stores/chat/chat.conversation';
import { createDMessageFromFragments, createDMessageTextContent, DMessage, messageFragmentsReduceText } from '~/common/stores/chat/chat.message';
import { createDMessageFromFragments, createDMessageTextContent, DMessage, messageFragmentsReduceText, messageWasInterruptedAtStart } from '~/common/stores/chat/chat.message';
import { createErrorContentFragment } from '~/common/stores/chat/chat.fragments';
import { launchAppChat, navigateToIndex } from '~/common/app.routes';
import { useChatStore } from '~/common/stores/chat/store-chats';
import { useGlobalShortcuts } from '~/common/components/shortcuts/useGlobalShortcuts';
import { usePlayUrl } from '~/common/util/audio/usePlayUrl';
import { useSetOptimaAppMenu } from '~/common/layout/optima/useOptima';
import { usePlayUrlInterval } from './state/usePlayUrlInterval';
import type { AppCallIntent } from './AppCall';
import { CallAvatar } from './components/CallAvatar';
@@ -41,22 +40,17 @@ import { CallStatus } from './components/CallStatus';
import { useAppCallStore } from './state/store-app-call';
function CallMenuItems(props: {
function CallMenu(props: {
pushToTalk: boolean,
setPushToTalk: (pushToTalk: boolean) => void,
override: boolean,
setOverride: (overridePersonaVoice: boolean) => void,
}) {
// external state
const { grayUI, toggleGrayUI } = useAppCallStore();
const { voicesDropdown } = useElevenLabsVoiceDropdown(false, !props.override);
const handlePushToTalkToggle = () => props.setPushToTalk(!props.pushToTalk);
const handleChangeVoiceToggle = () => props.setOverride(!props.override);
return <OptimaPanelGroup title='Call'>
return <OptimaPanelGroupedList title='Call'>
<MenuItem onClick={handlePushToTalkToggle}>
<ListItemDecorator>{props.pushToTalk ? <MicNoneIcon /> : <MicIcon />}</ListItemDecorator>
@@ -64,17 +58,6 @@ function CallMenuItems(props: {
<Switch checked={props.pushToTalk} onChange={handlePushToTalkToggle} sx={{ ml: 'auto' }} />
</MenuItem>
<MenuItem onClick={handleChangeVoiceToggle}>
<ListItemDecorator><RecordVoiceOverTwoToneIcon /></ListItemDecorator>
Change Voice
<Switch checked={props.override} onChange={handleChangeVoiceToggle} sx={{ ml: 'auto' }} />
</MenuItem>
<MenuItem>
<ListItemDecorator>{' '}</ListItemDecorator>
{voicesDropdown}
</MenuItem>
<ListDivider />
<MenuItem onClick={toggleGrayUI}>
@@ -86,7 +69,7 @@ function CallMenuItems(props: {
Voice Calls Feedback
</MenuItem>
</OptimaPanelGroup>;
</OptimaPanelGroupedList>;
}
@@ -99,7 +82,6 @@ export function Telephone(props: {
const [avatarClickCount, setAvatarClickCount] = React.useState<number>(0);// const [micMuted, setMicMuted] = React.useState(false);
const [callElapsedTime, setCallElapsedTime] = React.useState<string>('00:00');
const [callMessages, setCallMessages] = React.useState<DMessage[]>([]);
const [overridePersonaVoice, setOverridePersonaVoice] = React.useState<boolean>(false);
const [personaTextInterim, setPersonaTextInterim] = React.useState<string | null>(null);
const [pushToTalk, setPushToTalk] = React.useState(true);
const [stage, setStage] = React.useState<'ring' | 'declined' | 'connected' | 'ended'>('ring');
@@ -107,7 +89,7 @@ export function Telephone(props: {
const responseAbortController = React.useRef<AbortController | null>(null);
// external state
const { chatLLMId, chatLLMDropdown } = useChatLLMDropdown(llmDropdownRef);
const { chatLLMId: modelId, chatLLMDropdown: modelDropdown } = useChatLLMDropdown(llmDropdownRef);
const { chatTitle, reMessages } = useChatStore(useShallow(state => {
const conversation = props.callIntent.conversationId
? state.conversations.find(conversation => conversation.id === props.callIntent.conversationId) ?? null
@@ -119,7 +101,7 @@ export function Telephone(props: {
}));
const persona = SystemPurposes[props.callIntent.personaId as SystemPurposeId] ?? undefined;
const personaCallStarters = persona?.call?.starters ?? undefined;
const personaVoiceId = overridePersonaVoice ? undefined : (persona?.voices?.elevenLabs?.voiceId ?? undefined);
// const personaVoiceSelector = React.useMemo(() => personaGetVoiceSelector(persona), [persona]);
const personaSystemMessage = persona?.systemMessage ?? undefined;
// hooks and speech
@@ -145,11 +127,11 @@ export function Telephone(props: {
// pickup / hangup
React.useEffect(() => {
!isRinging && AudioPlayer.playUrl(isConnected ? '/sounds/chat-begin.mp3' : '/sounds/chat-end.mp3');
!isRinging && void AudioPlayer.playUrl(isConnected ? '/sounds/chat-begin.mp3' : '/sounds/chat-end.mp3').catch(() => {/* autoplay may be blocked */});
}, [isRinging, isConnected]);
// ringtone
usePlayUrl(isRinging ? '/sounds/chat-ringtone.mp3' : null, 300, 2800 * 2);
usePlayUrlInterval(isRinging ? '/sounds/chat-ringtone.mp3' : null, 300, 2800 * 2);
/// Shortcuts
@@ -166,7 +148,6 @@ export function Telephone(props: {
};
// [E] pickup -> seed message and call timer
// FIXME: Overriding the voice will reset the call - not a desired behavior
React.useEffect(() => {
if (!isConnected) return;
@@ -186,11 +167,14 @@ export function Telephone(props: {
setCallMessages([createDMessageTextContent('assistant', firstMessage)]); // [state] set assistant:hello message
// fire/forget
void elevenLabsSpeakText(firstMessage, personaVoiceId, true, true);
// fire/forget - use 'fast' priority for real-time conversation
void speakText(firstMessage,
undefined,
{ label: 'Call', priority: 'fast' },
);
return () => clearInterval(interval);
}, [isConnected, personaCallStarters, personaVoiceId]);
}, [isConnected, personaCallStarters]);
// [E] persona streaming response - upon new user message
React.useEffect(() => {
@@ -226,7 +210,7 @@ export function Telephone(props: {
}
// bail if no llm selected
if (!chatLLMId) return;
if (!modelId) return;
// Call Message Generation Prompt
@@ -249,33 +233,40 @@ export function Telephone(props: {
setPersonaTextInterim('💭...');
aixChatGenerateContent_DMessage_FromConversation(
chatLLMId,
modelId,
callSystemInstruction,
callGenerationInputHistory,
'call',
callMessages[0].id,
{ abortSignal: responseAbortController.current.signal },
(update: AixChatGenerateContent_DMessage, _isDone: boolean) => {
(update: AixChatGenerateContent_DMessageGuts, _isDone: boolean) => {
const updatedText = messageFragmentsReduceText(update.fragments).trim();
if (updatedText)
setPersonaTextInterim(finalText = updatedText);
},
).then((status) => {
// whether status.outcome === 'success' or not, we get a valid DMessage, eventually with Error Fragments inside
// don't add the message to conversation if it was interrupted with no content
if (messageWasInterruptedAtStart(status.lastDMessage))
return;
// whether status.outcome === 'completed' or not, we get a valid DMessage, eventually with Error Fragments inside
const fullMessage = createDMessageFromFragments('assistant', status.lastDMessage.fragments);
fullMessage.generator = status.lastDMessage.generator;
setCallMessages(messages => [...messages, fullMessage]); // [state] append assistant:call_response
// fire/forget
if (status.outcome === 'success' && finalText?.length >= 1)
void elevenLabsSpeakText(finalText, personaVoiceId, true, true);
// fire/forget - use 'fast' priority for real-time conversation
if (status.outcome === 'completed' && finalText?.length >= 1)
void speakText(finalText,
undefined,
{ label: 'Call', priority: 'fast' },
);
}).catch((err: DOMException) => {
if (err?.name !== 'AbortError') {
// create an error message to explain the exception
const errorMesage = createDMessageFromFragments('assistant', [createErrorContentFragment(err.message || err.toString())]);
setCallMessages(messages => [...messages, errorMesage]); // [state] append assistant:call_response-ERROR
const errorMessage = createDMessageFromFragments('assistant', [createErrorContentFragment(err.message || err.toString())]);
setCallMessages(messages => [...messages, errorMessage]); // [state] append assistant:call_response-ERROR
}
}).finally(() => {
setPersonaTextInterim(null);
@@ -285,7 +276,7 @@ export function Telephone(props: {
responseAbortController.current?.abort();
responseAbortController.current = null;
};
}, [isConnected, callMessages, chatLLMId, personaVoiceId, personaSystemMessage, reMessages]);
}, [callMessages, isConnected, modelId, personaSystemMessage, reMessages]);
// [E] Message interrupter
const abortTrigger = isConnected && recognitionState.hasSpeech;
@@ -311,22 +302,19 @@ export function Telephone(props: {
const isMicEnabled = recognitionState.isAvailable;
const isTTSEnabled = true;
const isEnabled = isMicEnabled && isTTSEnabled;
// pluggable UI
const menuItems = React.useMemo(() =>
<CallMenuItems
pushToTalk={pushToTalk} setPushToTalk={setPushToTalk}
override={overridePersonaVoice} setOverride={setOverridePersonaVoice} />
, [overridePersonaVoice, pushToTalk],
);
useSetOptimaAppMenu(menuItems, 'CallUI-Call');
const micErrorMessage = recognitionState.errorMessage;
return <>
<OptimaToolbarIn>{chatLLMDropdown}</OptimaToolbarIn>
{/* -> Toolbar */}
<OptimaToolbarIn>{modelDropdown}</OptimaToolbarIn>
{/* -> Panel */}
<OptimaPanelIn>
<CallMenu
pushToTalk={pushToTalk} setPushToTalk={setPushToTalk}
/>
</OptimaPanelIn>
<Typography
level='h1'
@@ -350,7 +338,7 @@ export function Telephone(props: {
callerName={isConnected ? undefined : personaName}
statusText={isRinging ? '' /*'is calling you'*/ : isDeclined ? 'call declined' : isEnded ? 'call ended' : callElapsedTime}
regardingText={chatTitle}
micError={!isMicEnabled} speakError={!isTTSEnabled}
micError={!isMicEnabled} micErrorMessage={micErrorMessage} speakError={!isTTSEnabled}
/>
{/* Live Transcript, w/ streaming messages, audio indication, etc. */}
@@ -372,7 +360,7 @@ export function Telephone(props: {
<ScrollToBottom stickToBottomInitial>
<Box sx={{ minHeight: '100%', p: 1, display: 'flex', flexDirection: 'column', gap: 1 }}>
<Box onCopy={clipboardInterceptCtrlCForCleanup} sx={{ minHeight: '100%', p: 1, display: 'flex', flexDirection: 'column', gap: 1 }}>
{/* Call Messages [] */}
{callMessages.map((message) =>
+2 -2
View File
@@ -16,7 +16,7 @@ export function CallStatus(props: {
callerName?: string,
statusText: string,
regardingText: string | null,
micError: boolean, speakError: boolean,
micError: boolean, micErrorMessage: string | null, speakError: boolean,
// llmComponent?: React.JSX.Element,
}) {
return (
@@ -37,7 +37,7 @@ export function CallStatus(props: {
</Typography>}
{props.micError && <InlineError
severity='danger' error='Looks like this Browser may not support speech recognition. You can try Chrome on Windows or Android instead.' />}
severity='danger' error={props.micErrorMessage || 'Looks like this Browser may not support speech recognition. You can try Chrome on Windows or Android instead.'} />}
{props.speakError && <InlineError
severity='danger' error='Text-to-speech does not appear to be configured. Please set it up in Preferences > Voice.' />}
@@ -1,4 +1,5 @@
import * as React from 'react';
import { AudioPlayer } from '~/common/util/audio/AudioPlayer';
@@ -8,15 +9,16 @@ import { AudioPlayer } from '~/common/util/audio/AudioPlayer';
* @param firstDelay The delay before the first play, in milliseconds.
* @param repeatMs The delay between each repeat, in milliseconds. If 0, the sound will only play once.
*/
export function usePlayUrl(url: string | null, firstDelay: number = 0, repeatMs: number = 0) {
export function usePlayUrlInterval(url: string | null, firstDelay: number = 0, repeatMs: number = 0) {
React.useEffect(() => {
if (!url) return;
const abortController = new AbortController();
let timer2: any = null;
const playFirstTime = () => {
const playAudio = () => AudioPlayer.playUrl(url);
void playAudio();
const playAudio = () => void AudioPlayer.playUrl(url, abortController.signal).catch(() => {/* autoplay may be blocked */});
playAudio();
timer2 = repeatMs > 0 ? setInterval(playAudio, repeatMs) : null;
};
@@ -24,8 +26,8 @@ export function usePlayUrl(url: string | null, firstDelay: number = 0, repeatMs:
return () => {
clearTimeout(timer1);
if (timer2)
clearInterval(timer2);
timer2 && clearInterval(timer2);
abortController?.abort();
};
}, [firstDelay, repeatMs, url]);
}
+160 -93
View File
@@ -2,15 +2,12 @@ import * as React from 'react';
import { Panel, PanelGroup, PanelResizeHandle } from 'react-resizable-panels';
import type { SxProps } from '@mui/joy/styles/types';
import { useTheme } from '@mui/joy';
import { Box, useTheme } from '@mui/joy';
import { DEV_MODE_SETTINGS } from '../settings-modal/UxLabsSettings';
import { DiagramConfig, DiagramsModal } from '~/modules/aifn/digrams/DiagramsModal';
import { FlattenerModal } from '~/modules/aifn/flatten/FlattenerModal';
import { TradeConfig, TradeModal } from '~/modules/trade/TradeModal';
import type { DiagramConfig } from '~/modules/aifn/digrams/DiagramsModal';
import type { TradeConfig } from '~/modules/trade/TradeModal';
import { downloadSingleChat, importConversationsFromFilesAtRest, openConversationsAtRestPicker } from '~/modules/trade/trade.client';
import { imaginePromptFromTextOrThrow } from '~/modules/aifn/imagine/imaginePromptFromText';
import { elevenLabsSpeakText } from '~/modules/elevenlabs/elevenlabs.client';
import { useAreBeamsOpen } from '~/modules/beam/store-beam.hooks';
import { useCapabilityTextToImage } from '~/modules/t2i/t2i.client';
@@ -18,9 +15,10 @@ import type { DConversation, DConversationId } from '~/common/stores/chat/chat.c
import type { OptimaBarControlMethods } from '~/common/layout/optima/bar/OptimaBarDropdown';
import { ConfirmationModal } from '~/common/components/modals/ConfirmationModal';
import { ConversationsManager } from '~/common/chat-overlay/ConversationsManager';
import { LLM_IF_ANT_PromptCaching, LLM_IF_OAI_Vision } from '~/common/stores/llms/llms.types';
import { OptimaDrawerIn, OptimaToolbarIn } from '~/common/layout/optima/portals/OptimaPortalsIn';
import { PanelResizeInset } from '~/common/components/panes/GoodPanelResizeHandler';
import { ErrorBoundary } from '~/common/components/ErrorBoundary';
import { getLLMContextTokens, LLM_IF_ANT_PromptCaching, LLM_IF_OAI_Vision } from '~/common/stores/llms/llms.types';
import { OptimaDrawerIn, OptimaPanelIn, OptimaToolbarIn } from '~/common/layout/optima/portals/OptimaPortalsIn';
import { PanelResizeInset } from '~/common/components/PanelResizeInset';
import { Release } from '~/common/app.release';
import { ScrollToBottom } from '~/common/scroll-to-bottom/ScrollToBottom';
import { ScrollToBottomButton } from '~/common/scroll-to-bottom/ScrollToBottomButton';
@@ -28,28 +26,29 @@ import { ShortcutKey, useGlobalShortcuts } from '~/common/components/shortcuts/u
import { WorkspaceIdProvider } from '~/common/stores/workspace/WorkspaceIdProvider';
import { addSnackbar, removeSnackbar } from '~/common/components/snackbar/useSnackbarsStore';
import { createDMessageFromFragments, createDMessagePlaceholderIncomplete, DMessageMetadata, duplicateDMessageMetadata } from '~/common/stores/chat/chat.message';
import { createErrorContentFragment, createTextContentFragment, DMessageAttachmentFragment, DMessageContentFragment, duplicateDMessageFragmentsNoVoid } from '~/common/stores/chat/chat.fragments';
import { createErrorContentFragment, createTextContentFragment, DMessageAttachmentFragment, DMessageContentFragment, duplicateDMessageFragments } from '~/common/stores/chat/chat.fragments';
import { gcChatImageAssets } from '~/common/stores/chat/chat.gc';
import { getChatLLMId } from '~/common/stores/llms/store-llms';
import { getConversation, getConversationSystemPurposeId, useConversation } from '~/common/stores/chat/store-chats';
import { optimaActions, optimaOpenModels, optimaOpenPreferences, useSetOptimaAppMenu } from '~/common/layout/optima/useOptima';
import { themeBgAppChatComposer } from '~/common/app.theme';
import { useChatLLM } from '~/common/stores/llms/llms.hooks';
import { optimaActions, optimaOpenModels, optimaOpenPreferences, useOptimaChromeless } from '~/common/layout/optima/useOptima';
import { useFolderStore } from '~/common/stores/folders/store-chat-folders';
import { useIsMobile, useIsTallScreen } from '~/common/components/useMatchMedia';
import { useLLM } from '~/common/stores/llms/llms.hooks';
import { useModelDomain } from '~/common/stores/llms/hooks/useModelDomain';
import { useOverlayComponents } from '~/common/layout/overlays/useOverlayComponents';
import { useRouterQuery } from '~/common/app.routes';
import { useUXLabsStore } from '~/common/state/store-ux-labs';
import { useUIComplexityIsMinimal } from '~/common/stores/store-ui';
import { ChatPane } from './components/layout-pane/ChatPane';
import { ChatBarAltBeam } from './components/layout-bar/ChatBarAltBeam';
import { ChatBarBeam } from './components/layout-bar/ChatBarBeam';
import { ChatBarAltTitle } from './components/layout-bar/ChatBarAltTitle';
import { ChatBarDropdowns } from './components/layout-bar/ChatBarDropdowns';
import { ChatBarChat } from './components/layout-bar/ChatBarChat';
import { ChatBeamWrapper } from './components/ChatBeamWrapper';
import { ChatDrawerMemo } from './components/layout-drawer/ChatDrawer';
import { ChatMessageList } from './components/ChatMessageList';
import { Composer } from './components/composer/Composer';
import { usePanesManager } from './components/panes/usePanesManager';
import { PaneTitleOverlay } from './components/PaneTitleOverlay';
import { useComposerAutoHide } from './components/composer/useComposerAutoHide';
import { usePanesManager } from './components/panes/store-panes-manager';
import type { ChatExecuteMode } from './execute-mode/execute-mode.types';
@@ -74,24 +73,52 @@ const chatMessageListSx: SxProps = {
flexGrow: 1,
};
/*const chatMessageListBrandedSx: SxProps = {
flexGrow: 1,
backgroundBlendMode: 'soft-light',
backgroundColor: themeBgApp,
backgroundImage: 'url(https://...)',
backgroundPosition: 'center',
backgroundRepeat: 'no-repeat',
backgroundSize: 'contain',
} as const;*/
const chatBeamWrapperSx: SxProps = {
flexGrow: 1,
// we added these after removing the minSize={20} (%) from the containing panel.
minWidth: '18rem',
// minHeight: 'calc(100vh - 69px - var(--AGI-Nav-width))',
};
const composerOpenSx: SxProps = {
zIndex: 21, // just to allocate a surface, and potentially have a shadow
// NOTE: disabled on 2025-03-05: conflicts with the GlobalDragOverlay's
// zIndex: 21, // just to allocate a surface, and potentially have a shadow
minWidth: { md: 480 }, // don't get compresses too much on desktop
backgroundColor: themeBgAppChatComposer,
// backgroundColor: themeBgAppChatComposer, // inlined in the Composer
transition: 'background-color 0.5s ease-out',
borderTop: `1px solid`,
borderTopColor: 'rgba(var(--joy-palette-neutral-mainChannel, 99 107 116) / 0.4)',
// hack: eats the bottom of the last message (as it has a 1px divider)
mt: '-1px',
};
// NOTE: commented on 2024-05-13, as other content was stepping on the border due to it and missing zIndex
// mt: '-1px',
} as const;
const composerClosedSx: SxProps = {
display: 'none',
};
const composerOpenMobileSx: SxProps = {
zIndex: 21, // allocates the surface, possibly enables shadow if we like
py: 0.5, // have some breathing room
// boxShadow: '0px -1px 8px -2px rgba(0, 0, 0, 0.4)',
...composerOpenSx,
} as const;
// const composerClosedSx: SxProps = {
// display: 'none',
// };
// Lazy-loaded Modals
const DiagramsModalLazy = React.lazy(() => import('~/modules/aifn/digrams/DiagramsModal').then(module => ({ default: module.DiagramsModal })));
const FlattenerModalLazy = React.lazy(() => import('~/modules/aifn/flatten/FlattenerModal').then(module => ({ default: module.FlattenerModal })));
const TradeModalLazy = React.lazy(() => import('~/modules/trade/TradeModal').then(module => ({ default: module.TradeModal })));
export function AppChat() {
@@ -111,21 +138,23 @@ export function AppChat() {
// external state
const theme = useTheme();
const [composerHasContent, setComposerHasContent] = React.useState(false);
const isMobile = useIsMobile();
const isTallScreen = useIsTallScreen();
const isZenMode = useUIComplexityIsMinimal();
const intent = useRouterQuery<Partial<AppChatIntent>>();
const showAltTitleBar = useUXLabsStore(state => DEV_MODE_SETTINGS && state.labsChatBarAlt === 'title');
const { chatLLM } = useChatLLM();
const { domainModelId: chatLLMId } = useModelDomain('primaryChat');
const chatLLM = useLLM(chatLLMId) ?? null;
const {
// state
chatPanes,
focusedPaneConversationId, // <-- key
focusedPaneIndex,
focusedPaneConversationId,
// actions
navigateHistoryInFocusedPane,
openConversationInFocusedPane,
@@ -147,10 +176,10 @@ export function AppChat() {
}, [chatPanes]);
const beamsOpens = useAreBeamsOpen(paneBeamStores);
const beamOpenStoreInFocusedPane = React.useMemo(() => {
const open = focusedPaneIndex !== null ? (beamsOpens?.[focusedPaneIndex] ?? false) : false;
return open ? paneBeamStores?.[focusedPaneIndex!] ?? null : null;
}, [beamsOpens, focusedPaneIndex, paneBeamStores]);
const beamOpenStoreInFocusedPane = focusedPaneIndex === null ? null
: !beamsOpens?.[focusedPaneIndex] ? null
: paneBeamStores?.[focusedPaneIndex] ?? null;
const focusedChatBeamOpen = focusedPaneIndex !== null && !!beamsOpens?.[focusedPaneIndex];
const {
// focused
@@ -171,7 +200,7 @@ export function AppChat() {
// const focusedConversationWorkspaceId = workspaceForConversationIdentity(focusedPaneConversationId);
//// const focusedConversationWorkspace = useWorkspaceIdForConversation(focusedPaneConversationId);
const { mayWork: capabilityHasT2I } = useCapabilityTextToImage();
const { mayWork: capabilityHasT2I, mayEdit: capabilityHasT2IEdit } = useCapabilityTextToImage();
const activeFolderId = useFolderStore(({ enableFolders, folders }) => {
const activeFolderId = enableFolders ? _activeFolderId : null;
@@ -179,6 +208,10 @@ export function AppChat() {
return activeFolder?.id ?? null;
});
// Composer Auto-hiding
const isChromeless = useOptimaChromeless() && isMobile; // auto-hide on Chromeless too
const forceComposerHide = isChromeless || !!beamOpenStoreInFocusedPane /* || !focusedPaneConversationId */; // auto-hide when no chat (the 'please select a conversation...' state) doesn't feel good
const composerAutoHide = useComposerAutoHide(forceComposerHide, composerHasContent);
// Window actions
@@ -211,7 +244,7 @@ export function AppChat() {
else if (outcome === 'err-t2i-unconfigured')
optimaOpenPreferences('draw');
else if (outcome === 'err-no-persona')
addSnackbar({ key: 'chat-no-persona', message: 'No persona selected.', type: 'issue' });
addSnackbar({ key: 'chat-no-persona', message: 'No persona selected.', type: 'issue', overrides: { autoHideDuration: 4000 } });
else if (outcome === 'err-no-conversation')
addSnackbar({ key: 'chat-no-conversation', message: 'No active conversation.', type: 'issue' });
else if (outcome === 'err-no-last-message')
@@ -237,7 +270,7 @@ export function AppChat() {
// create the user:message
// NOTE: this can lead to multiple chat messages with data refs that are referring to the same dblobs,
// however, we already got transferred ownership of the dblobs at this point.
const userMessage = createDMessageFromFragments('user', duplicateDMessageFragmentsNoVoid(fragments)); // [chat] create user:message to send per-chat
const userMessage = createDMessageFromFragments('user', duplicateDMessageFragments(fragments, true)); // [chat] create user:message to send per-chat
if (metadata) userMessage.metadata = duplicateDMessageMetadata(metadata);
ConversationsManager.getHandler(conversation.id).messageAppend(userMessage); // [chat] append user message in each conversation
@@ -307,11 +340,6 @@ export function AppChat() {
});
}, [handleExecuteAndOutcome]);
const handleTextSpeak = React.useCallback(async (text: string): Promise<void> => {
await elevenLabsSpeakText(text, undefined, true, true);
}, []);
// Chat actions
const handleConversationNewInFocusedPane = React.useCallback((forceNoRecycle: boolean, isIncognito: boolean) => {
@@ -329,9 +357,10 @@ export function AppChat() {
useFolderStore.getState().addConversationToFolder(activeFolderId, conversationId);
// focus the composer
composerTextAreaRef.current?.focus();
if (!isMobile)
composerTextAreaRef.current?.focus();
}, [activeFolderId, focusedPaneConversationId, handleOpenConversationInFocusedPane, prependNewConversation, recycleNewConversationId]);
}, [activeFolderId, focusedPaneConversationId, handleOpenConversationInFocusedPane, isMobile, prependNewConversation, recycleNewConversationId]);
const handleConversationImportDialog = React.useCallback(() => setTradeConfig({ dir: 'import' }), []);
@@ -429,18 +458,18 @@ export function AppChat() {
// Pluggable Optima components
const barAltTitle = showAltTitleBar ? focusedChatTitle ?? 'No Chat' : null;
const barAltTitle = null;
const focusedBarContent = React.useMemo(() => beamOpenStoreInFocusedPane
? <ChatBarAltBeam beamStore={beamOpenStoreInFocusedPane} isMobile={isMobile} />
? <ChatBarBeam conversationTitle={focusedChatTitle ?? 'No Chat'} beamStore={beamOpenStoreInFocusedPane} isMobile={isMobile} />
: (barAltTitle === null)
? <ChatBarDropdowns conversationId={focusedPaneConversationId} llmDropdownRef={llmDropdownRef} personaDropdownRef={personaDropdownRef} />
? <ChatBarChat conversationId={focusedPaneConversationId} llmDropdownRef={llmDropdownRef} personaDropdownRef={personaDropdownRef} />
: <ChatBarAltTitle conversationId={focusedPaneConversationId} conversationTitle={barAltTitle} />
, [barAltTitle, beamOpenStoreInFocusedPane, focusedPaneConversationId, isMobile],
, [barAltTitle, beamOpenStoreInFocusedPane, focusedChatTitle, focusedPaneConversationId, isMobile],
);
// Disabled by default, as it lags the opening of the drawer and immediatly vanishes during the closing animation
// Disabled by default, as it lags the opening of the drawer and immediately vanishes during the closing animation
const isDrawerOpen = true; // useOptimaDrawerOpen();
const drawerContent = React.useMemo(() => !isDrawerOpen ? null :
@@ -450,6 +479,7 @@ export function AppChat() {
activeFolderId={activeFolderId}
chatPanesConversationIds={paneUniqueConversationIds}
disableNewButton={disableNewButton}
focusedChatBeamOpen={focusedChatBeamOpen}
onConversationActivate={handleOpenConversationInFocusedPane}
onConversationBranch={handleConversationBranch}
onConversationNew={handleConversationNewInFocusedPane}
@@ -458,11 +488,12 @@ export function AppChat() {
onConversationsImportDialog={handleConversationImportDialog}
setActiveFolderId={setActiveFolderId}
/>,
[activeFolderId, disableNewButton, focusedPaneConversationId, handleConversationBranch, handleConversationExport, handleConversationImportDialog, handleConversationNewInFocusedPane, handleDeleteConversations, handleOpenConversationInFocusedPane, isDrawerOpen, paneUniqueConversationIds],
[activeFolderId, disableNewButton, focusedChatBeamOpen, focusedPaneConversationId, handleConversationBranch, handleConversationExport, handleConversationImportDialog, handleConversationNewInFocusedPane, handleDeleteConversations, handleOpenConversationInFocusedPane, isDrawerOpen, paneUniqueConversationIds],
);
const focusedMenuItems = React.useMemo(() =>
const focusedChatPanelContent = React.useMemo(() => !focusedPaneConversationId ? null :
<ChatPane
isMobile={isMobile}
conversationId={focusedPaneConversationId}
disableItems={!focusedPaneConversationId || isFocusedChatEmpty}
hasConversations={hasConversations}
@@ -477,8 +508,6 @@ export function AppChat() {
[focusedPaneConversationId, handleConversationBranch, handleConversationFlatten, handleConversationReset, hasConversations, isFocusedChatEmpty, isMessageSelectionMode, isMobile, isTallScreen],
);
useSetOptimaAppMenu(focusedMenuItems, 'AppChat');
// Effects
@@ -486,7 +515,7 @@ export function AppChat() {
React.useEffect(() => {
// Debug: open a null chat
if (Release.IsNodeDevBuild && intent.initialConversationId === 'null')
openConversationInFocusedPane(null! /* for debugging purporse */);
openConversationInFocusedPane(null! /* for debugging purpose */);
// Open the initial conversation if set
else if (intent.initialConversationId)
openConversationInFocusedPane(intent.initialConversationId);
@@ -578,8 +607,11 @@ export function AppChat() {
return <>
<OptimaDrawerIn>{drawerContent}</OptimaDrawerIn>
{/* -> Toolbar, -> Drawer, -> Panel*/}
<OptimaToolbarIn>{focusedBarContent}</OptimaToolbarIn>
<OptimaDrawerIn>{drawerContent}</OptimaDrawerIn>
<OptimaPanelIn>{focusedChatPanelContent}</OptimaPanelIn>
<PanelGroup
direction={(isMobile || isTallScreen) ? 'vertical' : 'horizontal'}
@@ -596,20 +628,22 @@ export function AppChat() {
const _panesCount = chatPanes.length;
const _keyAndId = `chat-pane-${pane.paneId}`;
const _sepId = `sep-pane-${idx}`;
return <WorkspaceIdProvider conversationId={_paneIsFocused ? _paneConversationId : null} key={_keyAndId}>
return <WorkspaceIdProvider conversationId={_paneIsFocused ? _paneConversationId : null} key={_keyAndId}><ErrorBoundary>
<Panel
id={_keyAndId}
order={idx}
collapsible={chatPanes.length === 2}
defaultSize={(_panesCount === 3 && idx === 1) ? 34 : Math.round(100 / _panesCount)}
minSize={20}
// minSize={20 /* IMPORTANT: this forces a reflow even on a simple on hover */}
onClick={(event) => {
const setFocus = chatPanes.length < 2 || !event.altKey;
setFocusedPaneIndex(setFocus ? idx : -1);
// Alt + Click: undocumented feature to clear focus
if (event.altKey && chatPanes.length > 1)
return setFocusedPaneIndex(-1);
setFocusedPaneIndex(idx);
}}
onCollapse={() => {
// NOTE: despite the delay to try to let the draggin settle, there seems to be an issue with the Pane locking the screen
// NOTE: despite the delay to try to let the dragging settle, there seems to be an issue with the Pane locking the screen
// setTimeout(() => removePane(idx), 50);
// more than 2 will result in an assertion from the framework
if (chatPanes.length === 2) removePane(idx);
@@ -618,28 +652,45 @@ export function AppChat() {
// for anchoring the scroll button in place
position: 'relative',
...(isMultiPane ? {
marginBottom: '1px', // compensates for the -1px in `composerOpenSx` for the Composer offset
borderRadius: '0.375rem',
border: `2px solid ${_paneIsFocused
borderStyle: 'solid',
borderColor: _paneIsFocused
? ((willMulticast || !isMultiConversationId) ? theme.palette.primary.solidBg : theme.palette.primary.solidBg)
: ((willMulticast || !isMultiConversationId) ? theme.palette.primary.softActiveBg : theme.palette.background.level1)}`,
: ((willMulticast || !isMultiConversationId) ? theme.palette.primary.softActiveBg : theme.palette.divider),
borderWidth: '2px',
// borderBottomWidth: '3px',
// DISABLED on 2024-03-13, it gets in the way quite a lot
// filter: (!willMulticast && !_paneIsFocused)
// ? (!isMultiConversationId ? 'grayscale(66.67%)' /* clone of the same */ : 'grayscale(66.67%)')
// : undefined,
// 2025-02-27: didn't try, here's another version
// filter: _paneIsFocused ? 'none' : 'brightness(0.94) saturate(0.9)',
} : {
// NOTE: this is a workaround for the 'stuck-after-collapse-close' issue. We will collapse the 'other' pane, which
// will get it removed (onCollapse), and somehow this pane will be stuck with a pointerEvents: 'none' style, which de-facto
// disables further interaction with the chat. This is a workaround to re-enable the pointer events.
// The root cause seems to be a Dragstate not being reset properly, however the pointerEvents has been set since 0.0.56 while
// The root cause seems to be a Drag state not being reset properly, however the pointerEvents has been set since 0.0.56 while
// it was optional before: https://github.com/bvaughn/react-resizable-panels/issues/241
pointerEvents: 'auto',
}),
...((_paneIsIncognito && {
backgroundColor: theme.palette.background.level3,
backgroundImage: 'repeating-linear-gradient(45deg, rgba(0,0,0,0.03), rgba(0,0,0,0.03) 10px, transparent 10px, transparent 20px)',
})),
}}
>
{isMultiPane && !isZenMode && (
<PaneTitleOverlay
paneIdx={idx}
conversationId={_paneConversationId}
isFocused={_paneIsFocused}
isIncognito={_paneIsIncognito}
onConversationDelete={handleDeleteConversations}
/>
)}
<ScrollToBottom
bootToBottom
stickToBottomInitial
@@ -653,7 +704,7 @@ export function AppChat() {
conversationHandler={_paneChatHandler}
capabilityHasT2I={capabilityHasT2I}
chatLLMAntPromptCaching={chatLLM?.interfaces?.includes(LLM_IF_ANT_PromptCaching) ?? false}
chatLLMContextTokens={chatLLM?.contextTokens ?? null}
chatLLMContextTokens={getLLMContextTokens(chatLLM) ?? null}
chatLLMSupportsImages={chatLLM?.interfaces?.includes(LLM_IF_OAI_Vision) ?? false}
fitScreen={isMobile || isMultiPane}
isMobile={isMobile}
@@ -664,7 +715,6 @@ export function AppChat() {
onConversationNew={handleConversationNewInFocusedPane}
onTextDiagram={handleTextDiagram}
onTextImagine={handleImagineFromText}
onTextSpeak={handleTextSpeak}
sx={chatMessageListSx}
/>
)}
@@ -691,50 +741,67 @@ export function AppChat() {
</PanelResizeHandle>
)}
</WorkspaceIdProvider>;
</ErrorBoundary></WorkspaceIdProvider>;
})}
</PanelGroup>
<Composer
isMobile={isMobile}
chatLLM={chatLLM}
composerTextAreaRef={composerTextAreaRef}
targetConversationId={focusedPaneConversationId}
capabilityHasT2I={capabilityHasT2I}
isMulticast={!isMultiConversationId ? null : isComposerMulticast}
isDeveloperMode={isFocusedChatDeveloper}
onAction={handleComposerAction}
onConversationsImportFromFiles={handleConversationsImportFromFiles}
onTextImagine={handleImagineFromText}
setIsMulticast={setIsComposerMulticast}
sx={beamOpenStoreInFocusedPane ? composerClosedSx : composerOpenSx}
/>
{/* Composer with auto-hide */}
<Box {...composerAutoHide.compressorProps}>
<div style={composerAutoHide.compressibleStyle}>
<Composer
isMobile={isMobile}
chatLLM={chatLLM}
composerTextAreaRef={composerTextAreaRef}
targetConversationId={focusedPaneConversationId}
capabilityHasT2I={capabilityHasT2I}
capabilityHasT2IEdit={capabilityHasT2IEdit}
isMulticast={!isMultiConversationId ? null : isComposerMulticast}
isDeveloperMode={isFocusedChatDeveloper}
onAction={handleComposerAction}
onConversationBeamEdit={handleMessageBeamLastInFocusedPane}
onConversationsImportFromFiles={handleConversationsImportFromFiles}
onTextImagine={handleImagineFromText}
setIsMulticast={setIsComposerMulticast}
onComposerHasContent={setComposerHasContent}
sx={isMobile ? composerOpenMobileSx : composerOpenSx}
/>
</div>
</Box>
{/* Hover zone for auto-hide */}
{!isChromeless && !forceComposerHide && composerAutoHide.isHidden && <Box {...composerAutoHide.detectorProps} />}
{/* Diagrams */}
{!!diagramConfig && (
<DiagramsModal
config={diagramConfig}
onClose={() => setDiagramConfig(null)}
/>
<React.Suspense fallback={null}>
<DiagramsModalLazy
config={diagramConfig}
onClose={() => setDiagramConfig(null)}
/>
</React.Suspense>
)}
{/* Flatten */}
{!!flattenConversationId && (
<FlattenerModal
conversationId={flattenConversationId}
onConversationBranch={handleConversationBranch}
onClose={() => setFlattenConversationId(null)}
/>
<React.Suspense fallback={null}>
<FlattenerModalLazy
conversationId={flattenConversationId}
onConversationBranch={handleConversationBranch}
onClose={() => setFlattenConversationId(null)}
/>
</React.Suspense>
)}
{/* Import / Export */}
{!!tradeConfig && (
<TradeModal
config={tradeConfig}
onConversationActivate={handleOpenConversationInFocusedPane}
onClose={() => setTradeConfig(null)}
/>
<React.Suspense fallback={null}>
<TradeModalLazy
config={tradeConfig}
onConversationActivate={handleOpenConversationInFocusedPane}
onClose={() => setTradeConfig(null)}
/>
</React.Suspense>
)}
</>;
+41 -12
View File
@@ -1,19 +1,41 @@
import * as React from 'react';
import type { SxProps } from '@mui/joy/styles/types';
import { Box, Modal, ModalClose } from '@mui/joy';
import { Box, IconButton, Modal } from '@mui/joy';
import CloseFullscreenIcon from '@mui/icons-material/CloseFullscreen';
import { BeamStoreApi, useBeamStore } from '~/modules/beam/store-beam.hooks';
import { BeamView } from '~/modules/beam/BeamView';
import { GoodTooltip } from '~/common/components/GoodTooltip';
import { ScrollToBottom } from '~/common/scroll-to-bottom/ScrollToBottom';
import { themeZIndexBeamView } from '~/common/app.theme';
/*const overlaySx: SxProps = {
position: 'absolute',
inset: 0,
zIndex: themeZIndexBeamView, // stay on top of Message > Chips (:1), and Overlays (:2) - note: Desktop Drawer (:26)
}*/
const beamWrapperStyles = {
wrapper: {
position: 'absolute',
inset: 0,
backgroundColor: 'background.level2', // darker than the expected Level1, for a change
} as const,
closeContainer: {
position: 'absolute',
top: '0.25rem',
// left: '0.25rem',
left: { xs: 'calc(50% - 3rem)', md: '50%' }, // center on desktop, a bit left (for the islands) on mobile
// transform: 'translate(-50%, 0)',
zIndex: themeZIndexBeamView, // stay on top of Message > Chips (:1), and Overlays (:2) - note: Desktop Drawer (:26)
} as const,
closeButton: {
// color: 'white',
// borderRadius: '25%',
boxShadow: 'md',
} as const,
} as const;
export function ChatBeamWrapper(props: {
@@ -40,15 +62,22 @@ export function ChatBeamWrapper(props: {
return isMaximized ? (
<Modal open onClose={handleUnMaximize}>
<Box sx={{
backgroundColor: 'background.level1',
position: 'absolute',
inset: 0,
}}>
<Box sx={beamWrapperStyles.wrapper}>
<ScrollToBottom disableAutoStick>
{beamView}
</ScrollToBottom>
<ModalClose sx={{ color: 'white', backgroundColor: 'background.surface', boxShadow: 'xs', mr: 2 }} />
{/* Modal-Close-alike */}
<Box sx={beamWrapperStyles.closeContainer}>
<GoodTooltip title='Exit maximized mode'>
<IconButton variant='solid' onClick={handleUnMaximize} sx={beamWrapperStyles.closeButton}>
<CloseFullscreenIcon />
{/*<CloseRoundedIcon />*/}
</IconButton>
</GoodTooltip>
</Box>
</Box>
</Modal>
) : (

Some files were not shown because too many files have changed in this diff Show More