From f1f28bca617d0022fea567e8b6174b8e6054d9df Mon Sep 17 00:00:00 2001 From: Darsh Mashru Date: Tue, 27 Aug 2024 22:32:11 +0530 Subject: [PATCH 1/6] Fixed a redirection link typo Changed "config-local-ollama" to "config-local-ollama.md" in link --- docs/environment-variables.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/environment-variables.md b/docs/environment-variables.md index e412213c0..fa9a1ad87 100644 --- a/docs/environment-variables.md +++ b/docs/environment-variables.md @@ -99,7 +99,7 @@ requiring the user to enter an API key | `LOCALAI_API_HOST` | Sets the URL of the LocalAI server, or defaults to http://127.0.0.1:8080 | Optional | | `LOCALAI_API_KEY` | The (Optional) API key for LocalAI | Optional | | `MISTRAL_API_KEY` | The API key for Mistral | Optional | -| `OLLAMA_API_HOST` | Changes the backend host for the Ollama vendor. See [config-local-ollama.md](config-local-ollama) | | +| `OLLAMA_API_HOST` | Changes the backend host for the Ollama vendor. See [config-local-ollama.md](config-local-ollama.md) | | | `OPENROUTER_API_KEY` | The API key for OpenRouter | Optional | | `PERPLEXITY_API_KEY` | The API key for Perplexity | Optional | | `TOGETHERAI_API_KEY` | The API key for Together AI | Optional | From 91a5878460f9c66f8fd949cdc946fd97179f5214 Mon Sep 17 00:00:00 2001 From: Enrico Ros Date: Mon, 16 Sep 2024 13:08:36 -0700 Subject: [PATCH 2/6] Help: microphone troubleshooting. --- docs/help-feature-microphone.md | 132 ++++++++++++++++++++++++++++++++ 1 file changed, 132 insertions(+) create mode 100644 docs/help-feature-microphone.md diff --git a/docs/help-feature-microphone.md b/docs/help-feature-microphone.md new file mode 100644 index 000000000..8e0bac748 --- /dev/null +++ b/docs/help-feature-microphone.md @@ -0,0 +1,132 @@ +# Enabling Microphone Access for Speech Recognition + +This guide explains how to enable microphone access for speech recognition in various browsers and mobile devices. +Ensuring microphone access is essential for using voice features in applications like big-AGI. + +## Desktop Browsers + +### Google Chrome (All Platforms, recommended) + +1. Open the website (e.g., big-AGI) in Chrome. +2. Click the **lock icon** in the address bar. +3. In the dropdown, find **"Microphone"**. + - Set it to **"Allow"**. +4. If "Microphone" isn't listed: + - Click on **"Site settings"**. + - Find **"Microphone"** in the permissions list. + - Change the setting to **"Allow"**. +5. **Refresh** the page. + +### Safari (macOS) + +1. Open the website in Safari. +2. Click **"Safari"** in the top menu bar. +3. Select **"Preferences"** or **"Settings**" (depending on your version). +4. Go to the **"Websites"** tab. +5. Select **"Microphone"** from the sidebar. +6. Find the website in the list and set it to **"Allow"**. +7. Close the Preferences window. +8. **Refresh** the page. + +### Microsoft Edge (Windows) + +1. Open the website in Edge. +2. Click the **lock icon** in the address bar. +3. Click **"Permissions for this site"**. +4. Find **"Microphone"**. + - Set it to **"Allow"**. +5. **Refresh** the page. + +### Firefox (All Platforms) + +> **Note:** The Speech Recognition API is **not supported** in Firefox. If you're using Firefox, please switch to a supported browser to use speech recognition +> features. + +## Mobile Devices + +### Android (Chrome) + +1. Open the website in Chrome. +2. Tap the **lock icon** in the address bar. +3. Tap **"Permissions"**. +4. Find **"Microphone"**. + - Set it to **"Allow"**. +5. **Refresh** the page. + +### iOS (Safari) + +1. Open the **Settings** app on your device. +2. Scroll down and tap **"Safari"**. +3. Tap **"Microphone"**. +4. Ensure **"Ask"** or **"Allow"** is selected. +5. Return to Safari and open the website. +6. If prompted, allow microphone access. +7. **Refresh** the page. + +### iOS (Chrome) + +> **Note:** Chrome on iOS uses Safari's engine due to system limitations. Microphone permissions are managed through iOS settings. + +1. Open the **Settings** app. +2. Scroll down and tap **"Chrome"**. +3. Ensure **"Microphone"** is toggled **on**. +4. Open Chrome and navigate to the website. +5. If prompted, allow microphone access. +6. **Refresh** the page. + +## Troubleshooting + +If you're still experiencing issues after enabling microphone access: + +**Check System Permissions (macOS):** + +- Open **System Settings**. +- Go to **"Privacy & Security"**. +- Select the **"Privacy"** tab. +- Click **"Microphone"** in the sidebar. +- Ensure your browser (e.g., Chrome, Safari) is checked. +- You may need to unlock the settings by clicking the lock icon at the bottom. + +**Check Microphone Access (Windows):** + +- Open **Settings**. +- Go to **"Privacy"** > **"Microphone"**. +- Ensure **"Allow apps to access your microphone"** is **on**. +- Scroll down and make sure your browser is allowed. + +**Close Other Applications:** + +- Close any applications that might be using the microphone. + +**Restart the Browser:** + +- Close all browser windows and reopen. + +**Update Your Browser:** + +- Ensure you're using the latest version. + +**Check for Browser Extensions:** + +- Disable extensions that might block access to the microphone. + +For persistent issues, consult your browser's official support resources or contact big-AGI support. + +## Technical Details + +Big-AGI uses the [Web Speech API (SpeechRecognition)](https://developer.mozilla.org/en-US/docs/Web/API/SpeechRecognition) +to transcribe spoken words into text. This API provides real-time transcription with live previews and works on most +modern mobile and desktop browsers. + +**Note on Browser Support:** + +| Browser | Support Level | Notes | +|----------------|-----------------|------------------------------------------------------------------------| +| Google Chrome | โœ… Recommended | Fully supported on desktop and Android. Preferred for best experience. | +| Safari | โœ… Supported | Requires macOS/iOS 14 or later. | +| Microsoft Edge | โœ… Supported | Fully supported on desktop. | +| Firefox | โŒ Not Supported | SpeechRecognition API not available. | + +**Recommendation:** +For the best experience with speech recognition features, we strongly recommend using Google Chrome. +Ensure your browser is up to date to benefit from the latest features and security updates. From 05e74a7cd8f88cfff334144651cd5fa72e39c16e Mon Sep 17 00:00:00 2001 From: Enrico Ros Date: Tue, 17 Sep 2024 02:50:04 -0700 Subject: [PATCH 3/6] Update help-feature-microphone.md --- docs/help-feature-microphone.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/docs/help-feature-microphone.md b/docs/help-feature-microphone.md index 8e0bac748..8841d452e 100644 --- a/docs/help-feature-microphone.md +++ b/docs/help-feature-microphone.md @@ -19,6 +19,8 @@ Ensuring microphone access is essential for using voice features in applications ### Safari (macOS) +[https://vimeo.com/1010161384](https://vimeo.com/1010161384) + 1. Open the website in Safari. 2. Click **"Safari"** in the top menu bar. 3. Select **"Preferences"** or **"Settings**" (depending on your version). From 9b902e0f8bedc03246a7aefc4155b0b529944c43 Mon Sep 17 00:00:00 2001 From: Enrico Ros Date: Tue, 17 Sep 2024 11:29:50 -0700 Subject: [PATCH 4/6] Update help-feature-microphone.md --- docs/help-feature-microphone.md | 25 ++++++++++++++++--------- 1 file changed, 16 insertions(+), 9 deletions(-) diff --git a/docs/help-feature-microphone.md b/docs/help-feature-microphone.md index 8841d452e..c6d7939c1 100644 --- a/docs/help-feature-microphone.md +++ b/docs/help-feature-microphone.md @@ -19,16 +19,23 @@ Ensuring microphone access is essential for using voice features in applications ### Safari (macOS) -[https://vimeo.com/1010161384](https://vimeo.com/1010161384) +**[Watch the video tutorial: How to enable Speech Recognition in Safari](https://vimeo.com/1010161384)** -1. Open the website in Safari. -2. Click **"Safari"** in the top menu bar. -3. Select **"Preferences"** or **"Settings**" (depending on your version). -4. Go to the **"Websites"** tab. -5. Select **"Microphone"** from the sidebar. -6. Find the website in the list and set it to **"Allow"**. -7. Close the Preferences window. -8. **Refresh** the page. +If you're seeing a "Speech Recognition permission denied" error, follow these steps: + +1. Open **System Settings**. + - Go to **Privacy & Security** > **Speech Recognition**. + - Enable Safari in the list of allowed applications. + - Quit & Open Safari. +2. Click **Safari** in the top menu bar. + - Select **Settings**. + - Go to the **Websites** tab. + - Select **Microphone** from the sidebar. + - Find big-AGI (or localhost for developers) in the list and set it to **Allow**. + - Close the Settings window. +3. **Refresh** the page. + +This quick and simple fix should get essential voice input working in big-AGI on your Mac. ### Microsoft Edge (Windows) From 06d2bf2ba5629e11f99b8268e7958c9205f7bbb3 Mon Sep 17 00:00:00 2001 From: Enrico Ros Date: Tue, 17 Sep 2024 11:30:39 -0700 Subject: [PATCH 5/6] Update help-feature-microphone.md --- docs/help-feature-microphone.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/help-feature-microphone.md b/docs/help-feature-microphone.md index c6d7939c1..2d4fd6e7a 100644 --- a/docs/help-feature-microphone.md +++ b/docs/help-feature-microphone.md @@ -19,7 +19,7 @@ Ensuring microphone access is essential for using voice features in applications ### Safari (macOS) -**[Watch the video tutorial: How to enable Speech Recognition in Safari](https://vimeo.com/1010161384)** +**[Watch the video tutorial: How to enable Speech Recognition in Safari](https://vimeo.com/1010342201)** If you're seeing a "Speech Recognition permission denied" error, follow these steps: From 782c0cf17237efa9a86d0ae25458194872bed04f Mon Sep 17 00:00:00 2001 From: Enrico Ros Date: Tue, 17 Sep 2024 16:17:30 -0700 Subject: [PATCH 6/6] Link help-feature-microphone to the README --- docs/README.md | 61 ++++++++++++++++++++++++++------------------------ 1 file changed, 32 insertions(+), 29 deletions(-) diff --git a/docs/README.md b/docs/README.md index 6b442a071..8a2237cb9 100644 --- a/docs/README.md +++ b/docs/README.md @@ -1,60 +1,63 @@ -# big-AGI Documentation +# Big-AGI Documentation -Find all the information you need to get started, configure, and effectively use big-AGI. +Information you need to get started, configure, and use big-AGI productively. -[//]: # (## Quick Start) +## Getting Started -[//]: # (- **[Introduction](big-agi.md)**: Overview of big-AGI's features.) +Guides for basic big-AGI features: -## Configuration Guides +- **[Enabling Microphone for Speech Recognition](help-feature-microphone.md)**: Instructions to + allow speech recognition in browsers and apps. -Detailed guides to configure your big-AGI interface and models. +## AI Model Configuration -๐Ÿ‘‰ The following applies to the users of big-AGI.com, as the public instance is empty and to be configured by the user. +Detailed guides to configure AI models and advanced features in big-AGI. -- **Cloud Model Services**: +> ๐Ÿ‘‰ The following applies to users of big-AGI.com, as the public instance is empty and requires user configuration. + +- **Cloud AI Services**: - **[Azure OpenAI](config-azure-openai.md)** - **[OpenRouter](config-openrouter.md)** - - easy API key: **Anthropic**, **Google AI**, **Groq**, **Mistral**, **OpenAI**, **Perplexity**, **TogetherAI** + - Easy API key setup: **Anthropic**, **Deepseek**, **Google AI**, **Groq**, **Mistral**, **OpenAI**, **OpenPipe**, **Perplexity**, **TogetherAI** -- **Local Model Servers**: +- **Local AI Integrations**: - **[LocalAI](config-local-localai.md)** - **[LM Studio](config-local-lmstudio.md)** - **[Ollama](config-local-ollama.md)** - **[Oobabooga](config-local-oobabooga.md)** -- **Advanced Feature Configuration**: - - **[Browse](config-feature-browse.md)**: Enable web page download through third-party services or your own cloud (advanced) - - **ElevenLabs API**: Voice and cutom voice generation, only requires their API key - - **Google Search API**: guide not yet available, see the Google options in '[Environment Variables](environment-variables.md)' - - **Prodia API**: Stable Diffusion XL image generation, only requires their API key, alternative to DALLยทE +- **Enhanced AI Features**: + - **[Web Browsing](config-feature-browse.md)**: Enable web page download through third-party services or your own cloud (advanced) + - **Web Search**: Google Search API (see '[Environment Variables](environment-variables.md)') + - **Image Generation**: DALLยทE 3 and 2, or Prodia API for Stable Diffusion XL + - **Voice Synthesis**: ElevenLabs API for voice generation -## Deployment +## Deployment & Customization -System integrators, administrators, whitelabelers: instead of using the public big-AGI instance on get.big-agi.com, you can deploy your own instance. +> ๐Ÿ‘‰ The following applies to developers and experts who deploy their own big-AGI instance. -Step-by-step deployment and system configuration instructions. +For deploying a custom big-AGI instance: -- **[Installation](installation.md)**: Set up your own instance of big-AGI and related products - - build from source or use pre-built - - locally, in the public cloud, or on your own servers +- **[Installation Guide](installation.md)**: Set up your own big-AGI instance + - Source build or pre-built options + - Local, cloud, or on-premises deployment -- **Advanced Customizations**: - - **[Source code alterations guide](customizations.md)**: source code primer and alterations guidelines - - **[Basic Authentication](deploy-authentication.md)**: Optional, adds a username and password wall +- **Advanced Setup**: + - **[Source Code Customization Guide](customizations.md)**: Modify the source code + - **[Access Control](deploy-authentication.md)**: Optional, add basic user authentication - **[Database Setup](deploy-database.md)**: Optional, enables "Chat Link Sharing" - - **[Reverse Proxy](deploy-reverse-proxy.md)**: Optional, enables custom domain and SSL - - **[Environment Variables](environment-variables.md)**: ๐Ÿ“Œ Pre-configures models and services + - **[Reverse Proxy](deploy-reverse-proxy.md)**: Optional, enables custom domains and SSL + - **[Environment Variables](environment-variables.md)**: Pre-configures models and services -## Support and Community +## Community & Support -Join our community or get support: +Connect with the growing big-AGI community: - Visit our [GitHub repository](https://github.com/enricoros/big-AGI) for source code and issue tracking - Check the latest updates and features on [Changelog](changelog.md) or the in-app [News](https://get.big-agi.com/news) - Connect with us and other users on [Discord](https://discord.gg/MkH4qj2Jp9) for discussions, help, and sharing your experiences with big-AGI -Thank you for choosing big-AGI. We're excited to see what you'll build. +Thank you for choosing big-AGI. We're excited to give you the best tools to amplify yourself.