This fixes various issues where rustfmt failed to format code due to
overly long strings, most of which I stumbled across over the last week,
plus some additional ones I searched for while fixing the others.
Release Notes:
- N/A
Automatically retry the agent's LLM completion requests when the
provider returns 429 Too Many Requests. Uses the Retry-After header to
determine the retry delay if it is available.
Many providers are frequently overloaded or have low rate limits. These
providers are essentially unusable without automatic retries.
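A minimal sketch of the retry policy; the function name and the exponential-backoff fallback are illustrative, not the actual implementation:
```rust
use std::time::Duration;

// Decide whether (and how long) to wait before retrying a completion request.
fn retry_delay(status: u16, retry_after: Option<&str>, attempt: u32) -> Option<Duration> {
    if status != 429 {
        return None; // only retry on Too Many Requests
    }
    // Prefer the server-provided Retry-After value (in seconds) when
    // present; otherwise fall back to simple exponential backoff.
    let secs = retry_after
        .and_then(|header| header.parse::<u64>().ok())
        .unwrap_or_else(|| 2u64.saturating_pow(attempt));
    Some(Duration::from_secs(secs))
}
```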
Tested with Cerebras configured via openai_compatible.
Related: #31531
Release Notes:
- Added automatic retries for OpenAI-compatible LLM providers
---------
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>
Closes #42303
Ollama added tool call identifiers
(https://github.com/ollama/ollama/pull/12956) in its latest version
[v0.12.10](https://github.com/ollama/ollama/releases/tag/v0.12.10). This
broke our JSON schema and made all tool calls fail.
This PR fixes the schema and uses the Ollama provided tool call
identifier when available. We remain backwards compatible and still use
our own identifier with older versions of Ollama. I added a `TODO` to
remove the `Option` around the new field when most users have updated
their installations to v0.12.10 or above.
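A hedged sketch of the schema shape, with illustrative type and field names (not the actual Zed types):
```rust
use serde::Deserialize;

#[derive(Deserialize)]
struct OllamaToolCall {
    // TODO: make this required once most users run Ollama >= v0.12.10.
    id: Option<String>,
    function: OllamaFunctionCall,
}

#[derive(Deserialize)]
struct OllamaFunctionCall {
    name: String,
    arguments: serde_json::Value,
}

impl OllamaToolCall {
    // Use the server-provided identifier when available; otherwise fall
    // back to a locally generated one, matching pre-v0.12.10 behavior.
    fn effective_id(&self, generate: impl FnOnce() -> String) -> String {
        self.id.clone().unwrap_or_else(generate)
    }
}
```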
Note to reviewer: The fix to this issue should likely get cherry-picked
into the next release, since Ollama becomes unusable as an agent without
it.
Release Notes:
- Fixed tool calling when using the latest version of Ollama
This PR adds a new component to the `language_models` crate called
`ConfiguredApiCard`:
<img width="500" height="420" alt="Screenshot 2025-11-09 at 2 07@2x"
src="https://github.com/user-attachments/assets/655ea941-2df8-4489-a4da-bba34acf33a9"
/>
We were previously recreating this component from scratch with regular
divs in every LLM provider's render function, which was redundant: they
all looked essentially the same, with no major variations aside from
labels. We can clean up a bunch of similar code with this change, which
is cool!
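As a rough illustration of the consolidation (a sketch only, assuming gpui's `div` and `Styled` helpers; the real `ConfiguredApiCard` API surely differs), the shared component boils down to one parameterized element instead of per-provider div trees:
```rust
use gpui::prelude::*;
use gpui::{div, IntoElement, SharedString};

// One shared card, parameterized by the provider label, instead of each
// provider hand-rolling the same layout. Copy and layout are illustrative.
fn configured_api_card(provider_name: SharedString) -> impl IntoElement {
    div()
        .flex()
        .flex_col()
        .gap_1()
        .child(format!("{provider_name} API key configured"))
        .child("Reset the key to configure a different one.")
}
```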
Release Notes:
- N/A
Improve the layout and text display of API key configuration in multiple
language model providers to ensure proper text wrapping and ellipsis
handling when API URLs are long.
Before:
<img width="320" alt="image"
src="https://github.com/user-attachments/assets/2f89182c-34a0-4f95-a43a-c2be98d34873"
/>
After:
<img width="320" alt="image"
src="https://github.com/user-attachments/assets/09bf5cc3-07f0-47bc-b21a-d84b8b1caa67"
/>
Changes include:
- Add proper flex layout with overflow handling
- Replace truncate_and_trailoff with CSS text ellipsis
- Ensure consistent UI behavior across all providers
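A hedged sketch of the layout change described in the list above, using gpui's `Styled` helpers (the exact method combination in the PR may differ):
```rust
use gpui::prelude::*;
use gpui::{div, IntoElement};

// Let the URL row shrink and ellipsize instead of truncating the string
// up front with truncate_and_trailoff.
fn api_url_row(api_url: String) -> impl IntoElement {
    div()
        .flex()
        .flex_1()          // take the remaining space but allow shrinking
        .overflow_hidden() // clip content that would overflow the row
        .text_ellipsis()   // show an ellipsis for clipped text
        .child(api_url)
}
```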
Release Notes:
- Improved API key configuration display in language model settings
Closes #40097
When multiple files are added sequentially to the agent panel, the
request JSON incorrectly includes "text" elements containing only
spaces. These empty elements cause the Zhipu AI API to return a "text
cannot be empty" error.
The fix filters out any "text" elements that are empty or contain only
whitespace.
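A minimal sketch of that filtering step, with an illustrative content-part type:
```rust
// An illustrative content-part type; not the actual request types.
enum ContentPart {
    Text { text: String },
    Image { url: String },
}

// Drop any text part that is empty or whitespace-only before building
// the request body sent to the provider.
fn strip_blank_text(parts: Vec<ContentPart>) -> Vec<ContentPart> {
    parts
        .into_iter()
        .filter(|part| match part {
            ContentPart::Text { text } => !text.trim().is_empty(),
            _ => true,
        })
        .collect()
}
```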
UI state when the error occurs:
<img width="300" alt="Image"
src="https://github.com/user-attachments/assets/c55e5272-3f03-42c0-b412-fa24be2b0043"
/>
Request JSON (causing the error):
```
{
  "model": "glm-4.6",
  "messages": [
    {
      "role": "system",
      "content": "<<CUT>>"
    },
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "[@1.txt](zed:///agent/file?path=C%3A%5CTemp%5CTest%5C1.txt)"
        },
        { "type": "text", "text": " " },
        {
          "type": "text",
          "text": "[@2.txt](zed:///agent/file?path=C%3A%5CTemp%5CTest%5C2.txt)"
        },
        { "type": "text", "text": " describe" },
```
Release Notes:
- Fixed an issue when an OpenAI request contained whitespace-only text content
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>
I am using an Azure OpenAI instance, since that is what is provided at
work, and with how they have it set up, not all responses contain a
delta, which led to errors and truncated responses. This is related to
how they filter potentially offensive requests and responses. I don't
believe this filter was made in-house; I believe it is provided by
Microsoft/Azure, so I suspect this fix may help other users.
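A hedged sketch of the compatibility fix, assuming serde types for the stream chunks (names are illustrative): making `delta` optional lets content-filter chunks that omit it deserialize cleanly, and such chunks are simply skipped when mapping events.
```rust
use serde::Deserialize;

#[derive(Deserialize)]
struct StreamChoice {
    // Optional, so chunks without a delta no longer abort the stream.
    delta: Option<MessageDelta>,
    finish_reason: Option<String>,
}

#[derive(Deserialize)]
struct MessageDelta {
    content: Option<String>,
}
```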
Release Notes:
- N/A
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>
Closes #17524
This PR adds a button to the bottom right corner of the Ollama settings
UI. It resets the available Ollama models, and also resets the
"Connected" state in the process, which means it can be used to check
whether the connection is still valid. It's an open question whether we
should clear the available models on ALL `fetch_models` calls, since
these only happen during auth anyway.
Ollama is a local model provider, which means clicking the refresh
button often only flashes the "not connected" state because the latency
of the request is so low. This accentuates changes in the UI; however, I
don't think there's a way around it without adding some rather
cumbersome deferred UI updates.
I've attached the refresh button to the "Connected" `ButtonLike`, since
I don't think automatic UI spacing should separate these elements. I
think this is okay because the "Connected" indicator isn't actually
something the user can interact with.
Before:
<img width="211" height="245" alt="image"
src="https://github.com/user-attachments/assets/ea90e24a-b603-4ee2-9212-2917e1695774"
/>
After:
<img width="211" height="250" alt="image"
src="https://github.com/user-attachments/assets/be9af950-86a2-4067-87a0-52034a80a823"
/>
Alternative approach: There was also a suggestion to simply add an entry
to the command palette; however, none of the other providers currently
have that ability either, so I went with this approach, which also makes
the feature more discoverable to the user.
Release Notes:
- Added a button for refreshing available Ollama models
---------
Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de>
Add support for the GitHub Copilot `/responses` endpoint. This gives the
Copilot Chat provider the ability to use the new GPT-5 Codex model and
any other model that lacks support for the `/chat/completions` endpoint.
Closes #38858
Release Notes:
- Added support for the GitHub Copilot `/responses` endpoint.
# Added
1. `copilot_response.rs`, which contains the `/responses` endpoint types.
2. Uses the `/responses` endpoint if a model does not support
`/chat/completions` (see the sketch after this list).
3. A new `into_copilot_response()` to map `LanguageCompletionEvent`s to a
`Request`.
4. A new `map_stream()` to map response stream events to
`LanguageCompletionEvent`s, plus tests.
5. Fixed a bug where parsing non-streaming responses from
`/chat/completions` was failing.
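A hedged sketch of the endpoint dispatch in item 2; `supports_chat_completions` is an illustrative capability flag, not the actual field name:
```rust
struct ModelCapabilities {
    supports_chat_completions: bool,
}

fn endpoint_for(capabilities: &ModelCapabilities) -> &'static str {
    if capabilities.supports_chat_completions {
        "/chat/completions"
    } else {
        // Models like GPT-5 Codex only support the newer endpoint.
        "/responses"
    }
}
```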
# Notes
There is a PR open (https://github.com/zed-industries/zed/pull/39989)
for adding `/responses` support to the OpenAI and OpenAI-compatible
APIs. Although they share some similarities (the Copilot API seems to
mirror OpenAI directly), I've simplified some things and tried to stay
close to the vscode-chat implementation where possible. There might be a
case for code reuse, but I think keeping them separate for now should be
OK.
# Tool Calls
<img width="716" height="670" alt="Screenshot from 2025-10-15 17-12-30"
src="https://github.com/user-attachments/assets/14e88a52-ba8b-4209-8f78-73d15034b1e0"
/>
# Image
<img width="923" height="494" alt="Screenshot from 2025-10-21 02-02-26"
src="https://github.com/user-attachments/assets/b96ce97c-331e-45cb-b5b1-7aa10ed387b4"
/>
In the process of adding pickers for the theme and icon theme fields in
the settings UI, I felt there was an opportunity to improve where some
of these components are stored. The `ui_input` crate was originally
meant only for the text field-like component, which couldn't live in the
regular `ui` crate due to its dependency on `editor`. Given we had also
added the number field there—similar in having the same dependency—it
made sense to think of this crate as a home for form-like components
rather than for only one component.
However, we were also storing some settings UI-specific stuff in that
crate, which didn't feel right. So I ended up creating a new directory
within `settings_ui` for components and moved all the pickers and the
custom input field there. I think this makes for a cleaner structure.
Release Notes:
- settings_ui: Added the ability to search for theme and icon themes in
their respective fields.
We've been considering removing workspace-hack for a couple reasons:
- Lukas ran into a situation where its build script seemed to be causing
spurious rebuilds. This seems more likely to be a cargo bug than an
issue with workspace-hack itself (given that it has an empty build
script), but we don't necessarily want to take the time to hunt that
down right now.
- Marshall mentioned hakari interacts poorly with automated crate
updates (in our case provided by Renovate), because you'd need to run
`cargo hakari generate && cargo hakari manage-deps` after their changes,
and we prefer not to have actions that make commits.
Currently, removing workspace-hack causes our workspace to grow from
~1700 to ~2000 crates being built (depending on platform), which is
mainly a problem when you're building the whole workspace or running
tests across the normal and remote binaries (which is where
feature unification nets us the most sharing). It doesn't noticeably
impact incremental times when you're just iterating on `-p zed`, and
we'll hopefully get these savings back in the future when
rust-lang/cargo#14774 (which re-implements the functionality of hakari)
is finished.
Release Notes:
- N/A
Previously we were guessing the context window size here:
8c3f09e31e/crates/ollama/src/ollama.rs (L22)
This is inaccurate and must be updated manually. This PR ensures that we
extract the context window size from the Ollama API response in the same
way that the Ollama CLI does when running `ollama show <model-name>`
(relevant code is
[here](3d32249c74/cmd/cmd.go (L860))).
The format looks like this:
```json
{
  "model_info": {
    "general.architecture": "llama",
    "llama.context_length": 132000
  }
}
```
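A hedged sketch of the lookup: the keys in `model_info` are prefixed with the architecture name, so the context window is read from `<architecture>.context_length` (e.g. `llama.context_length` above), mirroring what `ollama show` does. The function is illustrative:
```rust
use serde_json::{Map, Value};

fn context_length(model_info: &Map<String, Value>) -> Option<u64> {
    // Read the architecture first, then the prefixed context-length key.
    let architecture = model_info.get("general.architecture")?.as_str()?;
    model_info
        .get(&format!("{architecture}.context_length"))?
        .as_u64()
}
```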
Once this PR is merged we could technically remove the old code
8c3f09e31e/crates/ollama/src/ollama.rs (L22)
I decided to keep it for now, as it is unclear if the necessary fields
are available via the API on older Ollama versions.
Release Notes:
- Fixed an issue where Ollama models would use the wrong context window
size
Release Notes:
- Added Codestral edit predictions provider which can be enabled by adding an API key in the Mistral section of agent settings.

## Config
Get an API key from https://console.mistral.ai/codestral and add it in the Mistral section of the agent settings.
```
"features": {
"edit_prediction_provider": "codestral"
},
"edit_predictions": {
"codestral": {
"model": "codestral-latest",
"max_tokens": 150
}
},
```
---------
Co-authored-by: Michael Sloan <michael@zed.dev>
Before this change, the active theme and icon theme were retrofitted
onto `ThemeSettings`.
Now they live in their own new global (`GlobalTheme::theme(cx)` and
`GlobalTheme::icon_theme(cx)`).
This lets us remove `cx` from the settings traits and tidy up a few
other things along the way.
Release Notes:
- N/A
After this change, we can add "supports_images", "supports_tools", and
"parallel_tool_calls" properties when setting up new models. Our
`settings.json` will look as follows:
```json
"language_models": {
"x_ai": {
"api_url": "https://api.x.ai/v1",
"available_models": [
{
"name": "grok-4-fast-reasoning",
"display_name": "Grok 4 Fast Reasoning",
"max_tokens": 2000000,
"max_output_tokens": 64000,
"supports_tools": true,
"parallel_tool_calls": true,
},
{
"name": "grok-4-fast-non-reasoning",
"display_name": "Grok 4 Fast Non-Reasoning",
"max_tokens": 2000000,
"max_output_tokens": 64000,
"supports_images": true,
}
]
}
}
```
Closes https://github.com/zed-industries/zed/issues/38752
Release Notes:
- xAI: Added support for configuring tool and image support for custom
model configurations
This PR adds an `x-zed-client-supports-x-ai` header to the `GET /models`
request sent to Cloud to indicate that the client supports xAI models.
Release Notes:
- N/A
The `Duration` argument in `get_models` has been unused for over a year.
The `complete` function is also unused, and it has fallen behind on new
feature additions such as Authorization support. It used to exist
because Ollama didn't support tools in streaming mode; `with_tools` also
existed for that reason. Now there is no reason to keep either around.
`ChatResponseDelta` had unnecessary `#[allow(unused)]` attributes, since
the fields are marked `pub`. Using `#[expect(unused)]` would've caught
this.
Release Notes:
- N/A
The current problem is that if I specify model parameters, like
`max_tokens`, in `settings.json` for an Ollama model, they do not
override the values coming from the Ollama API. Instead, the parameters
from the API are used. For example, in the settings below, even though I
have overridden `max_tokens`, Zed will still use the API's default
`context_length` of 4k.
```
"language_models": {
"ollama": {
"available_models": [
{
"name": "qwen3-coder:latest",
"display_name": "Qwen 3 Coder",
"max_tokens": 64000,
"supports_tools": true,
"keep_alive": "15m",
"supports_thinking": false,
"supports_images": false
}
]
}
},
```
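A minimal sketch of the intended precedence, with illustrative names: the user's `max_tokens` from `settings.json` should win, with the API-reported value only as a fallback.
```rust
fn effective_context_window(
    settings_max_tokens: Option<u64>,
    api_context_length: Option<u64>,
) -> u64 {
    // Settings override the API value; 4096 stands in for the last-resort
    // default mentioned above (an assumption for illustration).
    settings_max_tokens.or(api_context_length).unwrap_or(4096)
}
```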
Release Notes:
- Fixed an issue where Ollama model parameters were not being correctly
overridden by user settings.
This PR updates the Gemini provider to treat a
`prompt_feedback.block_reason` as a refusal, as Gemini does not seem to
return a `stop_reason` to use in this case.
<img width="639" height="162" alt="Screenshot 2025-09-22 at 4 23 15 PM"
src="https://github.com/user-attachments/assets/7a86d67e-06c1-49ea-b58f-fa80666f0f8c"
/>
Previously this would just result in no feedback to the user.
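A hedged sketch of the check, with illustrative stand-ins for the Gemini response types:
```rust
use anyhow::{anyhow, Result};

struct PromptFeedback {
    block_reason: Option<String>,
}

struct GenerateContentResponse {
    prompt_feedback: Option<PromptFeedback>,
}

// Gemini returns no stop_reason in this case, so surface the
// block_reason itself as a refusal error instead of staying silent.
fn check_for_refusal(response: &GenerateContentResponse) -> Result<()> {
    if let Some(reason) = response
        .prompt_feedback
        .as_ref()
        .and_then(|feedback| feedback.block_reason.as_ref())
    {
        return Err(anyhow!("Response blocked by Gemini: {reason}"));
    }
    Ok(())
}
```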
Release Notes:
- Added an error message when a Gemini response contains a
`block_reason`.
`HttpClient`: Relaxes the lifetime bound on `&self` in `get`/`post`
by returning the `self.send` future directly. This makes both
methods return `'static` futures without extra boxing.
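A simplified sketch of the idea (not Zed's actual `HttpClient` trait): build the request up front, then return the `send` future directly rather than an `async` block that would capture `&self`.
```rust
use std::future::Future;
use std::pin::Pin;

type BoxFuture = Pin<Box<dyn Future<Output = Result<String, String>> + Send + 'static>>;

trait HttpClient {
    fn send(&self, request: String) -> BoxFuture;

    fn get(&self, url: &str) -> BoxFuture {
        let request = format!("GET {url}");
        // Returning `self.send(request)` directly keeps the future 'static;
        // an `async move { ... }` block here would borrow `&self` and tie
        // the future's lifetime to the client.
        self.send(request)
    }
}
```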
`HttpRequestExt`: Added fluent builder methods to `HttpRequestExt`
inspired by the `gpui::FluentBuilder` trait.
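A hedged sketch of the pattern, not the actual methods added: a `when` combinator applies a transformation only if a condition holds, keeping builder chains linear.
```rust
// A blanket extension trait in the spirit of `gpui::FluentBuilder`;
// the trait and method names here are illustrative.
trait FluentExt: Sized {
    fn when(self, condition: bool, then: impl FnOnce(Self) -> Self) -> Self {
        if condition {
            then(self)
        } else {
            self
        }
    }
}

impl<T> FluentExt for T {}
```
In a request-building chain this reads as, e.g., `builder.when(compress, |b| b.header("Accept-Encoding", "gzip"))` (the `header` call is an assumption for illustration).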
Release Notes:
- N/A
When we refactored settings to not pass JSON blobs around, we ended up
needing to write *a lot* of code that just merged things (like JSON
merge used to do).
Use a derive macro to prevent typos in this logic.
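A minimal sketch of the merge logic such a derive macro would generate, assuming field-wise "incoming value wins when present" semantics; all names here are illustrative:
```rust
trait Merge {
    fn merge_from(&mut self, other: Self);
}

#[derive(Default)]
struct InlineBlameSettings {
    enabled: Option<bool>,
    delay_ms: Option<u64>,
}

// Hand-written today; a derive would generate exactly this, eliminating
// copy-paste typos like merging a value into the wrong field.
impl Merge for InlineBlameSettings {
    fn merge_from(&mut self, other: Self) {
        if other.enabled.is_some() {
            self.enabled = other.enabled;
        }
        if other.delay_ms.is_some() {
            self.delay_ms = other.delay_ms;
        }
    }
}
```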
Release Notes:
- N/A
Co-Authored-By: Ben K <ben@zed.dev>
Co-Authored-By: Anthony <anthony@zed.dev>
Co-Authored-By: Mikayla <mikayla@zed.dev>
Release Notes:
- settings: Major internal changes to settings. The primary user-facing
effect is that some settings which did not make sense in project
settings files are no longer read from there (for example, the inline
blame settings).
---------
Co-authored-by: Ben Kunkle <ben@zed.dev>
Co-authored-by: Mikayla Maki <mikayla.c.maki@gmail.com>
Co-authored-by: Anthony <anthony@zed.dev>
This PR updates the Cloud language model provider to use the `message`
field from the Cloud error response, if it is present.
Previously we would always show the entire JSON payload in the error
message, but with this change we can show just the user-facing `message`
when the error response is in a shape that we recognize.
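A hedged sketch of that fallback, with an illustrative struct name:
```rust
use serde::Deserialize;

#[derive(Deserialize)]
struct CloudErrorBody {
    message: String,
}

// Show the user-facing message when the body parses into a recognized
// shape; otherwise keep showing the raw payload.
fn user_facing_error(raw_body: &str) -> String {
    serde_json::from_str::<CloudErrorBody>(raw_body)
        .map(|body| body.message)
        .unwrap_or_else(|_| raw_body.to_string())
}
```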
Release Notes:
- N/A
Three motivations for this:
* Changing provider URL could cause credentials for the prior URL to be
sent to the new URL.
* The UI is in a misleading state after URL change - it shows a
configured API key, but on restart it will show no API key.
* #34110 will add support for both URL and key configuration for Ollama.
This is the first provider to have UI for setting the URL, and this
makes these issues show up more directly as odd UI interactions.
#37610 implemented something similar for the OpenAI and OpenAI
compatible providers. This extracts out some shared code, uses it in all
relevant providers, and adds more safety around key use.
I haven't tested all providers, but the per-provider changes were pretty
mechanical, so they hopefully work properly.
Release Notes:
- Fixed handling of changes to LLM provider URL in settings to also load
the associated API key.
In Zed's logs you can see these LM Studio connection-refused errors.
Currently Zed connects to LM Studio by default, as there is no
credential mechanism to check whether the user has previously enabled
LM Studio, unlike other providers that use API keys.
This PR removes the annoying log below and makes Zed's logs less
polluted.
```
2025-09-01T02:11:33+05:30 ERROR [language_models] Other(error sending request for url (http://localhost:1234/api/v0/models)
Caused by:
    0: client error (Connect)
    1: tcp connect error: Connection refused (os error 61)
    2: Connection refused (os error 61))
```
Release Notes:
- N/A
---------
Signed-off-by: Umesh Yadav <git@umesh.dev>
Supersedes: #34500
This will also allow fixing #35386 without the UX changes, since
providers can now be controlled through settings within Zed as well.
Just rebased onto the latest main and added docs. Added @AurelienTollard
as co-author, as this was started by him; everything else remains the
same as the original PR.
Release Notes:
- Added ability to control Provider Routing for OpenRouter models from
settings.
Co-authored-by: Aurelien Tollard <tollard.aurelien1999@gmail.com>
This PR fixes the backwards compatibility of the new `Plan` variants.
We can't add new variants to the wire representation, as old clients
won't be able to understand them.
Release Notes:
- N/A
While working on fixing #37116, I realized the current implementation of
GitHub Copilot is not truly resilient to upstream changes.
This PR makes GitHub Copilot Chat forward-compatible with new AI model
vendors and improves token-counting accuracy by using vendor-specific
tokenizers from the GitHub Copilot API. The system previously failed
with deserialization errors when GitHub added new model vendors like
xAI, and token counting wasn't utilizing the vendor-specific tokenizer
information provided by the API.
The solution adds an `Unknown` variant to the `ModelVendor` enum with
serde's `other` attribute to gracefully handle any new vendors GitHub
introduces, implements tokenizer-aware token counting that uses the
model's specified tokenizer (mapping `o200k_base` to `gpt-4o`, with a
fallback), adds explicit support for xAI models with proper tool input
format handling, and includes comprehensive test coverage for
unknown-vendor scenarios.
Key changes include adding the tokenizer field to model capabilities,
implementing the tokenizer method on models, updating the tool input
format logic to handle unknown vendors, and simplifying token counting
to use the vendor's specified tokenizer or fall back to `gpt-4o`. This
ensures Zed's Copilot Chat integration remains robust and accurate as
GitHub continues expanding its AI model provider ecosystem.
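A hedged sketch of the `Unknown` variant; the known-vendor list and rename strings here are illustrative:
```rust
use serde::Deserialize;

// `#[serde(other)]` routes any vendor string we don't recognize into
// Unknown instead of failing deserialization of the whole model list.
#[derive(Deserialize, Debug)]
#[serde(rename_all = "lowercase")]
enum ModelVendor {
    OpenAI,
    Anthropic,
    Google,
    #[serde(rename = "xai")]
    XAI,
    #[serde(other)]
    Unknown,
}
```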
Release Notes:
- Enhanced model vendor compatibility to automatically support future AI
providers, and improved token counting accuracy using vendor-specific
tokenizers from the GitHub Copilot API
---------
Signed-off-by: Umesh Yadav <git@umesh.dev>
Closes #37093
Also check this: #37099.
Currently in Zed, for both the OpenAI and OpenAI-compatible providers,
when the URL is changed in settings, the API key stored in the provider
state is not cleared and is still used. But if you restart Zed, the API
key is cleared. Zed uses the `api_url` to store and fetch the API key
from the credential provider. The behaviour is not changed overall; we
have just made it consistent with the Zed restart logic, where it
re-authenticates and fetches the API key again. I have attached videos
below to showcase the before and after.
All in all, the problem was that we were not re-authenticating in case
of an `api_url` change while Zed was still running. Now we trigger a
re-authentication and clear the state in case authentication fails.
OpenAI Compatible Provider:
| Before | After |
|--------|--------|
| <video
src="https://github.com/user-attachments/assets/324d2707-ea72-4119-8981-6b596a9f40a3"
/> | <video
src="https://github.com/user-attachments/assets/cc7fdb73-8975-4aaf-a642-809bb03ce319"
/> |
OpenAI Provider:
| Before | After |
|--------|--------|
| <video
src="https://github.com/user-attachments/assets/a1c07d1b-1909-4b49-b33c-fc05123e92e7"
/> | <video
src="https://github.com/user-attachments/assets/d78aeccd-5cd3-4d0c-8b9f-6f98e499d7c8"
/> |
Release Notes:
- Fixed OpenAI and OpenAI-compatible provider API keys being persisted
when changing the API URL setting. Authentication is now properly
revalidated when settings change.
---------
Signed-off-by: Umesh Yadav <git@umesh.dev>
Fix an issue that resulted in Ollama models not being able to access the
input of the commands they executed (only being able to access the
result).
This properly returns the function call history as shown in
https://github.com/ollama/ollama/blob/main/docs/api.md#chat-request-with-history-with-tools
Previously, function inputs were not returned, and results were returned
under a "user" role.
Release Notes:
- ollama: Improved format when returning tool results to the models
This PR switches the OpenRouter integration from fetching all models to
fetching only the models specified in the user's account preferences.
This will help improve the model-selection experience.
**The Problem**
The previous implementation used the `/models` endpoint, which returned
an exhaustive list of all models supported by OpenRouter. This resulted
in a long and cluttered model selection dropdown in Zed, making it
difficult for users to find the models they actually use.
**The Solution**
We now use the `/models/user` endpoint. This API call returns a curated
list based on the models and providers the user has selected in their
[OpenRouter dashboard](https://openrouter.ai/models).
Ref: [OpenRouter API Docs for User-Filtered
Models](https://openrouter.ai/docs/api-reference/list-models-filtered-by-user-provider-preferences)
Release Notes:
- language_models: Support OpenRouter user preferences for available
models
Closes #37302
Related: #37614
OpenAI-compatible providers like Zhipu AI and z.ai return empty content
along with usage data; below is an example JSON captured from z.ai. We
now ignore empty content returned by providers to avoid echoing the same
empty content back to the provider, which would error out.
```
OpenAI Stream Response JSON:
{
  "id": "2025090518465610d80dc21e66426d",
  "created": 1757069216,
  "model": "glm-4.5",
  "choices": [
    {
      "index": 0,
      "finish_reason": "tool_calls",
      "delta": {
        "role": "assistant",
        "content": ""
      }
    }
  ],
  "usage": {
    "prompt_tokens": 7882,
    "completion_tokens": 150,
    "total_tokens": 8032,
    "prompt_tokens_details": {
      "cached_tokens": 7881
    }
  }
}
```
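A minimal sketch of the skip, as an illustrative function: usage-only chunks like the one above carry an empty `content` string and should produce no text event.
```rust
// Only surface a text event when the delta carries non-empty content.
fn delta_text_event(delta_content: Option<String>) -> Option<String> {
    delta_content.filter(|content| !content.is_empty())
}
```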
Release Notes:
- Skip empty delta text content in OpenAI and OpenAI-compatible providers
Signed-off-by: Umesh Yadav <git@umesh.dev>