mixa/zed

Author	SHA1	Message	Date
Max Brunsfeld	624dab2027	Combine zeta and zeta2 edit prediction providers (#43284 ) We've realized that a lot of the logic within an `EditPredictionProvider` is not specific to a particular edit prediction model / service. Rather, it is just the generic state management required to perform edit predictions at all in Zed. We want to move to a setup where there's one "built-in" edit prediction provider in Zed, which can be pointed at different edit prediction models. The only logic that is different for different models is how we construct the prompt, send the request, and parse the output. This PR also changes the behavior of the staff-only `zeta2` feature flag so that it only gates your ability to use Zeta2, but you can still use your local settings file to choose between different edit prediction models/services: zeta1, zeta2, and sweep. This PR also makes zeta1's outcome reporting and prediction-rating features work with all prediction models, not just zeta1. To do: * [x] remove duplicated logic around sending cloud requests between zeta1 and zeta2 * [x] port the outcome reporting logic from zeta to zeta2. * [x] get the "rate completions" modal working with all EP models * [x] display edit prediction diff * [x] show edit history events * [x] remove the original `zeta` crate. Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev> Co-authored-by: Ben Kunkle <ben@zed.dev>	2025-11-25 15:52:08 +01:00
Oleksiy Syvokon	4118b71010	zeta2: Support experimental 1120-seedcoder model (#43411 ) 1. Introduce a common `PromptFormatter` trait 2. Let models define their generation params. 3. Add support for the experimental 1120-seedcoder prompt format Release Notes: - N/A	2025-11-25 15:52:07 +01:00
Piotr Osiewicz	298dbd881c	releases: Add build number to Nightly builds (#42990 ) - Remove semantic_version crate and use semver instead - Update upload-nightly Release Notes: - N/A --------- Co-authored-by: Conrad Irwin <conrad.irwin@gmail.com>	2025-11-25 15:52:07 +01:00
Ben Kunkle	f2f40a5099	zeta2: Merge Sweep and Zeta2 Providers (#43097 ) Closes #ISSUE Release Notes: - N/A or Added/Fixed/Improved ... --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>	2025-11-19 15:40:06 -08:00
Max Brunsfeld	09e02a483a	Allow running zeta evals against sweep (#43039 ) This PR restructures the subcommands in `zeta-cli`, so that the prediction engine (currently `zeta1` vs `zeta2`) is no longer the highest order subcommand. Instead, there is just one layer of subcommands: `eval`, `predict`, `context`, etc. Within these commands, there are flags for using `zeta1`, `zeta2`, and now `sweep`. Release Notes: - N/A --------- Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Agus <agus@zed.dev>	2025-11-19 16:09:19 -05:00
Ben Kunkle	39f8aefa8c	zeta2: Improve context retrieval (#43014 ) Closes #ISSUE Release Notes: - N/A or Added/Fixed/Improved ... Co-authored-by: Agus <agus@zed.dev> Co-authored-by: Max <max@zed.dev>	2025-11-19 14:44:58 +00:00
Piotr Osiewicz	40dd4e2270	zeta: Add stats about context lines from patch that were retrieved during context retrieval (#43053 ) A.K.A: Eval: Expect lines necessary to uniquely target every change in "Expected Patch" to be included as context Release Notes: - N/A	2025-11-19 11:25:53 +00:00
Ben Kunkle	4bf3b9d62e	zeta2: Output `bucketed_analysis.md` (#42890 ) Closes #ISSUE Makes it so that a file named `bucketed_analysis.md` is written to the runs directory after an eval is run with more than one repetition. This file buckets the predictions made by the model by comparing the edits made, which makes it much easier to see how many times each failure mode was encountered. Release Notes: - N/A or Added/Fixed/Improved ...	2025-11-17 15:17:39 -05:00
Oleksiy Syvokon	b2f561165f	zeta2: Support qwen3-minimal prompt format (#42902 ) This prompt is for a fine-tuned model. It has the following changes, compared to `minimal`: - No instructions at all, except for one sentence at the beginning of the prompt. - Output is a simplified unified diff -- hunk headers have no line counts (e.g., `@@ -20 +20 @@`) - Qwen's FIM tokens are used where possible (`<\|file_sep\|>`, `<\|fim_prefix\|>`, `<\|fim_suffix\|>`, etc.) To evaluate this model: ``` ZED_ZETA2_MODEL=zeta2-exp [usual zeta-cli eval params ...] --prompt-format minimal-qwen ``` This will point to the most recent Baseten deployment of zeta2-exp (which may change in the future, so the prompt-format may get out of sync). Release Notes: - N/A	2025-11-17 20:36:05 +02:00
Oleksiy Syvokon	b274f80dd9	zeta2: Print average length of prompts and outputs (#42885 ) Release Notes: - N/A	2025-11-17 16:56:58 +02:00
Piotr Osiewicz	f1bebd79d1	zeta2: Add skip-prediction flag to eval CLI (#42872 ) Release Notes: - N/A	2025-11-17 13:37:51 +00:00
Ben Kunkle	8772727034	zeta2: Improve zeta old text matching (#42580 ) This PR improves Zeta2's matching of `old_text`/`new_text` pairs, using similar code to what we use in the edit agent. For right now, we've duplicated the code, as opposed to trying to generalize it. Release Notes: - N/A --------- Co-authored-by: Max <max@zed.dev> Co-authored-by: Michael <michael@zed.dev> Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com> Co-authored-by: Agus <agus@zed.dev>	2025-11-14 11:18:16 -05:00
Oleksiy Syvokon	723f9b1371	zeta2: Add minimal prompt for fine-tuned models (#42691 ) 1. Add `--prompt-format=minimal` that matches single-sentence instructions used in fine-tuned models (specifically, in `1028-` and `1029-` models) 2. Use separate configs for agentic context search model and edit prediction model. This is useful when running a fine-tuned EP model, but we still want to run vanilla model for context retrieval. 3. `zeta2-exp` is a symlink to the same-named Baseten deployment. This model can be redeployed and updated without having to update the deployment id. 4. Print scores as a compact table Release Notes: - N/A --------- Co-authored-by: Piotr Osiewicz <piotr@zed.dev>	2025-11-14 13:08:54 +00:00
Agus Zubiaga	c2c5fceb5b	zeta eval: Allow no headings under "Expected Context" (#42638 ) Release Notes: - N/A	2025-11-13 15:43:22 +00:00
Agus Zubiaga	8467a1b08b	zeta eval: Improve output (#42629 ) Hides the aggregated scores if only one example/repetition ran. It also fixes an issue with the expected context scoring. Release Notes: - N/A	2025-11-13 14:47:48 +00:00
Agus Zubiaga	b0700a4625	zeta eval: `--repeat` flag (#42569 ) Adds a `--repeat` flag to the zeta eval that runs each example as many times as specified. Also makes the output nicer in a few ways. Release Notes: - N/A --------- Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Michael <michael@zed.dev>	2025-11-12 16:58:22 -05:00
Ben Kunkle	6501b0c311	zeta eval: Improve determinism and debugging ergonomics (#42478 ) - Improves the determinism of the search step for better cache reusability - Adds a `--cache force` mode that refuses to make any requests or searches that aren't cached - The structure of the `zeta-*` directories under `target` has been rethought for convenience Release Notes: - N/A --------- Co-authored-by: Agus <agus@zed.dev>	2025-11-12 18:16:13 +00:00
Ben Kunkle	6c0069ca98	zeta2: Improve error reporting and eval purity (#42470 ) Closes #ISSUE Improves error reporting for various failure modes of zeta2, including failing to parse the `<old_text>`/`<new_text>` pattern, and the contents of `<old_text>` failing to match. Additionally, makes it so that evals are checked out into a worktree with the _repo_ name instead of the _example_ name, in order to make sure that the eval name has no influence on the model's prediction. The repo name worktrees are still namespaced by the example name like `{example_name}/{repo_name}` to ensure evals pointing to the same repo do not conflict. Release Notes: - N/A or Added/Fixed/Improved ... --------- Co-authored-by: Agus <agus@zed.dev>	2025-11-12 12:52:11 -05:00
Agus Zubiaga	f2ad0d716f	zeta cli: Print log paths when running predict (#42396 ) Release Notes: - N/A Co-authored-by: Michael Sloan <mgsloan@gmail.com> Co-authored-by: Ben Kunkle <ben@zed.dev>	2025-11-11 09:56:20 -03:00
Max Brunsfeld	b607077c08	Add old_text/new_text as a zeta2 prompt format (#42171 ) Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev> Co-authored-by: Oleksiy Syvokon <oleksiy.syvokon@gmail.com> Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Michael Sloan <mgsloan@gmail.com>	2025-11-10 15:44:54 -07:00
Agus Zubiaga	c748b177c4	zeta2 cli: Cache at LLM request level (#42371 ) We'll now cache LLM responses at the request level (by hash of URL+contents) for both context and prediction. This way we don't need to worry about mistakenly using the cache when we change the prompt or its components. Release Notes: - N/A --------- Co-authored-by: Oleksiy Syvokon <oleksiy.syvokon@gmail.com>	2025-11-10 14:23:52 -03:00
Agus Zubiaga	d420dd63ed	zeta: Improve unified diff prompt (#42354 ) Extracts some of the improvements to the unified diff prompt from https://github.com/zed-industries/zed/pull/42171 and adds some others about how context works, to improve the reliability of predictions. We also now strip the `<\|user_cursor\|>` marker if it appears in the output rather than failing. Release Notes: - N/A --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>	2025-11-10 14:58:42 +00:00
Agus Zubiaga	c241eadbc3	zeta2: Targeted retrieval search (#42240 ) Since we removed the filtering step during context gathering, we want the model to perform more targeted searches. This PR tweaks search tool schema allowing the model to search within syntax nodes such as `impl` blocks or methods. This is what the query schema looks like now: ```rust /// Search for relevant code by path, syntax hierarchy, and content. #[derive(Debug, Clone, Serialize, Deserialize, JsonSchema)] pub struct SearchToolQuery { /// 1. A glob pattern to match file paths in the codebase to search in. pub glob: String, /// 2. Regular expressions to match syntax nodes by their first line and hierarchy. /// /// Subsequent regexes match nodes within the full content of the nodes matched by the previous regexes. /// /// Example: Searching for a `User` class /// ["class\s+User"] /// /// Example: Searching for a `get_full_name` method under a `User` class /// ["class\s+User", "def\sget_full_name"] /// /// Skip this field to match on content alone. #[schemars(length(max = 3))] #[serde(default)] pub syntax_node: Vec<String>, /// 3. An optional regular expression to match the final content that should appear in the results. /// /// - Content will be matched within all lines of the matched syntax nodes. /// - If syntax node regexes are provided, this field can be skipped to include as much of the node itself as possible. /// - If no syntax node regexes are provided, the content will be matched within the entire file. pub content: Option<String>, } ``` We'll need to keep refining this, but the core implementation is ready. Release Notes: - N/A --------- Co-authored-by: Ben <ben@zed.dev> Co-authored-by: Max <max@zed.dev> Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>	2025-11-08 01:06:12 +00:00
Mikayla Maki	5f8226457e	Automate settings registration (#42238 ) Release Notes: - N/A --------- Co-authored-by: Nia <nia@zed.dev>	2025-11-07 22:27:14 +00:00
Max Brunsfeld	f89bb2f0d2	Small zeta cli fixes (#42170 ) * Fix a panic that happened because we lost the `ContextRetrievalStarted` debug message, so we didn't assign `t0`. * Write the edit prediction response log file as a markdown file containing the text, not a JSON file. We mostly always want the text content. Release Notes: - N/A	2025-11-07 07:34:05 +00:00
Max Brunsfeld	5044e6ac1d	zeta2: Make eval example file format more expressive (#42156 ) * Allow expressing alternative possible context fetches in `Expected Context` section * Allow marking a subset of lines as "required" in `Expected Context`. We still need to improve how we display the results. I've removed the context pass/fail pretty printing for now, because it would need to be rethought to work with the new structure, but for now I think we should focus on getting basic predictions to run. But this is progress toward a better structure for eval examples. Release Notes: - N/A --------- Co-authored-by: Oleksiy Syvokon <oleksiy.syvokon@gmail.com> Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Agus Zubiaga <agus@zed.dev>	2025-11-06 18:05:18 -08:00
Max Brunsfeld	784fdcaee3	zeta2: Build edit prediction prompt and process model output in client (#41870 ) Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev> Co-authored-by: Ben Kunkle <ben@zed.dev> Co-authored-by: Piotr Osiewicz <24362066+osiewicz@users.noreply.github.com>	2025-11-06 18:36:58 -05:00
Oleksiy Syvokon	91d631c229	Evaluate zeta2 context retrieval and edit predictions (#41921 ) This PR implements the `zeta-cli eval` command. It will: - Run the edit prediction model if there are no cached results - Compute precision/recall/F1 for context retrieval at the line level: every retrieved line of context is counted as a true positive (correct retrieval), false positive (retrieved something that was not expected), or false negative (didn't retrieve an expected line) - Compute similar metrics for edit predictions - Pretty-print results, highlighting the difference between actual and expected when printing to tty Other changes: - `zeta-cli predict` accepts a `--format` argument with options `md`, `json`, `diff` - Code restructure Release Notes: - N/A --------- Co-authored-by: Piotr Osiewicz <24362066+osiewicz@users.noreply.github.com> Co-authored-by: Agus Zubiaga <agus@zed.dev>	2025-11-04 17:36:50 +00:00
Max Brunsfeld	1631cec15a	Add zeta-cli subcommand for running zeta2 predictions (#41722 ) This PR adds a `zeta zeta2 predict` subcommand that takes an edit prediction example markdown file as an argument, and performs zeta2's prediction, showing the retrieved context and the predicted edit. * [x] Apply uncommitted diff to get repo into the right state. * [x] Apply edits in edit history * [x] Display predicted edits as unified diff, regardless of model output format Release Notes: - N/A --------- Co-authored-by: Agus Zubiaga <agus@zed.dev> Co-authored-by: Piotr Osiewicz <24362066+osiewicz@users.noreply.github.com> Co-authored-by: Ben Kunkle <ben.kunkle@gmail.com>	2025-11-03 15:12:08 -08:00
Agus Zubiaga	06bdb28517	zeta cli: Add convert-example command (#41608 ) Adds a `convert-example` subcommand to the zeta cli that converts eval examples from/to `json`, `toml`, and `md` formats. Release Notes: - N/A --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>	2025-11-01 19:35:04 +00:00
Agus Zubiaga	60c546196a	zeta2: Expose llm-based context retrieval via zeta_cli (#41584 ) Release Notes: - N/A --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com> Co-authored-by: Oleksiy Syvokon <oleksiy.syvokon@gmail.com>	2025-10-30 21:41:09 +00:00
Agus Zubiaga	ee80ba6693	zeta2: LLM-based context gathering (#41326 ) Release Notes: - N/A --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com> Co-authored-by: Max Brunsfeld <max@zed.dev>	2025-10-27 22:54:42 +00:00
Agus Zubiaga	eda7a49f01	zeta2: Max retrieved definitions option (#40515 ) Release Notes: - N/A	2025-10-20 15:26:41 +00:00
Julia Ryan	ef5b8c6fed	Remove workspace-hack (#40216 ) We've been considering removing workspace-hack for a couple of reasons: - Lukas ran into a situation where its build script seemed to be causing spurious rebuilds. This seems more likely to be a cargo bug than an issue with workspace-hack itself (given that it has an empty build script), but we don't necessarily want to take the time to hunt that down right now. - Marshall mentioned hakari interacts poorly with automated crate updates (in our case provided by Renovate) because you'd need to run `cargo hakari generate && cargo hakari manage-deps` after their changes and we prefer to not have actions that make commits. Currently removing workspace-hack causes our workspace to grow from ~1700 to ~2000 crates being built (depending on platform), which is mainly a problem when you're building the whole workspace or running tests across the normal and remote binaries (which is where feature-unification nets us the most sharing). It doesn't impact incremental times noticeably when you're just iterating on `-p zed`, and we'll hopefully get these savings back in the future when rust-lang/cargo#14774 (which re-implements the functionality of hakari) is finished. Release Notes: - N/A	2025-10-17 18:58:14 +00:00
Agus Zubiaga	fba7f4d8cc	zeta2: Update prompts to match training more closely (#40383 ) Release Notes: - N/A	2025-10-16 14:13:19 +00:00
Agus Zubiaga	376335496d	zeta2: Numbered lines prompt format (#40218 ) Adds a new `NumberedLines` format which is similar to `MarkedExcerpt` but each line is prefixed with its line number. Also fixes a bug where contiguous snippets wouldn't get merged. Release Notes: - N/A --------- Co-authored-by: Michael Sloan <mgsloan@gmail.com> Co-authored-by: Michael <michael@zed.dev>	2025-10-15 09:35:39 -03:00
Agus Zubiaga	1bd34e0db0	zeta2 cli: Export retrieval stats data frame (#40145 ) Retrieval stats will now use polars to build a big data frame for references with the cartesian product of LSP declarations and retrieved declaration candidates (with all their score components) and rebuilds the stats summary on top of it. This data frame is written to a `.parquet` file, which we can load into advanced analytics tools (such as Metabase), so we can explore our scoring distributions and find ways to improve retrieval, and then train the decision tree. Release Notes: - N/A	2025-10-14 13:34:07 -03:00
Agus Zubiaga	6a9639f62f	zeta2 cli: Split retrieval stats module (#39977 ) Refactors zeta2 cli a bit. Merging this by itself to prevent conflicts. Release Notes: - N/A	2025-10-10 19:35:51 +00:00
Agus Zubiaga	a693d44553	zeta2 cli: Resumable LSP declarations gathering (#39828 ) Gathering LSP declarations in zeta_cli can take a really long time for big repos and has to be started from scratch if interrupted. Instead of writing the cache file once we have walked the whole worktree, we'll now do so incrementally as we complete each file. On subsequent runs, we'll load as many valid declarations as have been previously written to the cache, and then continue to request the rest from the LSP, which will append to the existing file as it makes progress. If the last cache entry is incomplete, we'll truncate the cache file to the end of the last valid line and continue from there, so we can just `ctrl-c` without breaking resumability. Release Notes: - N/A	2025-10-10 12:44:36 -03:00
Michael Sloan	bcef3b5010	zeta2: Parse imports via Tree-sitter queries + improve `zeta retrieval-stats` (#39735 ) Release Notes: - N/A --------- Co-authored-by: Max <max@zed.dev> Co-authored-by: Agus <agus@zed.dev> Co-authored-by: Oleksiy <oleksiy@zed.dev>	2025-10-08 12:04:06 -06:00
Michael Sloan	c61409e577	zeta_cli: Avoid unnecessary rechecks in `retrieval-stats` (#39267 ) Before this change, it would save every buffer and wait for diagnostics. For rust analyzer this would cause a lot of rechecking and greatly slow down the analysis Release Notes: - N/A Co-authored-by: Agus <agus@zed.dev>	2025-10-01 06:31:27 +00:00
Agus Zubiaga	df43a2d3b1	zeta2 cli: Include section ranges in new full output format (#39203 ) Release Notes: - N/A --------- Co-authored-by: Oleksiy Syvokon <oleksiy.syvokon@gmail.com>	2025-09-30 14:30:13 +00:00
Michael Sloan	6af385235d	zeta_cli: Add retrieval-stats command for comparing with language server symbol resolution (#39164 ) Release Notes: - N/A --------- Co-authored-by: Agus <agus@zed.dev>	2025-09-30 08:06:31 +00:00
Michael Sloan	773850f477	zeta2: Use bounded parallelism for tree-sitter indexing + await completion in zeta_cli (#39147 ) Also skips indexing files that don't have a suffix that indicates a known language, and skips when the language doesn't have an outline grammar. Release Notes: - N/A --------- Co-authored-by: Agus <agus@zed.dev>	2025-09-29 22:15:00 +00:00
Michael Sloan	a5683f3541	zeta_cli: Add `--output-format both` and `--prompt-format only-snippets` (#38920 ) These options are probably temporary, added for use in some experimental code. Release Notes: - N/A Co-authored-by: Oleksiy <oleksiy@zed.dev>	2025-09-25 22:49:36 +00:00
Max Brunsfeld	495a7b0a84	Clean up RelPath API (#38912 ) Consolidate constructors and accessors. Release Notes: - N/A --------- Co-authored-by: Cole Miller <cole@zed.dev>	2025-09-25 14:42:32 -07:00
Agus Zubiaga	f25ace6be0	zeta2 cli: Output raw request (#38876 ) Release Notes: - N/A Co-authored-by: Bennet Bo Fenner <bennetbo@gmx.de> Co-authored-by: Oleksiy Syvokon <oleksiy.syvokon@gmail.com>	2025-09-25 12:59:17 +00:00
Michael Sloan	8fc7bd9ae8	zeta2: Add labeled sections prompt format (#38828 ) Release Notes: - N/A Co-authored-by: Agus <agus@zed.dev>	2025-09-25 00:07:43 +00:00
Max Brunsfeld	03f9cf4414	Represent relative paths using a dedicated, separator-agnostic type (#38744 ) Closes https://github.com/zed-industries/zed/issues/38690 Closes #37353 ### Background On Windows, paths are normally separated by `\`, unlike mac and linux where they are separated by `/`. When editing code in a project that uses a different path style than your local system (e.g. remoting from Windows to Linux, using WSL, and collaboration between windows and unix users), the correct separator for a path may differ from the "native" separator. Previously, to work around this, Zed converted paths' separators in numerous places. This was applied to both absolute and relative paths, leading to incorrect conversions in some cases. ### Solution Many code paths in Zed use paths that are relative to either a worktree root or a git repository. This PR introduces a dedicated type for these paths called `RelPath`, which stores the path in the same way regardless of host platform, and offers `Path`-like manipulation APIs. RelPath supports displaying the path using either separator, so that we can display paths in a style that is determined at runtime based on the current project. The representation of absolute paths is left untouched, for now. Absolute paths are different from relative paths because (except in contexts where we know that the path refers to the local filesystem) they should generally be treated as opaque strings. Currently we use a mix of types for these paths (std::path::Path, String, SanitizedPath). Release Notes: - N/A --------- Co-authored-by: Cole Miller <cole@zed.dev> Co-authored-by: Piotr Osiewicz <24362066+osiewicz@users.noreply.github.com> Co-authored-by: Peter Tripp <petertripp@gmail.com> Co-authored-by: Smit Barmase <heysmitbarmase@gmail.com> Co-authored-by: Lukas Wirth <me@lukaswirth.dev>	2025-09-24 18:57:33 -04:00
Agus Zubiaga	831de8e48f	zeta2: Include edits in prompt and add `max_prompt_bytes` param (#38737 ) Release Notes: - N/A Co-authored-by: Michael Sloan <mgsloan@gmail.com>	2025-09-23 19:50:07 +00:00
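
The line-level scoring described in the `zeta-cli eval` commit (#41921) counts every retrieved line as a true positive, a false positive, or a false negative, and derives precision, recall, and F1 from those counts. The following is a minimal sketch of that idea, not the code from the PR: the `Scores` type, its method names, and the `(path, line number)` representation of a line are all assumptions made for illustration.

```rust
use std::collections::HashSet;

/// A context line, identified by file path and line number.
/// (Hypothetical representation; the real eval may key lines differently.)
pub type Line = (String, u32);

/// Line-level counts from comparing retrieved context to expected context.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Scores {
    pub true_positives: usize,
    pub false_positives: usize,
    pub false_negatives: usize,
}

impl Scores {
    /// Classify each line: retrieved and expected (TP), retrieved but not
    /// expected (FP), expected but not retrieved (FN).
    pub fn from_lines(expected: &HashSet<Line>, actual: &HashSet<Line>) -> Self {
        Scores {
            true_positives: actual.intersection(expected).count(),
            false_positives: actual.difference(expected).count(),
            false_negatives: expected.difference(actual).count(),
        }
    }

    /// Fraction of retrieved lines that were expected.
    pub fn precision(&self) -> f64 {
        let denom = self.true_positives + self.false_positives;
        if denom == 0 {
            0.0
        } else {
            self.true_positives as f64 / denom as f64
        }
    }

    /// Fraction of expected lines that were retrieved.
    pub fn recall(&self) -> f64 {
        let denom = self.true_positives + self.false_negatives;
        if denom == 0 {
            0.0
        } else {
            self.true_positives as f64 / denom as f64
        }
    }

    /// Harmonic mean of precision and recall; 0.0 when both are zero.
    pub fn f1(&self) -> f64 {
        let (p, r) = (self.precision(), self.recall());
        if p + r == 0.0 {
            0.0
        } else {
            2.0 * p * r / (p + r)
        }
    }
}
```

Using sets means a line retrieved twice is counted once. Returning 0.0 for empty denominators is a choice made for this sketch, and the eval itself may handle those cases differently.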

63 Commits