cybercyst/jj - jj - Gitea: Git with a cup of tea

mirror of https://github.com/martinvonz/jj.git synced 2025-05-31 03:42:39 +00:00

Author	SHA1	Message	Date
Yuya Nishihara	767e94f5af	fsmonitor: drop unneeded mut from make_fsmonitor_matcher() We only need &self.working_copy_path here.	2023-11-23 10:06:00 +09:00
Yuya Nishihara	c16c89bc27	fsmonitor: keep paths relative to the workspace root Since the caller wants repo-relative paths, it doesn't make sense to convert them back and forth.	2023-11-23 10:06:00 +09:00
Yuya Nishihara	5186066cf5	working_copy: simply collect() proto file states into BTreeMap Suppose the input list is presorted, sorting a sorted vec would be cheaper than .insert()-ing sorted items one by one. In my "linux" repo (watchman eanbled): - jj-0: baseline - jj-1: previous (don't randomize by HashMap) - jj-2: this % hyperfine --sort command --warmup 3 --runs 10 -L bin jj-0,jj-1,jj-2 \ "target/release-with-debug/{bin} -R ~/mirrors/linux status" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux status Time (mean ± σ): 1.034 s ± 0.020 s [User: 0.881 s, System: 0.212 s] Range (min … max): 1.011 s … 1.068 s 10 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux status Time (mean ± σ): 849.3 ms ± 13.8 ms [User: 710.7 ms, System: 199.3 ms] Range (min … max): 821.7 ms … 870.2 ms 10 runs Benchmark 3: target/release-with-debug/jj-2 -R ~/mirrors/linux status Time (mean ± σ): 786.2 ms ± 16.7 ms [User: 650.7 ms, System: 204.1 ms] Range (min … max): 760.8 ms … 805.2 ms 10 runs Relative speed comparison 1.32 ± 0.04 target/release-with-debug/jj-0 -R ~/mirrors/linux status 1.08 ± 0.03 target/release-with-debug/jj-1 -R ~/mirrors/linux status 1.00 target/release-with-debug/jj-2 -R ~/mirrors/linux status	2023-11-20 08:29:33 +09:00
Yuya Nishihara	ee6a1e2c0a	working_copy: don't build intermediate HashMap from proto file states According to the doc, this is compatible with the map syntax. https://protobuf.dev/programming-guides/proto3/#maps This change means that the serialized file states are sorted by RepoPath, so BTreeMap<RepoPath, _> can be reconstructed with fewer cache misses. In my "linux" repo (watchman enabled): - jj-0: baseline - jj-1: this % hyperfine --sort command --warmup 3 --runs 10 -L bin jj-0,jj-1,jj-2 \ "target/release-with-debug/{bin} -R ~/mirrors/linux status" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux status Time (mean ± σ): 1.034 s ± 0.020 s [User: 0.881 s, System: 0.212 s] Range (min … max): 1.011 s … 1.068 s 10 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux status Time (mean ± σ): 849.3 ms ± 13.8 ms [User: 710.7 ms, System: 199.3 ms] Range (min … max): 821.7 ms … 870.2 ms 10 runs Relative speed comparison 1.32 ± 0.04 target/release-with-debug/jj-0 -R ~/mirrors/linux status 1.08 ± 0.03 target/release-with-debug/jj-1 -R ~/mirrors/linux status Cache-misses got reduced: % perf stat -e task-clock,cycles,instructions,cache-references,cache-misses \ -- ./target/release-with-debug/jj-0 -R ~/mirrors/linux --no-pager status 1,091.68 msec task-clock # 1.032 CPUs utilized 4,179,596,978 cycles # 3.829 GHz 6,166,231,489 instructions # 1.48 insn per cycle 134,032,047 cache-references # 122.776 M/sec 29,322,707 cache-misses # 21.88% of all cache refs 1.057474164 seconds time elapsed 0.897042000 seconds user 0.194819000 seconds sys % perf stat -e task-clock,cycles,instructions,cache-references,cache-misses \ -- ./target/release-with-debug/jj-1 -R ~/mirrors/linux --no-pager status 927.05 msec task-clock # 1.083 CPUs utilized 3,451,299,198 cycles # 3.723 GHz 6,222,418,272 instructions # 1.80 insn per cycle 98,499,363 cache-references # 106.251 M/sec 11,998,523 cache-misses # 12.18% of all cache refs 0.855938336 seconds time elapsed 0.720568000 seconds user 0.207924000 seconds sys	2023-11-20 08:29:33 +09:00
Yuya Nishihara	56047cb7ec	working_copy: don't pass all proto data to from_proto() functions Just a code cleanup. This allows us to consume proto fields if needed. I also removed redundant .clone() and .as_str().	2023-11-20 08:29:33 +09:00
Martin von Zweigbergk	9b24d24612	conflicts: add another helper for materializing a tree value We have a few places where we have a `MergedTreeValue` and need to read the data associated with it so we can write to the working copy or include it in a diff. Let's extract some of that shared logic to a function so we can reuse it. I plan to use it for reading file contents in advance while streaming a diff in `local_working_copy` soon (and probably in `jj diff` thereafter), but I think it seems like an improvement on its own.	2023-11-08 21:21:38 -08:00
Martin von Zweigbergk	65bd5cacba	working copy: on checkout, move read from store out of `write_()` functions I'd like to read N files ahead from the backend, to avoid serializing too many server calls on backends that are backed by a server. Moving the reads a little earlier is a little step towards that. The `TreeState::write_()` functions can now be made into free/static functions if we prefer.	2023-11-08 21:21:38 -08:00
Martin von Zweigbergk	904c37d36d	working copy: use `MergedTree::diff_stream()` This will make it a little faster to update the working copy at Google once we've made `MergedTree::diff_stream()` fetch trees concurrently. (It only makes it a little faster because we still fetch files serially.)	2023-11-03 08:15:10 -07:00
Martin von Zweigbergk	24b706641f	async: switch to `pollster`'s `block_on()` During the transition to using more async code, I keep running into https://github.com/rust-lang/futures-rs/issues/2090. Right now, I want to convert `MergedTree::diff()` into a `Stream`. I don't want to update all call sites at once, so instead I'm adding a `MergedTree::diff_stream()` method, which just wraps `MergedTree::diff()` in a `Stream. However, since the iterator is synchronous, it needs to block on the async `Backend::read_tree()` calls. If we then also block on the `Stream` in the CLI, we run into the panic.	2023-11-03 08:15:10 -07:00
Martin von Zweigbergk	a1ef9dc845	merged_tree: propagate backend errors in diff iterator I want to fix error propagation before I start using async in this code. This makes the diff iterator propagate errors from reading tree objects. Errors include the path and don't stop the iteration. The idea is that we should be able to show the user an error inline in diff output if we failed to read a tree. That's going to be especially useful for backends that can return `BackendError::AccessDenied`. That error variant doesn't yet exist, but I plan to add it, and use it in Google's internal backend.	2023-10-26 06:20:56 -07:00
Martin von Zweigbergk	309f1200d6	merge: introduce a type alias for `Merge<Option<TreeValue>>` Reasons to introduce this alias: * Reduces complexity of a type, to silence Clippy warnings in the future if we use this type as a type parameter * The type is used quite frequently, so it makes sense to have a name for it * It's easier to visually scan for the end of the type when you don't have to match opening and closing angle brackets	2023-10-26 06:20:56 -07:00
Martin von Zweigbergk	8764ad9826	conflicts: make materialization async We need to let async-ness propagate up from the backend because `block_on()` doesn't like to be called recursively. The conflict materialization code is a good place to make async because it doesn't depends on anything that isn't already async-ready.	2023-10-20 07:38:34 -07:00
Martin von Zweigbergk	6bfd618275	workspace: load working copy implementation dynamically This makes `Workspace::load()` look a new `.jj/working_copy/type` file in order to load the right working copy implementation, just like `Repo::load()` picks the right backends based on `.jj/store/type`, `.jj/op_store/type`, etc. We don't write the file yet, and we don't have a way of adding alternative working copy implementations, so it will always be `LocalWorkingCopy` for now.	2023-10-16 22:33:44 -07:00
Martin von Zweigbergk	e1f00d9426	working copy: pass commit instead of tree into `check_out()` Our internal working copy implementations at Google will need the commit so they can walk history backwards until they get to a "public" commit. They'll then use that to tell build tools and virtual file systems to present that as a base. I'm not sure if we'll need to update `reset()` too. It's currently only used by `jj untrack`, which doesn't change the commit's parent, so it wouldn't affect any history walks.	2023-10-16 22:33:44 -07:00
Martin von Zweigbergk	0582893144	working copy: return `Box<dyn LockedWorkingCopy>` from `start_mutation()`	2023-10-15 16:13:19 -07:00
Martin von Zweigbergk	580586d008	working copy: return `Box<dyn WorkingCopy>` from `finish()`	2023-10-15 16:13:19 -07:00
Martin von Zweigbergk	6a13fa8264	working copy: add `tree_id()` to backend trait Looks like I missed this earlier. I think it makes sense to have on all working copy implementations.	2023-10-15 16:13:19 -07:00
Martin von Zweigbergk	a733fceba9	working copy: add functions to start/finish modification to backend trait To keep this patch small, the functions still return the concrete local-disk implementations. I'll fix that soon.	2023-10-15 16:13:19 -07:00
Martin von Zweigbergk	63654d064b	working copy: add sparse pattern functions to backend trait	2023-10-15 15:59:49 -07:00
Martin von Zweigbergk	6457a13260	working copy: add `reset()` function to the backend trait This includes documenting the new function and the other types moved to the `working_copy` module.	2023-10-15 15:59:49 -07:00
Martin von Zweigbergk	0d2247b0df	working copy: add `check_out()` function to the backend trait This includes documenting the new function and the other types moved to the `working_copy` module.	2023-10-15 15:59:49 -07:00
Martin von Zweigbergk	781859cb51	working copy: add `snapshot()` function to the backend trait This includes documenting the new function and the other types moved to the `working_copy` module.	2023-10-15 15:59:49 -07:00
Martin von Zweigbergk	3aa57b1a04	working copy: start defining a trait for a locked working copy As with the `WorkingCopy` trait, this just contains some trivial methods for now.	2023-10-15 15:59:49 -07:00
Martin von Zweigbergk	49637cb0fd	working copy: don't clear `tree_state_dirty` just before it's dropped	2023-10-15 15:59:49 -07:00
Martin von Zweigbergk	8826639e4e	working copy: remove reference from locked instance to base instance It's going to be easier to define a `LockedWorkingCopy` trait if it doesn't need to borrow from `WorkingCopy`, so let's remove the reference we currently have and have `LockedLocalWorkingCopy::finish()` return the new `LocalWorkingCopy` instead. I think the main disadvantage is that we now have to remember to replace the old `LocalWorkingCopy` instance by the new one, whereas the compiler would remind us before this commit. We could make `start_modification()` take an owned `self`, but that would be a bit annoying to work with when we have the instance stored in a field.	2023-10-15 15:59:49 -07:00
Martin von Zweigbergk	35f9c12cb5	working copy: move `LocalWorkingCopy::check_out()` to `Workspace` `LocalWorkingCopy::check_out()` can be expressed using the planned `WorkingCopy` trait, so it doesn't need to be in the trait itself `WorkingCopy`. I wasn't sure if I should make it a free function in `working_copy`, but I ended up moving it onto `Workspace`.	2023-10-15 15:59:49 -07:00
Martin von Zweigbergk	0aa5f1ae10	working copy: rename `working_copy_path()` to just `path()` It seems pretty clear from the context. Turns out we only use the function in a test case. Maybe we don't even need it. It's easy to provide it, though.	2023-10-12 16:10:38 -07:00
Martin von Zweigbergk	9e43207911	working copy: don't expose `TreeStateError` in `LocalWorkingCopy` API The `TreeStateError` type is specific to the current local-disk working-copy backend, so it should not be part of the generic working-copy interface I'm trying to create.	2023-10-12 16:10:38 -07:00
Martin von Zweigbergk	0e09d53ce6	working copy: make some reset errors less specific Same reasoning as the previous commits.	2023-10-12 16:10:38 -07:00
Martin von Zweigbergk	645be615b4	working copy: make some snapshot errors less specific Same reasoning as the previous commit.	2023-10-12 16:10:38 -07:00
Martin von Zweigbergk	324c40d4c5	working copy: make some checkout errors less specific I think some of the errors variants in `CheckoutError` are too specific to the local-disk implementation. Let's merge them and make them less specific, so it's easier to define a reasonable trait for the working copy.	2023-10-12 16:10:38 -07:00
Martin von Zweigbergk	33d27ed09f	working copy: start defining a working copy trait This just extracts a trait for the trivial bits to start with.	2023-10-12 16:10:38 -07:00
Martin von Zweigbergk	b9a122ffe7	working_copy: inline `apply_diff` closure This effectively undoes d8a313cdd474, which is no longer needed since we just changed that error handling. It should make it easier to share some of the current if/else blocks.	2023-10-07 14:02:31 -07:00
Martin von Zweigbergk	44eb902171	working_copy: don't crash when updating and tracked file exits on disk Before this patch, when updating to a commit that has a file that's currently an ignored file on disk, jj would crash. After this patch, we instead leave the conflicting files or directories on disk. We print a helpful message about how to inspect the differences between the intended working copy and the actual working copy, and how to discard the unintended changes. Closes #976.	2023-10-07 14:02:31 -07:00
Martin von Zweigbergk	4601c87710	working_copy: move creation of parent dirs to one place I'm about to add handling of parent dirs that are existing ignored files, so it's better to have it in one place. The only functional difference should be that we now create parent directories for git submodules. I don't think that matters.	2023-10-07 14:02:31 -07:00
Martin von Zweigbergk	187ba9430a	working_copy: rename to local_working_copy It's about time we make the working copy a pluggable backend like we have for the other storage. We will use it at Google for at least two reasons: * To support our virtual file system. That will be a completely separate working copy backend, which will interact with the virtual file system to update and snapshot the working copy. * On local disk, we need to tell our build system where to find the paths that are not in the sparse patterns. We plan to do that by wrapping the standard local working copy backend (the one moved in this commit), writing a symlink that points to the mainline commit where the "background" files can be read from. Let's start by renaming the exising implementation to `local_working_copy`.	2023-10-07 08:19:03 -07:00

1 2

86 Commits