Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Atomic move operation for element reparenting & reordering #1255

Open
domfarolino opened this issue Feb 14, 2024 · 110 comments · May be fixed by #1307
Open

Atomic move operation for element reparenting & reordering #1255

domfarolino opened this issue Feb 14, 2024 · 110 comments · May be fixed by #1307
Assignees
Labels
a11y-tracker Group bringing to attention of a11y, or tracked by the a11y Group but not needing response. addition/proposal New features or enhancements needs implementer interest Moving the issue forward requires implementers to express interest stage: 3 Committed

Comments

@domfarolino
Copy link
Member

domfarolino commented Feb 14, 2024

What problem are you trying to solve?

Chrome (@domfarolino, @noamr, @mfreed7) is interested in pursuing the addition of an atomic move primitive in the DOM Standard. This would allow an element to be re-parented or re-ordered without today's side effects of first being removed and then inserted.

Here are all of the prior issues/PRs I could find related to this problem space:

Problem

Without an atomic move operation, re-parenting or re-ordering elements involves first removing them and then re-inserting them. With the DOM Standard's current removal/insertion model, this resets lots of state on various elements, including iframe document state, selection/focus on <input>s, and more. See @josepharhar's reparenting demo for a more exhaustive list of state that gets reset.

This causes lots of developer pain, as recently voiced on X by frameworks like HTMX, and other companies such as Wix, Microsoft, and internally at Google.

This state-resetting is in part caused by the DOM Standard's current insertion & removal model. While well-defined, its model of insertion and removal steps has two issues, both captured by #808:

  1. Undesirable model: The current DOM Standard allows for the non-atomic insertion of multiple nodes at a time. In practice, this means when appending e.g., a DocumentFragment, script can run in between each individual child insertion, thus observing DOM state before the entire fragment insertion is complete.
  2. Interop issues: While Safari matches the spec, Chromium & Gecko have a model that ensures all DOM mutations are synchronously performed before any script runs as a result of the mutations.

What solutions exist today?

One very limited partial solution that does not actually involve any DOM tree manipulation, is this shadow DOM example that @emilio had posted a while back: whatwg/html#5484 (comment) (see my brief recreation of it below).

Screen Recording 2024-01-29 at 5 00 26 PM

But as mentioned, this does not seem to perform any real DOM mutations; rather, the slot mutation seems to just visually compose the element in the right place. Throughout this example, the iframe's actual parent does not change.


Otherwise, we know there is some historical precedent for trying to solve this problem with WebKit's since-rolled-back "magic iframes". See whatwg/html#5484 (comment) and https://bugs.webkit.org/show_bug.cgi?id=13574#c12. We believe that the concerns from that old approach can be ameliorated by:

How would you solve it?

Solution

To lay the groundwork for an atomic move primitive in the DOM Standard, we plan on resolving #808 by introducing a model desired by @annevk, @domfarolino, @noamr, and @mfreed7, that resembles Gecko & Chromium's model of handling all script-executing insertion/removal side-effects after all DOM mutations are done, for any given insertion.

With this in place, we believe it will be much easier to separate out the cases where we can simply skip the invocation of insertion/removal side-effects for nodes that are atomically moved in the DOM. This will make us, and implementers, confident that there won't be any way to observe an inconsistent DOM state while atomically moving an element, or experience other nasty unknown side-effects.

The API shape for this new primitive is an open question. Below are a few ideas:

  • A new DOM API like replaceChildAtomic()/replaceChildrenAtomic() that can take a connected node and atomically re-parent it without removal/insertion side-effects.
    • One limitation here is that we'd have to pick and choose which existing DOM APIs we want to mirror with atomic counterparts. For example, if we ever wanted append() or appendChild() to ever be able to also atomically move already-connected nodes, we'd have to introduce appendAtomic() and appendChildAtomic(), and so on.
  • A setting for existing DOM APIs, e.g., append(node, {atomic: true}), replaceChild(node, {atomic: true})
  • A scoped, declarative attribute that changes the behavior of DOM mutation APIs in a subtree
    • This could be an element attribute that makes all existing DOM mutation APIs behave "atomically" when operating on already-connected nodes under the element's subtree
    • This could also be a property on the document overall, set via a header/meta tag, or some other mechanism

Compatibility issues here take the form relying on insertion/removal side-effects which no longer happen during an atomic move. They vary depending on the shape of our final design.

  1. With a new DOM API/setting that developers have to affirmatively opt-into, you could atomically move fragments/subtrees constructed by other library code that's unaware it's being atomically moved. Those fragments may be built in a way that relies on non-atomic move side-effects (though we haven't heard of such concerns directly yet).
  2. Consider an element attribute that changes the behavior of all DOM mutation APIs to behave atomically on already-connected nodes in its subtree. You could minimize compat concerns by externally-constructed portions of the subtree to opt-out of atomic moves with the same attribute. But what would that mean exactly, to have part of a subtree move atomically and part of it not?

A non-exhaustive list of additional complexities that would be nice to track/discuss before a formal design:

  • How to handle mutation events? There was discussion at the TPAC 2023 about suppressing mutation events when new-ish DOM features are used, so we could probably get away with simply suppressing mutation events whenever an atomic move is being performed??
  • Handling things like focus/selection properly (need to land on desired behavior)
  • Fixing up things like live ranges; the way DOM handles this today might already be suitable for atomic moves, but unclear

Anything else?

No response

@domfarolino domfarolino added needs implementer interest Moving the issue forward requires implementers to express interest addition/proposal New features or enhancements labels Feb 14, 2024
@domfarolino domfarolino self-assigned this Feb 14, 2024
@domfarolino domfarolino added the agenda+ To be discussed at a triage meeting label Feb 14, 2024
@WebReflection
Copy link

WebReflection commented Feb 14, 2024

First of all, thank you! I've been vocal about this issue about forever and part of one of the biggest discussions you've linked.

As author of various "reactive" libraries and somehow veteran of the "DOM diffing field", I'd like to add an idea:

The API shape for this new primitive is an open question. Below are a few ideas:

I understand a node can be moved from <main> to an <aside> element and this proposal should still work but I think we should not discard the Range API:

  • most modern libraries have a concept of fragments, inevitably represented as virtual because there's no persistent fragment whatsoever yet on the DOM (I've been vocal about this too)
  • in a classic table sort mechanism there could be only few TRs moved within a specific place and taht's the same for LIs and others ... if any proposed API consider only parentNode to work that would not satisfy most fragment based requirements where areas are confined within Virtual DOM or comment nodes to confine those special cases while the Range api could instead simply select a node start, a node end, and update atomically inner nodes

On top of this I hope whatever solution comes to mind works well with DOM diffing, so that new nodes can even pass through the usual DOM dance when the parent is changed or they become live, removed nodes that won't land anywhere else would eventually invoke disconnectedCallback if Custom Elements, but nodes already present in that container and moved around basically do nothing in terms of state, they are just shuffled in the layout, if they do.

As quick idea to eventually signal a node is going to be moved in an atomic way, and assuming it's targeting also a live parent, I think something like parent.insertBeforeAtomic(node[, reference]) could be an interesting approach to consider as that basically solves everything, from append to prepend to any other case insertBefore works wonderfully well and it hints that such node should:

  • do nothing if the parent is the same as before (or the node was already live) ... just move it and skip all the things
  • trigger connectedCallback if the node was not live
  • ... that's it?

As insertBefore covers append, appendChild, prepend, before and after with ease, it might be the easiest starting point to have something working and useful for the variety of virtual fragments based solutions and diffing APIs out there.

I hope this answer of mine makes sense and maybe trigger some even better idea / API.

edit on after thoughts another companion of the API should be reflected in MutationObserver, or better, MutationRecord ... so far we have addedNodes and removedNodes but nothing about movedNodes which will still be desired for most convoluted edge cases.

The movedNodes record will contain, beside of course the target, a from parent container and a to parent container which might be the same if moved internally but it would signal previous parent and new parent otherwise that something different is within their content.

@1cg
Copy link

1cg commented Feb 18, 2024

This would be a fantastic addition of functionality for web development in general and for web libraries in particular. Currently if developers want to preserve the state of a node when updating the DOM they need to be extremely careful not to remove that node from the DOM.

Morphing (https://github.com/patrick-steele-idem/morphdom) is an idea that has developed around addressing this. I have created an extension to the original morphdom algorithm called idiomorph (https://github.com/bigskysoftware/idiomorph/) and the demo for idiomorph shows how it preserves a video in a situation when morphdom cannot. 37Signals has recently integrated idiomorph into Turbo 8 & Rails (https://radanskoric.com/articles/turbo-morphing-deep-dive-idiomorph)

If you look at the details of the idiomorph demo you will see it's set up in a particular way: namely, the video cannot change the depth in the DOM at which it is placed, nor can any of the types of the parent nodes of the video change. This is a severe restriction on what sorts of UI changes idiomorph can handle. With the ability to reparent elements idiomorph could offer much better user experience, handling much more significant changes to the DOM without losing state such as video playback, input focus, etc.

Note that it's not only morphing algorithms like idiomorph that would benefit from this change: nearly any library that mutates the DOM would benefit from this ability. Even virtual DOM based libraries, when the rubber meets the road, need to update the actual DOM and move actual elements around. This change would benefit them tremendously.

Thank you for considering it!

@smaug----
Copy link
Collaborator

Anything else?

Add some complexity to selection/range: how to deal with Shadow DOM when the host moves around and selection is partially in shadow DOM?

@ydogandjiev
Copy link

This is a very exciting proposal! In the Microsoft Teams Platform, we extensively use iframes to host embedded apps in the Teams Web/Desktop Clients. When a user navigates away from an experience powered by one of these embedded apps and comes back to it later, we provide the ability for them to keep their iframe cached in the DOM (in a hidden state) and then re-show it later when it's needed again. To implement this functionality, we had to resort to creating the embedded app frames under the body of our page and absolute position them in the right place within our UX. This approach has lots of obvious disadvantages (e.g. breaks the accessibility tree, requires us to run a bounds synchronization loop, etc.) and the only reason we had to resort to it was because moving the iframe in the DOM would reload the embedded app from scratch thus negating any benefits of caching the frame. This proposal would allow us to implement a much more ideal iframe caching solution!

Note the location of the iframe in the DOM and its absolute positioning in this recording:
https://github.com/whatwg/dom/assets/3357245/7fd4d2a7-2c2d-4bed-9a78-9c60f26a42f4

@infogulch
Copy link

The WHATNOT meetings that occurred after this issue was created deferred discussion about the topic. I wonder what next steps would be needed to move this issue forward. The next meeting is on March 28 (#10215).

@noamr
Copy link
Collaborator

noamr commented Mar 22, 2024

The WHATNOT meetings that occurred after this issue was created deferred discussion about the topic. I wonder what next steps would be needed to move this issue forward. The next meeting is on March 28 (#10215).

I hope we can get to it in the 28.3 WHATNOT. @domfarolino @past ?

@past
Copy link

past commented Mar 22, 2024

It's already on the agenda, so if the interested parties are attending we will discuss this.

@iteriani
Copy link

Are the imperative and declarative APIs meant to slowly replace the existing APIs over time? Or do we need to choose between one or the other because of potential overhead?

@noamr
Copy link
Collaborator

noamr commented Mar 26, 2024

Are the imperative and declarative APIs meant to slowly replace the existing APIs over time? Or do we need to choose between one or the other because of potential overhead?

If I understand the question, it's mainly for backwards compatibility. In some cases you might want the existing behavior or something subtle in your app relies on it, so we can't just change it under the hood.

@sebmarkbage
Copy link

This would be very nice for React since we currently basically just live with things sometimes incorrectly resetting. A couple of notes on the API options:

  • Associating with the node that gets moved e.g. an option on the <iframe> doesn't make much sense because it can be deeply nested inside the tree that moves. The iframe doesn't know anything about which context it moves inside. At best maybe you'd just have to by default add it to all possible nodes that might contain any state - which is all nodes.
  • Associating with a subtree creates a kind of "mode". Basically for a React app we'd just add it to the entire document, but that also affects any subtrees embedded inside the document which might be an entire legacy app or a different framework. It forces us to basically break the whole app to opt into it. It'd basically be like a new doctype kind of mode.

The thing that does causes a change is the place where the move happens. But even then it's kind of random which one gets moved and which one implicitly moves by everything around it moving. We don't remove all children and then reinsert them. So sometimes things preserve state.

A new API for insertion/move seems like a better option.

We'd basically like to just always the same API for all moves - which can be thousands at a time. This means that this API would have to be really fast - similar to insertBefore. An API like append(node, {atomic: true}) doesn't seem good because the allocation and creation of potentially new objects and reading back the value from C++ to JS isn't exactly fast. Since this is a high performance API, this seems like a bad option.

Something new like replaceChildAtomic would be easy to adopt inside a library and faster.

@rniwa
Copy link
Collaborator

rniwa commented Mar 26, 2024

One thing that's nice to nail down is whether re-ordering of child nodes is enough or we need to support re-parenting (i.e. parent node changing from one node to another). Supporting the latter is a lot more challenging than just supporting re-ordering.

@1cg
Copy link

1cg commented Mar 26, 2024

Definitely would prefer full re-parenting. I gave an htmx demo of an morph-based swap at Github where you could flip back and forth between two pages and a video keeps working:

https://www.youtube.com/watch?v=Gj6Bez2182k&t=2100s

The dark secret of that demo was that I had to really carefully structure the HTML in the first and second pages to make sure that the video stayed at the same depth w/ the same parent element types to make the video playing keep working. Would be far better for HTML authors if they could change the HTML structure entirely, just build page 1 the way they want and build page 2 the way they want, and we could swap elements into their new spots by ID.

@domfarolino
Copy link
Member Author

(For the purpose of brevity, I will begin using the SPAM acronym that we've been toying around with internally, which means "state-preserving atomic move". The most obvious example is an iframe that gets SPAM-moved doesn't lose its document or otherwise get torn down).


  • Associating with a subtree [...] Basically for a React app we'd just add it to the entire document, but that also affects any subtrees embedded inside the document [...]. It forces us to basically break the whole app to opt into it.

The thing that does causes a change is the place where the move happens.
[...]
A new API for insertion/move seems like a better option.

@sebmarkbage I understand your hesitation around a new subtree-associated-HTML-attribute — in that it would be over-broad, affecting tons of nested content that a framework might not own, possibly breaking parts of an app that doesn't expect SPAM moves to happen. But I'm curious if a new DOM API really gets you out from under that over-broadness, while still being useful? What would you expect orderedList.replaceChildAtomic(newListItem, oldListItem) to do, where newListItem is an <li> with a bunch of app-specific (not framework-owned) child content, including <iframe>s?

I guess I had in mind that the imperative API would force-SPAM-move the "state-preservable" elements in the subtree that's moving, so that any nested iframes do not get their documents reset1. But if that API would not preserve nested iframe state, then the only way it would be possible to actually preserve that iframe's state in this case is if the application took care to apply an iframe-specific HTML attribute to it, specifying that it opts into SPAM moves:

  • Associating with the node that gets moved e.g. an option on the <iframe> doesn't make much sense because it can be deeply nested inside the tree that moves. [...]

But it sounded like that option didn't sit well with you because the application author would be one-by-one sprinkling these attributes to random iframes without understanding the context in which the SPAM move might actually take place, by a framework way higher up the stack.

So how can we best enable the scenario where an <li> that contains a deeply-nested iframe, gets SPAM-moved without the iframe being reset? My thought is that:

  • list.replaceChildAtomic(new, old) would force-SPAM-move iframes in the new subtree (if new is already connected in the DOM of course)
  • Good ole fashioned list.replaceChild(new, old) would only cause SPAM moves to happen on elements in the subtree with the HTML attribute directly applied to it (i.e., <iframe preserve=content>), and no other elements.

But I would love to get more thoughts on the subtree side-effects stuff in general.

Footnotes

  1. Possibly other state like focus/selection being preserved on other eligible elements; that bit would need to be figured out!

@rniwa
Copy link
Collaborator

rniwa commented Mar 27, 2024

I don't think we can make this happen automatically based on a content attribute on an iframe. It most certainly needs to be a completely new DOM API.

@domfarolino
Copy link
Member Author

I don't think we can make this happen automatically based on a content attribute on an iframe. It most certainly needs to be a completely new DOM API.

I am very much open to that, I'm just trying to consider what subtree side-effects are acceptable. That is, if parent.appendAtomic(connectedDivWithChildIframe) should preserve the "child iframe" state or not? I think it has to, for the API to be useful at all. But I'm also sympathetic to compat concerns that it might cause a preserving-move to happen on deeply-nested iframes in a subtree built by another application/framework than the one performing the move in the first place. (And maybe that could break things if parts of the app relies on preserving moves not happening on nodes in the subtree).

@domfarolino
Copy link
Member Author

An attribute + DOM API could work together in this case a bit, to ameliorate some of the compat concerns. For example:

const nodeToAtomicallyMove = document.querySelector('......');
// Never trigger atomic moves on *this* specific sub-subtree, that was built by "old" content.
nodeToAtomicallyMove.querySelector('.built-by-legacy-app').preserve = 'none';
newParent.appendAtomic(nodeToAtomicallyMove);

In this case, all <iframe>s inside nodeToAtomicallyMove could be SPAM moved except ones that exist inside the subtree .built-by-legacy-app. Those ones are specifically opted-out, because maybe they can't handle preserving-moves... Just an idea!

@rniwa
Copy link
Collaborator

rniwa commented Mar 27, 2024

That sounds like something that could be built by a user hand library, not something that needs to be built into browser's native API. We really need to keep this API proposal as simple & succinct as much as possible.

@noamr
Copy link
Collaborator

noamr commented Mar 27, 2024

I don't think we can make this happen automatically based on a content attribute on an iframe. It most certainly needs to be a completely new DOM API.

Can you expand on why this is impossible? I can see the point why it might be preferable, but I think both directions are possible.

@noamr
Copy link
Collaborator

noamr commented Mar 27, 2024

and +1 to not limiting it to reordering. We'll end up just scratching the surface of the use-cases, coming back to where we started where we still need a full solution for reparenting.

@annevk
Copy link
Member

annevk commented Mar 27, 2024

I'm also a bit at a loss as to why we'd discuss new attributes. That seems like a pretty severe layering violation? The way I see it:

  1. https://dom.spec.whatwg.org/#mutation-algorithms needs to gain a new "move" operation that encapsulates argument validation, new mutation observer records, new callback steps for specifications to hook into, etc.
  2. We figure out what API is best suitable for that new primitive, e.g., parent.moveBefore(node, before). (Possibly multiple APIs, but best to start small and give it time to bake in multiple implementations.)

@noamr
Copy link
Collaborator

noamr commented Mar 27, 2024

I'm also a bit at a loss as to why we'd discuss new attributes. That seems like a pretty severe layering violation? The way I see it:

  1. https://dom.spec.whatwg.org/#mutation-algorithms needs to gain a new "move" operation that encapsulates argument validation, new mutation observer records, new callback steps for specifications to hook into, etc.
  2. We figure out what API is best suitable for that new primitive, e.g., parent.moveBefore(node, before). (Possibly multiple APIs, but best to start small and give it time to bake in multiple implementations.)

I tend to agree with the conclusion, but I want to explain why the main reason to consider things like an iframe attribute, in case it raises something else.

Outside "keep iframes from reloading", it's unclear exactly what the effects of this would be. For focus, we need to blur and refocus anyway, e.g. in case you're moving the element to an inert tree. We can decide to do that and just suppress the events. Similar provisions have to be taken for selection. So if we add moveBefore, we have to decide if it does all these things, if so, how exactly, or just the iframes thing for start.

@gnoff
Copy link

gnoff commented Mar 27, 2024

@domfarolino

I guess I had in mind that the imperative API would force-SPAM-move the "state-preservable" elements in the subtree that's moving

I think what Seb is saying is that React can decide if a move should be state preserving but if React added a "preserve-state" attribute to <html /> and then some embedded application deep in the DOM does an append expecting the append to be non-state-preserving we've just altered the moves that the other application owns.

Our perspective is that the mover decides the move semantics rather than the tree. So any moves done by this embedded application won't preserve state b/c that is what the application was expecting and any moves done by React would preserve state becuase React was updated to signal this intent by using a novel API

@noamr
Copy link
Collaborator

noamr commented Nov 25, 2024

So you'll be left with canMove = container.isConnected === node.isConnected

so ... two false would not throw? Interesting

Yea, you can move between disconnected parents.
It would only throw when you can't move without side-effects - as in, across documents or when the move would connect/disconnect the element.

@WebReflection
Copy link

to whom it might concern, just adding those two isConnected checks repeatedly gave me ~30% slower results but it's true that for large diffing I can trap isConnected for the parent only once, although in that case I have ~20% slowdown due checks per each node. I am not considering native moveBefore which might produce faster end results, just testing how much those checks can impact performance for more complex scenarios (js-framework-benchmark).

@dead-claudia
Copy link

dead-claudia commented Nov 25, 2024

to whom it might concern, just adding those two isConnected checks repeatedly gave me ~30% slower results but it's true that for large diffing I can trap isConnected for the parent only once, although in that case I have ~20% slowdown due checks per each node.

@WebReflection What time is the "~30% slower" relative to? And same for the "~20% slowdown". Just looking for some perspective here.

@WebReflection
Copy link

multiple moves and we're talking 0.3 VS 0.4 but for benchmarks 0.1 might mean everything ... of course more extensive tests with actually performing moveBefore when possible instead of just insertBefore must be done, right now the moveBefore operation is monkey-patched as insertBefore, see changed file then benchmarked via:

Node.prototype.moveBefore = Node.prototype.insertBefore;

right before the test suite benchmark.

@dead-claudia
Copy link

@WebReflection Sorry, I meant like specific time numbers, like 60k ops/sec or 1 ms/op.

@noamr
Copy link
Collaborator

noamr commented Nov 25, 2024

@WebReflection given that it's established that raw moveBefore has valid use cases, and we can perhaps add other variants or have something like canMoveBefore in addition do you mind opening another issue for these with those benchmarks and what not? This issue is becoming overloaded with too many comments about the same point.

@WebReflection
Copy link

@noamr the whole discussion is about not landing moveBefore as it is so me opening a new discussion won't benefit anyone. I understand your use case (still edgy and not-clear what the developer would end up doing after a throw happens) but I don't like for this method to land in a developer hostile way.

I am also OK to stop discussing it but so far we have most libraries authors not happy about that throwing (using such name that is too similar to insertBefore that does not throw) and, like I've said, a method that duplicates checks on nodes would result as slow as me checking isConnected in an optimized way (i.e. checking the parent is connected once per multiple moves, instead of each time).

If there is no way this method can be renamed to moveBeforeOrThrow as low-level API so that we can discuss in the future a moveBefore that does not throw, I might as well rest my case and by the time I will have benchmarks will be too late to change the current state of this low-level API with a way too high-level name.

Unless explicitly asked to, I will stop commenting on this, or the other, issue.

@slorber
Copy link

slorber commented Nov 25, 2024

FYI React experimental reconcilier integration PR: facebook/react#31596 with preference for not throwing apparently:

The other reason it's still off is because there's currently a semantic breaking change. moveBefore() errors if both nodes are disconnected. That happens if we're inside a completely disconnected React root. That's not usually how you should use React because it means effects can't read layout etc. However, it is currently supported. To handle this we'd have to try/catch the moveBefore to handle this case but we hope this semantic will change before it ships. Before we turn this on in experimental we either have to wait for the implementation to not error in the disconnected-disconnected case in Chrome or we'd have to add try/catch.

To me moveBefore not throwing is preferable. If it lands and throws, this is an irreversible decision reducing the DX for the most common usages. A fail-fast behavior can be opt-in, through an extra option: parent.moveBefore(child, refNode, {fallback: false})

@dead-claudia

This comment was marked as duplicate.

@dead-claudia
Copy link

dead-claudia commented Nov 25, 2024

The benchmark discussion was simply us assessing whether it's fast enough for it to be viable for us to use - if it's too slow, the throwing variant is outright useless to us. And switching to a method that moves instead of removing and inserting would allow us to fix a longstanding animations bug: MithrilJS/mithril.js#2612.

It's incredibly important to me that it meets my performance requirements. And those requirements are stiff: around 5 µs/op firm and 50 µs/op hard. (Soft affords ~1k keyed element moves without dropping frames at 60 FPS, and hard affords ~100 moves.) Slower than that, I simply can't switch to it. And no, this is not hypothetical - check the comment at the top of this file and imagine if someone reverses such a 100-row table's order. (Namesilo's UI currently allows changing sort order, by the way, and they can display up to 250 rows per page.)

By the way, keyed moves have the strictest deadlines for this. Other callers can tolerate as much as 100x slower for insertBefore.

A try/catch can add as much as 0.5 µs/op, and I believe the insertBefore currently clocks in at around 1-3 µs/call on my laptop. This overhead is not negligible. If falling back is needed, elements that don't support moves would likely clock in around 2-5 µs/call if performance turns out to be similar between the two. This means we'd be able to tolerate a dynamic fallback, but only barely.

This is why I'm so concerned and pushy about it. We need the ability to just move, and we need simple node movement to be very fast.

@sorvell
Copy link

sorvell commented Nov 26, 2024

I want to push back lightly on the use cases (e.g. #1255 (comment)) for throwing to try to understand this more. When exactly is it valuable?

We have a number of obvious cases where state can be affected, for example (1) animation, (2) focus, (3) iframe status. We have 2 important nodes, the container to move to and the target moving element. I think that means these permutations:

case container target throws state preserved
1 connected connected no yes yes
2 disconnected disconnected no no no
3 connected disconnected yes no no
4 disconnected connected yes yes no

I can't see any value in throwing for case (3) since there's no relevant state to consider.

That leaves case (4) and the question: is there ever a time where you wouldn't want to remove/disconnect an element to prevent its state from being lost? I'm struggling to find a practical case where that would be desirable.

I also think that moving across documents is a corner case and if we throw there, I suspect people won't care because that likely won't need hot path checking code.

@annevk
Copy link
Member

annevk commented Nov 26, 2024

The way I am looking at this is whether the method performs a move or not. For the past two-and-half decades the DOM has only had insert and remove. And we've only been able to define a move operation for two same-document connected nodes or two same-shadow-including-root disconnected nodes (I suspect that's sufficient for @sebmarkbage's case cited above). Note that for the connected case it also comes with a new custom element callback (this is not offered in the disconnected case because custom elements never fire callbacks there; only built-in elements appear to have a need for that thus far). Making it implicitly fallback to different semantics prevents us from building upon it in the future. As people would rely on it having insert or remove behavior in a certain set of scenarios. And it would also be rather magical to change the semantics under the hood. You could perhaps have a method moveBeforeOrMaybeInsert() but that seems way better to leave to libraries. As again it would lock us into certain behavior we know we don't want.

It will be interesting to see the performance aspects explored more. Once there is more data there we should certainly investigate why a couple of conditionals are so expensive. And perhaps adoption of abstractions is so overwhelming that it's indeed compelling to introduce moveBeforeOrMaybeInsert() at some point, but not before we are much more confident about the problem space.

Also, let me say that this issue has become quite unreadable. Adding way too many duplicate comments (this includes repeating what other people already said, to be clear) in just a couple of days is not a good way to make your case. There's a 160 people watching this repository. We all owe it to them to be more considerate of their time and attention.

@WebReflection
Copy link

WebReflection commented Nov 26, 2024

last question from me: is it possible to rename and land this method as moveBeforeAtomic(child, ref?), keeping it exactly as it is (throwing), to have room in the future for a moveBefore that is more developer friendly for the 98% of presented use cases, where an optional third argument such as { atomic: true } would still throw whenever that is desired? The disambiguation will be in the name where atomic would explain the throwing nature when such operation would not be possible.

edit to not bother more ... but ...

For the past two-and-half decades the DOM has only had insert and remove

This is the exact reason most of us are against landing a completely new thing under a name that undermines its most meaningful behavior from veterans to new-comers. If this is all new, and somehow experimental, because:

It will be interesting to see the performance aspects explored more

... but in the meantime the most desirable name for such API has been doomed forever, I strongly believe there is all the time we want, and need, to explore such new behavior/API under a name that is not representing, in the name, meaning, and intent, what most Web developers would expect from such API: let it be moveBeforeAtomic to signal that!

As quick reminder, the most successful and popular Websites out there have infinite lists and/or tabular data, all cases where nodes get inevitably and quickly out and back into the living DOM ... a sorted or filtered table is the same, it's not always a reordering of a to-do list and as mentioned before:

For small websites that don't use frameworks/DOM libraries, I think try/catch is probably fine performance-wise

so that it feels like this API has not "scaling" in mind, when it comes to performance.

Moreover, the state-preserving feature of this API has been overlooked as the only desired outcome when in practice X, and similar projects, FB, and similar project, or even GitHub that is rendering code views in split chunks of nodes, won't benefit at all from the ergonomics, the duplicated checks on both client side and inevitably on the native browser engine too, to do what is the most common thing everyone does these days: stream of visual data to consume.

but that seems way better to leave to libraries

Agreed, but only if libraries have a way to hook carelessly into the fastest-possible path, because otherwise this API is asking consumers of such API to add bloat to every page the world is surfing these days, because it couldn't literally compromise on anything, even if every developer that paid attention (I even suggested insertBeforeAtomic because insertBefore is the single thing that covers it all and now I regret that very first contribution) has been so loud about not being happy you had to play the "stop commenting on this" card when you, in the first place, asked me to comment in the related issue, and others just followed, or did that before me.

I understand there are many people involved in this project, and while repeated reasons from different developers are annoying, because repeated, the fact different developers repeatedly stated they didn't like the throwing should be visible to all others involved in this effort.

It's a new thing, it's potentially a game-changer only if it lands right ... and until it does, please do not nuke the most spot-on name that will backfire the day after it'll land out there.

Thanks to anyone patience enough to read this edit. I hope for the best, which is everything but moveBeforeOrInsertMaybe meme I can already see after landing.

edit 2 as mentioned in the comment after, if moveBefore remains the name, node.moveBefore(reference) is also doomed as one would throw, the other one hopefully not? ... but then again, if it doesn't, it will create already confusion, so that this spot-on naming choice (for dev intents) affect both the present and the future of the DOM ... please be careful and take full responsibility for the currently chosen name over a new field nobody has any concrete results about. Please change name already if throwing is desired, for a name shaped after 20 years of non-throwing legacy (insertBefore).

@domenic
Copy link
Member

domenic commented Nov 29, 2024

Hey all,

I want to weigh in with my perspective in support of both a low-level moveBefore(), and a higher-level version of moveOrInsertBefore() that has fallback behavior (but only in some cases). But first, I want to briefly discuss some meta-issues.

This thread is unfortunately noisy, full of long and repetitive comments by the same people. This is not helpful for making your point, and indeed is more likely to cause entrenchment and get yourself ignored. I usually subscribe to all whatwg/* issues, but had to unsubscribe from this thread (and the associated pull request) because of the excess noise a week ago. I see that since then there have been tens of comments. Please consider a more constructive strategy for engaging in standards bodies in the future, such as leaving a single, ideally short comment laying out your position and then accepting the fact that others might disagree. I'll call out @sorvell's single such comment at #1255 (comment) as a good example of this.

Secondly, I think it's discouraging for us as spec writers and implementers to work on such a cool, technically-difficult, and long-awaited feature, which does the magic work of preserving iframe/canvas/etc. state even across moves… and have almost the entirety of the web developer feedback be about surface-level details. There's no appreciation or encouragement; there's just complaints about how terrible it is to write an extra if statement, often coupled with hyperbolic discussions about how it makes the API useless or footgunned or non-performant. Now, we aren't entitled to positive feedback! We're doing this as part of our jobs, after all. But please consider the human cost of how you're choosing to engage in these discussions.

Alright, on to the technical bits.

I don't think performance is a serious consideration here. if statements are not significantly faster in C++ vs. JavaScript. It would be best to set that aside.

I agree there's value in a low-level moveBefore() primitive that only moves. This gives clear semantics for low-level DOM manipulation, where you strongly intend move semantics, and you know it's a programmer error if you accidentally create code that crosses tree boundaries and thus is no longer able to do a move. The DOM is already full of such cases: e.g., we know it's a programmer error to try to insert an Element as a child of an Attr, and we don't fall back to inserting it into the Attr's ownerElement. We throw an exception, so you can detect such coding errors quickly. Even for the current moveBefore() PR, nobody is claiming that there should be fallback behavior for moving ancestors to become their own grandchildren.

However, unlike the Attr-into-Element case, which is always a programmer error and falling back would never really make sense, I think there are some cases where you legitimately want to fall back to insert instead of move.

Recall that there are a few error cases under discussion:

  • Disconnected-to-connected
  • Connected-to-disconnected
  • Cross-document

My proposal is that we allow moveOrInsertBefore() to fall back to insertBefore() behavior for the disconnected-to-connected case, but not the other two. Because disconnected-to-connected is the case where you don't lose any state. You can "gain" some state, by running the connected steps: e.g., your iframes will start loading. And that is signaled by the OrInsert part of the name.

I claim that the cross-document case is clearly a programmer error. Just like frameworks bubble up the exception that is thrown if you try to insert two DocumentType nodes into a Document, without catching it and adding fallback, they should bubble up the exception that is thrown if the web developer invokes some framework operation which moves elements across documents. Because otherwise, the web developer will silently and unpredictably lose state: iframes will be unloaded/reloaded, canvas contents will be cleared, form values will be reset, etc. And that state loss will only happen in the less-well-tested cases of cross-document operations.

Alternatively, frameworks could surface clearly-different APIs for same-document vs. cross-document operations, if they don't want to throw. This is not something they need to do today, because today there is no DOM API whose behavior differs significantly depending on cross- vs. same-document. But if a framework truly intends to be compatible with multiple documents, they're going to need to start surfacing this distinction to their user, because now such an operation exists. There's just no avoiding the extra complexity here. Hiding it is the wrong move, and not one we should encourage in the API design by having moveOrInsertBefore() silently fall back to insertBefore() in the cross-document case.

The connected-to-disconnected case is a bit trickier, but only a bit. I don't claim that it's always a programmer error to be moving an element into a disconnected document or DocumentFragment. But I do claim that, similar to the cross-document case, it needs to be handled consciously, not via automatic fallback, because of the possibility of lost state. Today, it's a transparent refactoring to start with code that moves from in-document point A to in-document point B, and rewrite it to code that moves from in-document point A, to an out-of-document DocumentFragment, to in-document point B. In both cases, you disconnect the element (e.g. unload the iframe), then reconnect it (reload it). But if you start using moveOrInsertBefore(), this is no longer a transparent refactoring: it's a state-losing bug. And we should surface such state-losing bugs loudly, via exceptions, and cause developers to actively handle such cases with extra code that makes their intent clear.

I think my proposal gives a version of moveOrInsertBefore() that frameworks and developers could slot in, in place of insertBefore(), which is basically just an "upgrade" to move behavior in all the cases that count. It still allows the relatively-common operation of assembling a DOM tree in a disconnected location before inserting into the document, via its OrInsert functionality. But it disallows the state-losing connected-to-disconnected and cross-document cases, which should be load failures by default, and the platform should not provide an easy escape hatch from.

I think we have ample evidence from this thread that this would be helpful to web developers today, and would be suitable for merging alongside the low-level moveBefore() primitive.

It's possible some web and framework developers feel that a state-losing fallback would be helpful, beyond what I've proposed. I am skeptical that we have enough evidence to support that conclusion. As per the above, I don't think frameworks have yet adapted to the reality where cross-document or connected-to-disconnected-to-connected are state-losing operations. I'd like to see how that goes for the next year or so before considering an even-more-lenient state-losing fallback version of this API.

@WebReflection
Copy link

WebReflection commented Nov 29, 2024

@domenic not sure you're still ignoring this thread, but worth trying to communicate my (as terse as possible) answer, and I am not speaking for others, just for myself, hoping others agree (with maybe just a thumb up instead of repeated comments):

  1. this was, to me, an opportunity to have a better insertBefore story the Web knew so far, meaning, we expected that if any insertBefore operation would've worked, no matter the parent/target situation, we expected (probably wrongly so) to "just work" and eventually have a stricter flag/third-argument for when the throwing was desired: { fallback: false }
  2. everyone is interested in the living DOM diffing ... I wrote many differs to date and all targeted living DOM. I don't think there is out there a single use-case where people would move live nodes into offline fragments (connected -> disconnected), but happy to learn better ... in all my differs, gone nodes are gone, via remove() or parentNode.removeChild(thatOne)
  3. accordingly, if the API can cover connected->connected, disconnected->disconnected (even if unusual) and disconnected->connected use cases, I think all libraries would be covered as the diffing is done around the live parentNode and rarely, if ever, around some offline fragment ... that check is also O(1) instead of O(n) per each child, as it's clear when the parentNode is not connected, there's no point in even trying to use moveBefore
  4. when me (but I guess others too) mentioned not throwing I am pretty sure we all meant "not throwing in all cases where insertBefore would NOT throw too" ... it wasn't a general not throwing argument, it had insertBefore as baseline for throwing (attributes, like you mentioned, wrong reference nodes and whatnot)
  5. I would welcome this new API, I am just a bit sad we'll land 2 APIs that do almost exactly the same thing except one case
  6. by no mean I ever meant to be against this huge effort to add a new primitive to the DOM, and I would like to thank everyone involved, it's just that the final detail, it's name, semantic and English-related understanding of what move means in dictionaries, that made me sad such best name, moveBefore, will be (speculative) used less in the future than this moveOrInsertBefore proposal, because the latter is what most people expects while the former already needed an amend, regardless the mentioned possibility to have it opt-in for throwing as third argument: { fallback: false }
  7. the if statement is not the issue, bindings and reaching Node.prototype per each isConnected out of potentially thousand nodes to move is ... somebody already run some benchmark, it's just slower, as slower would be to access for no reason node.nodeType per every node that is being manipulated out there ... add the fact such check is duplicated anyhow behind the scene, it feels really off as a guard for a new API (again, O(1) for parent node check only VS O(n) for every node that would like to be moved)

Last, but not least, thanks for your comment and the will to fix users' expectations, in a way or another ❤️

P.S. I'd like to propose placeBefore instead of moveBeforeOrInsert if there's no chance to change current moveBefore contract.


how many child.isConnected are required?

moveBefore - as it stands

Users must check both parent and child.

case container target c-ops t-ops
1 connected connected O(1) O(n)
2 disconnected disconnected O(1) O(n)
3 connected disconnected O(1) O(n)
4 disconnected connected O(1) O(n)

moveBeforeOrInsert - as proposed

Users must check parent, use the method if connected, use fallback if not.

case container target c-ops t-ops
1 connected connected O(1) O(0)
2 disconnected disconnected O(1) O(n)
3 connected disconnected O(1) O(0)
4 disconnected connected O(1) O(n)

With moveBefore(newNode, refNode, { fallback: true }) all checks are O(0) on user-land, it might throw anyway in the latter case if impossible to move, when { fallback: false } it throws as desired for all the good reasons it should that are part of this API.

The proposed default for this third argument is { fallback: true }.

Alternatively, a placeBefore name would be easier to digest, imho.

@triskweline
Copy link

I was surprised by the signature of moveBefore().

I had hoped to describe the move destination more ergonomically, like with insertAdjacentElement().

@domfarolino domfarolino added stage: 3 Committed and removed stage: 2 Iteration labels Dec 19, 2024
aarongable pushed a commit to chromium/chromium that referenced this issue Dec 27, 2024
This CL ships the `ParentNode#moveBefore` API to Chrome stable,
targeting M133. See:

 - https://chromestatus.com/feature/5135990159835136
 - https://groups.google.com/a/chromium.org/g/blink-dev/c/YE_xLH6MkRs
 - https://issues.chromium.org/issues/40150299
 - whatwg/dom#1255
 - whatwg/dom#1307
 - whatwg/html#10657

R=chrishtr, masonf, nrosenthal

Bug: 40150299
Change-Id: I005f1db250d731d6845b5238048bbd0b90b20c81
Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/6112499
Reviewed-by: Noam Rosenthal <[email protected]>
Reviewed-by: Mason Freed <[email protected]>
Reviewed-by: Chris Harrelson <[email protected]>
Commit-Queue: Chris Harrelson <[email protected]>
Cr-Commit-Position: refs/heads/main@{#1400559}
@syduki
Copy link

syduki commented Jan 15, 2025

IMHO re-ordering is the most needed feature in the majority of frameworks/libraries, less re-parenting, so why not start with a simpler method to cover this specific use case, such as targetNode.move(beforeNode), until the moveBeforeOrInsert mammoth is eventually conceived?

@WebReflection
Copy link

WebReflection commented Jan 16, 2025

@triskweline moveBefore was suggested after insertBefore which is one API that covers them all and the most used to polyfill newer APIs that came after, meaning you can polyfill with ease a moveAdjacentElement through moveBefore, it’d be slower and/or slightly more awkward the other way around (as in: adjacent does a lot, before does one thing)

@dead-claudia
Copy link

IMHO re-ordering is the most needed feature in the majority of frameworks/libraries, less re-parenting, so why not start with a simpler method to cover this specific use case, such as targetNode.move(beforeNode), until the moveBeforeOrInsert mammoth is eventually conceived?

@syduki As long as it remains in the same document, there really isn't much difference between moving within the same parent and giving it a new parent.

Plus, existing DOM APIs have the same state and animation resetting behavior regardless of whether it's in the same parent or moved to another parent. So doing it this way is more consistent with the current state of things than what you're proposing.

@syduki
Copy link

syduki commented Jan 16, 2025

there really isn't much difference between moving within the same parent and giving it a new parent.

@dead-claudia that may be true from the user screen perspective, I was talking about actual implementation.

@dead-claudia
Copy link

there really isn't much difference between moving within the same parent and giving it a new parent.

@dead-claudia that may be true from the user screen perspective, I was talking about actual implementation.

From an implementation perspective, the difference is still fairly low.

  • Persistent render states, like animations, are linked to nodes, not tree positions. As you can always add and remove nodes at will before a given node, linking such states to tree positions turns an O(1) operation to an O(n) operation, something obviously bad.
  • Browsers do cache tree positions and related style calculations to boost style performance, but this is just an optimization and any insertion and removal necessarily invalidates the parent nodes impacted. They have to compute this for every layout pass, but it almost never changes for individual nodes, so caching it brings very little computational overhead.

@syduki
Copy link

syduki commented Jan 16, 2025

@dead-claudia and what has this with implementation? As far as I know, implementation is not a synonym of optimization.
I already got your point that there isn't much difference between Jungle.moveBeforeOrInsert(Monkey, Banana) and Monkey.move(Banana) in terms of implementation effort, just a little caching here and there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
a11y-tracker Group bringing to attention of a11y, or tracked by the a11y Group but not needing response. addition/proposal New features or enhancements needs implementer interest Moving the issue forward requires implementers to express interest stage: 3 Committed
Development

Successfully merging a pull request may close this issue.