Improving rooted_tree_isomorphism for deep trees #7945

amcandio · 2025-03-29T20:21:29Z

Convert `generate_isomorphism` to an Iterative Implementation

I was looking at the tree isomorphism logic while reviewing #7929 and realized that the implementation uses a recursive approach to compute the node mapping. This can be problematic with deep trees because it can raise a RecursionError.

@rossbar since you are already working on this module please let me know if you are already working on this particular improvement. Happy to drop this PR!

Summary

This PR refactors the generate_isomorphism function to replace its recursive implementation with an iterative approach using an explicit stack. The change is intended to prevent potential stack overflow issues when dealing with deeply nested trees (e.g., long path graphs).

Changes Made

Converted generate_isomorphism from a recursive function to an iterative one using a stack.
Ensured that child nodes are pushed onto the stack in reverse order to maintain the original left-to-right processing order.
Removed the previous recursive implementation.

Reason for Change

The recursive implementation could cause a RecursionError for deep trees, such as long path graphs. This issue was observed in test cases with large trees (e.g., test_long_paths_graphs). By converting the function to an iterative version, we avoid excessive recursion depth and improve performance for large inputs.
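To illustrate the failure mode, here is a minimal, self-contained sketch that runs a copy of the recursive helper on two hand-built chains. It is not the NetworkX code path itself: the `build_two_chains` helper, the node numbering, and the chosen depth are assumptions made for this demo, and the `assert` from the original helper is omitted.

```python
import sys


def generate_isomorphism_recursive(v, w, M, ordered_children):
    # Copy of the recursive helper from this PR's description (assert omitted).
    M.append((v, w))
    for x, y in zip(ordered_children[v], ordered_children[w]):
        generate_isomorphism_recursive(x, y, M, ordered_children)


def build_two_chains(n):
    # Hypothetical layout: chain A uses nodes 0..n-1, chain B uses n..2n-1.
    # Every node has exactly one child, except the last node of each chain.
    children = {i: [i + 1] for i in range(2 * n)}
    children[n - 1] = []
    children[2 * n - 1] = []
    return children


# Pick a depth well beyond the interpreter's current recursion limit.
depth = sys.getrecursionlimit() * 5
ordered_children = build_two_chains(depth)

hit_recursion_error = False
try:
    generate_isomorphism_recursive(0, depth, [], ordered_children)
except RecursionError:
    hit_recursion_error = True

print("RecursionError raised:", hit_recursion_error)
```

The recursion needs one Python frame per tree level, so any chain deeper than the recursion limit fails regardless of how small each frame is.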

Implementation Details

Previous recursive version:

def generate_isomorphism(v, w, M, ordered_children):  
    assert v < w  
    M.append((v, w))  
    for x, y in zip(ordered_children[v], ordered_children[w]):  
        generate_isomorphism(x, y, M, ordered_children)

New iterative version:

def generate_isomorphism(v, w, M, ordered_children):  
    stack = [(v, w)]  
    while stack:  
        curr_v, curr_w = stack.pop()  
        assert curr_v < curr_w  
        M.append((curr_v, curr_w))  
        stack.extend(  
            zip(reversed(ordered_children[curr_v]), reversed(ordered_children[curr_w]))  
        )
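As a quick sanity check of the ordering claim, the sketch below runs both versions on a small hand-made input and compares the resulting lists. The toy `ordered_children` dictionary is an assumption for illustration only, and the `assert` lines from the helpers are left out to keep the example short.

```python
def generate_isomorphism_recursive(v, w, M, ordered_children):
    M.append((v, w))
    for x, y in zip(ordered_children[v], ordered_children[w]):
        generate_isomorphism_recursive(x, y, M, ordered_children)


def generate_isomorphism_iterative(v, w, M, ordered_children):
    stack = [(v, w)]
    while stack:
        curr_v, curr_w = stack.pop()
        M.append((curr_v, curr_w))
        # Push child pairs reversed so the leftmost pair is popped first.
        stack.extend(
            zip(reversed(ordered_children[curr_v]), reversed(ordered_children[curr_w]))
        )


# Hypothetical toy input: tree A on nodes 0-4, tree B on nodes 10-14.
ordered_children = {
    0: [1, 2], 1: [3, 4], 2: [], 3: [], 4: [],
    10: [11, 12], 11: [13, 14], 12: [], 13: [], 14: [],
}

recursive_result, iterative_result = [], []
generate_isomorphism_recursive(0, 10, recursive_result, ordered_children)
generate_isomorphism_iterative(0, 10, iterative_result, ordered_children)

print(recursive_result == iterative_result)
```

Both lists come out in the same pre-order (parent pair first, then the left subtree, then the right subtree), which is what the reversed push is meant to preserve.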

Testing

Verified that all existing test cases pass
Added a new test test_long_paths_graphs to validate two long path graphs
Ensured that tree traversal order remains unchanged.

rossbar

Thanks @amcandio - switching from recursive->iterative implementation is indeed an improvement as the test indicates!

Just a couple suggestions for minor (potential) improvements.

Also since this "helper function" is only used in one place, we could also consider removing it and just moving the implementation to the appropriate location inside rooted_tree_isomorphism. But that can be handled/proposed in a follow-up PR if you prefer to focus on these changes here!

networkx/algorithms/isomorphism/tests/test_tree_isomorphism.py

rossbar · 2025-03-30T23:11:16Z

networkx/algorithms/isomorphism/tree_isomorphism.py

+    # Start with the initial pair in the stack
+    stack = [(v, w)]
+    while stack:
+        curr_v, curr_w = stack.pop()
+        assert curr_v < curr_w
+        M.append((curr_v, curr_w))
+        # Zip children and push them in reverse order to maintain processing order.
+        stack.extend(
+            zip(reversed(ordered_children[curr_v]), reversed(ordered_children[curr_w]))
+        )


Also +1 for switching to an iterative implementation. Another minor thought though: having to reverse the children every time is cumbersome. Perhaps a deque with popleft would do the trick?

The reverse is needed so that my change doesn't modify any previous behavior (i.e. the isomorphism mapping is a list and I'm maintaining its order). But if people are okay with changing that, I can replace it with a nicer BFS implementation like:

def generate_isomorphism(v, w, ordered_children):
    ret = [(v, w)]
    next_index = 0
    while next_index < len(ret):
        curr_v, curr_w = ret[next_index]
        assert curr_v < curr_w
        next_index += 1
        ret.extend(zip(ordered_children[curr_v], ordered_children[curr_w]))
    return ret

To be honest, I don't think anyone should depend on the order of the isomorphism list, but you never know.

My point was you can do away with the reversed by using a deque to implement the stack instead of a list - here's a quick example:

>>> from collections import deque
>>> lstack = []  # the way it is now
>>> lstack.extend(reversed(range(3)))
>>> while lstack:
...     print(lstack.pop())
0
1
2
>>> # Now, with a deque
>>> dstack = deque()
>>> dstack.extend(range(3))  # extend with objects, no reversal
>>> while dstack:
...     print(dstack.popleft())  # Use pop-left to implement FIFO
0
1
2

The problem is that the previous recursive implementation naturally does a DFS, which is LIFO. The deque will make the algorithm do a BFS, which will change the output order.

To avoid the reversal we would need a stack of deques, so that we do LIFO across the different levels of the tree and FIFO on nodes with the same parent:
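A small sketch of that difference, assuming a hypothetical five-node toy input (the `ordered_children` dictionary and both function names are made up for the demo): the list-based stack reproduces the recursive pre-order, while a single deque consumed with `popleft` visits pairs level by level.

```python
from collections import deque

# Hypothetical toy input: tree A on nodes 0-4, tree B on nodes 10-14.
ordered_children = {
    0: [1, 2], 1: [3, 4], 2: [], 3: [], 4: [],
    10: [11, 12], 11: [13, 14], 12: [], 13: [], 14: [],
}


def dfs_pairs(v, w):
    # List used as a LIFO stack, children pushed in reverse (as in this PR).
    M, stack = [], [(v, w)]
    while stack:
        curr_v, curr_w = stack.pop()
        M.append((curr_v, curr_w))
        stack.extend(
            zip(reversed(ordered_children[curr_v]), reversed(ordered_children[curr_w]))
        )
    return M


def bfs_pairs(v, w):
    # Single deque used as a FIFO queue, children appended without reversal.
    M, queue = [], deque([(v, w)])
    while queue:
        curr_v, curr_w = queue.popleft()
        M.append((curr_v, curr_w))
        queue.extend(zip(ordered_children[curr_v], ordered_children[curr_w]))
    return M


dfs_result = dfs_pairs(0, 10)
bfs_result = bfs_pairs(0, 10)
print(dfs_result)
print(bfs_result)
```

The two lists contain the same pairs, so either describes the same mapping; only the position of each pair in the list changes.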

from collections import deque

stack = [deque([(v, w)])]
while stack:
    last = stack[-1]
    if not last:
        stack.pop()
        continue
    curr_v, curr_w = last.popleft()
    M.append((curr_v, curr_w))
    stack.append(deque(zip(ordered_children[curr_v], ordered_children[curr_w])))
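For reference, here is a runnable sketch of the stack-of-deques idea wrapped in a function and checked against a recursive traversal. The function names and the toy `ordered_children` input are assumptions for illustration; this is not code from the PR branch.

```python
from collections import deque


def pairs_recursive(v, w, M, ordered_children):
    M.append((v, w))
    for x, y in zip(ordered_children[v], ordered_children[w]):
        pairs_recursive(x, y, M, ordered_children)


def pairs_stack_of_deques(v, w, M, ordered_children):
    # One deque per tree level currently being explored:
    # LIFO between levels, FIFO among siblings.
    stack = [deque([(v, w)])]
    while stack:
        last = stack[-1]
        if not last:
            # All siblings at this level are done; go back up one level.
            stack.pop()
            continue
        curr_v, curr_w = last.popleft()
        M.append((curr_v, curr_w))
        stack.append(deque(zip(ordered_children[curr_v], ordered_children[curr_w])))


# Hypothetical toy input: tree A on nodes 0-4, tree B on nodes 10-14.
ordered_children = {
    0: [1, 2], 1: [3, 4], 2: [], 3: [], 4: [],
    10: [11, 12], 11: [13, 14], 12: [], 13: [], 14: [],
}

expected, actual = [], []
pairs_recursive(0, 10, expected, ordered_children)
pairs_stack_of_deques(0, 10, actual, ordered_children)
print(expected == actual)
```

This keeps the original order without any `reversed` call, at the cost of allocating one deque per visited pair, including empty ones for leaves.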

I can make this change, but it might be better to drop backwards compatibility of the order and simplify the logic.

~~How about?~~
[Edit: It's actually slower and has to create a list in the middle -- so probably more memory.... ]
Nevermind! :{

while stack:
    curr_v, curr_w = stack.pop()
    M.append((curr_v, curr_w))
    stack.extend(
        reversed(list(zip(ordered_children[curr_v], ordered_children[curr_w])))
    )

Yeah, I thought about that and didn't do it because of the extra memory. Also, reversed(zip(...)) raises a TypeError:

In [60]: reversed(zip([1,2,3], [1,2,3]))
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
Cell In [60], line 1
----> 1 reversed(zip([1,2,3], [1,2,3]))

TypeError: 'zip' object is not reversible

Ah okay, I see what you mean now - I missed that there are two structures defined outside the scope of the function that are being accessed inside (all the more reason to get rid of the helper function definition IMO!)

In that case, I think it's slightly cleaner if we instead move the reversal to the sorted call on L193 where the ordered_children is originally populated. It's probably also worth adding a comment along with the reverse=True to explain why, something like: # reverse=True to preserve DFS order - see gh-7945.
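A rough sketch of what that suggestion could look like. The actual `sorted` call and its key are not shown in this thread, so `raw_children` and `sort_key` below are hypothetical stand-ins; the point is only that sorting with `reverse=True` up front lets the loop push child pairs without calling `reversed`.

```python
# Hypothetical unsorted child lists and sort keys; the real key used when
# ordered_children is populated is not shown in this thread.
raw_children = {
    0: [2, 1], 1: [4, 3], 2: [], 3: [], 4: [],
    10: [12, 11], 11: [14, 13], 12: [], 13: [], 14: [],
}
sort_key = {node: node for node in raw_children}  # distinct keys, no ties


def dfs_reversing_in_loop(v, w):
    # Ascending sort at build time, reversal on every push (as in this PR).
    children = {n: sorted(kids, key=sort_key.get) for n, kids in raw_children.items()}
    M, stack = [], [(v, w)]
    while stack:
        curr_v, curr_w = stack.pop()
        M.append((curr_v, curr_w))
        stack.extend(zip(reversed(children[curr_v]), reversed(children[curr_w])))
    return M


def dfs_reversing_at_sort(v, w):
    # reverse=True at build time to preserve DFS order; plain zip on push.
    children = {
        n: sorted(kids, key=sort_key.get, reverse=True)
        for n, kids in raw_children.items()
    }
    M, stack = [], [(v, w)]
    while stack:
        curr_v, curr_w = stack.pop()
        M.append((curr_v, curr_w))
        stack.extend(zip(children[curr_v], children[curr_w]))
    return M


in_loop = dfs_reversing_in_loop(0, 10)
at_sort = dfs_reversing_at_sort(0, 10)
print(in_loop == at_sort)
```

One caveat: because `sorted` is stable, `sorted(..., reverse=True)` keeps tied elements in their original relative order, whereas reversing an ascending sort flips them. With tied keys the two variants may therefore order some sibling pairs differently.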

Of course - this is still a nit, not a blocker!

Ah that's a good idea. Will address in a follow-up PR

Making test_long_paths_graphs more performant Co-authored-by: Ross Barnowski <rossbar@caltech.edu>

amcandio · 2025-03-31T02:32:25Z

> Thanks @amcandio - switching from recursive->iterative implementation is indeed an improvement as the test indicates!
>
> Just a couple suggestions for minor (potential) improvements.
>
> Also since this "helper function" is only used in one place, we could also consider removing it and just moving the implementation to the appropriate location inside rooted_tree_isomorphism. But that can be handled/proposed in a follow-up PR if you prefer to focus on these changes here!

Thanks! By this, do you mean just declaring the function inside the rooted_tree_isomorphism definition? Are we okay with potentially breaking backwards compatibility? (I doubt anyone uses this function, though.)

amcandio · 2025-03-31T02:57:41Z

Replied to comments, I think they make sense. Is it okay if I take care of them in a separate PR?

dschult · 2025-03-31T02:57:54Z

I think he meant: just move the guts of generate_isomorphism directly into the function where it is currently called. No need to enclose it in a function definition or call that function.

Also, this helper function is not part of the public API -- it is just old enough that it doesn't have a leading underscore. So we are not worried about users who might be using it directly.

Finally, there is an assert statement in that code that shouldn't be needed (especially if the code is moved inside the function where the children are created). Could you remove that line?

Thanks for this!

amcandio · 2025-03-31T03:01:23Z

Got it! Any concern about changing the order of isomorphism list?

rossbar · 2025-03-31T03:22:24Z

> Got it! Any concern about changing the order of isomorphism list?

While it's probably not super important, it's also easy to preserve so we should do so - see my comment above about deque!

rossbar

Thanks @amcandio ! I approve this as my comments to this point are nits not blockers and can be addressed separately.

dschult

I approve this as well.

Improving rooted_tree_isomorphism for deep trees

9f617bd

amcandio mentioned this pull request Mar 30, 2025

Improve Performance of Tree Isomorphism and Center Calculation #7946

Merged

rossbar reviewed Mar 30, 2025


rossbar added the type: Enhancements label Mar 30, 2025

Update networkx/algorithms/isomorphism/tests/test_tree_isomorphism.py

5d58783

Making test_long_paths_graphs more performant Co-authored-by: Ross Barnowski <rossbar@caltech.edu>

rossbar approved these changes Apr 1, 2025


dschult approved these changes Apr 1, 2025


dschult merged commit 44d5471 into networkx:main Apr 1, 2025
39 checks passed

rossbar mentioned this pull request Apr 1, 2025

MAINT: Follow-up to 7945 - rm helper function #7952

Merged

amcandio deleted the root-tree-isomorphism-iterative branch April 9, 2025 03:45

amcandio mentioned this pull request Apr 13, 2025

Add Functions for Finding Connected Dominating Sets #7774

Merged

jarrodmillman added this to the 3.5 milestone May 29, 2025
