⚡️ Speed up function tarjan by 124%
#221
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
📄 124% (1.24x) speedup for
tarjaninstanza/models/common/chuliu_edmonds.py⏱️ Runtime :
70.0 milliseconds→31.3 milliseconds(best of167runs)📝 Explanation and details
The optimization achieves a 123% speedup by replacing expensive repeated
np.where()calls with a precomputed dependency lookup map.Key optimization:
np.where(np.equal(tree, i))[0]for each node during the DFS traversal (which scans the entire array each time), the optimized version builds adependents_maponce at the beginning wheredependents_map[i]contains all nodes that haveias their head.Why this is faster:
strong_connect(i)took 95% of runtime in the original (132ms out of 140ms total)Performance characteristics:
The optimization maintains identical behavior and output while dramatically reducing the algorithmic complexity of the dependency finding step.
✅ Correctness verification report:
⚙️ Existing Unit Tests and Runtime
🌀 Generated Regression Tests and Runtime
To edit these changes
git checkout codeflash/optimize-tarjan-mh4g4eqiand push.