Macro parser performance improvements and refactoring#37701

Mark-Simulacrum · 2016-11-11T01:22:17Z

This PR locally increased performance of #37074 by ~6.6 minutes.

Follow up to #37569, but doesn't focus explicitly on expansion performance.

History is relatively clean, but I can/will do some more polishing if this is deemed mergeable. Partially posting this now so I can get Travis to run tests for me.

r? @jseyfried

frewsxcv · 2016-11-11T04:43:36Z

/build/src/libsyntax/ext/tt/macro_parser.rs:494: line longer than 100 chars

jseyfried · 2016-11-12T06:40:48Z

src/libsyntax/ext/tt/macro_parser.rs

I believe this recreates the Rc that we just unwrapped. Perhaps this could be avoided with something like:

while let TokenTree::Token(sp, token::Interpolated(nt)) = tt { if let token::NtTT(..) = *nt { match Rc::try_unwrap(nt) { Ok(token::NtTT(sub_tt)) => tt = sub_tt, Ok(_) => unreachable!(), Err(nt_rc) => match *nt_rc { token::NtTT(ref sub_tt) => tt = sub_tt.clone(), _ => unreachable!(), }, } } else { tt = TokenTree::Token(sp, token::Interpolated(nt.clone())); break } }

Also makes nameize non-public since it's only locally used.

Change multiple functions to be non-public. Change nameize to accept an iterator so as to avoid an allocation.

Mark-Simulacrum · 2016-11-12T14:45:54Z

src/libsyntax/ext/tt/macro_parser.rs

+                        new_pos.matches[idx]
+                            .push(Rc::new(MatchedSeq(sub, mk_sp(ei.sp_lo,
+                                                                span.hi))));
                    }


If there's a way to avoid doing this work here, that would likely bring significant wins. Callgrind reports ~1 trillion instructions (if I'm reading it right) spent on the let sub = ei.matches[idx].clone(); line for the workload I'm profiling.

We might be able to avoid this work by refactoring Rc out of Vec in NamedMatch::MatchedSeq, that is MatchedSeq(Vec<Rc<NamedMatch>>, syntax_pos::Span) -> MatchedSeq(Rc<Vec<NamedMatch>>, syntax_pos::Span), perhaps with a RefCell between the Rc and Vec.

Then, sub would be an Rc so cloning would be cheap, and Rc<NamedMatch> would never be needed since cloning NamedMatch would be almost as cheap as an cloning an Rc.

Mark-Simulacrum · 2016-11-12T16:16:08Z

After realizing that I was benchmarking old code (from the make build system, I switched to rustbuild mid-PR); latest perf gains are even higher: from ~890 seconds with rustc 1.14.0-nightly (cae6ab1c4 2016-11-05) to ~500 seconds with 2189f57 (as of now, that will change if/when this is rebased). This is a performance gain of 6.6 minutes.

jseyfried

Excellent! r=me unless you'd like to add more commits.

Mark-Simulacrum · 2016-11-13T05:03:08Z

I'll try the optimization suggested by @jseyfried and clean up history a little.

Mark-Simulacrum · 2016-11-13T17:17:27Z

Okay, so the suggested optimization doesn't work without more changes, since then we start sharing the vector instead of duplicating it--and we don't want to share it. It's something to look into for the future, but I'll leave it as out of scope for this PR since the changes are unlikely to be trivial.

jseyfried · 2016-11-13T22:27:35Z

@bors r+

bors · 2016-11-13T22:27:36Z

📌 Commit 2189f57 has been approved by jseyfried

bors · 2016-11-14T00:23:57Z

⌛ Testing commit 2189f57 with merge 87b76a5...

@jseyfried

…fried Macro parser performance improvements and refactoring This PR locally increased performance of #37074 by ~6.6 minutes. Follow up to #37569, but doesn't focus explicitly on expansion performance. History is relatively clean, but I can/will do some more polishing if this is deemed mergeable. Partially posting this now so I can get Travis to run tests for me. r? @jseyfried

bors · 2016-11-14T03:46:32Z

rust-highfive assigned jseyfried Nov 11, 2016

brson added the relnotes Marks issues that should be documented in the release notes of the next release. label Nov 11, 2016

Mark-Simulacrum force-pushed the macro-parser-impvement branch from 11bf624 to 8c10253 Compare November 11, 2016 04:52

jseyfried reviewed Nov 12, 2016

View reviewed changes

Mark-Simulacrum force-pushed the macro-parser-impvement branch from a5c2641 to 5fd8f3d Compare November 12, 2016 13:31

Mark-Simulacrum added 10 commits November 12, 2016 06:42

Cleanup macro_parser::parse, removing a few clones.

568874b

Remove unused argument from nameize.

7221b07

Also makes nameize non-public since it's only locally used.

Refactor to extending from a drain instead of while looping.

c9e6089

Clean up extraneous &mut.

eef10d0

Factor out NamedParseResult.

68abb24

Refactor parse_nt.

27c0986

Factor out inner current Earley item loop.

b8d6686

Change multiple functions to be non-public. Change nameize to accept an iterator so as to avoid an allocation.

Use SmallVector for eof and bb eis.

6046595

Move next_eis out of main loop to avoid re-allocating and dropping it.

38912ee

Remove extra level of nesting.

2189f57

Mark-Simulacrum force-pushed the macro-parser-impvement branch from 5fd8f3d to 2189f57 Compare November 12, 2016 14:42

Mark-Simulacrum commented Nov 12, 2016

View reviewed changes

jseyfried approved these changes Nov 13, 2016

View reviewed changes

Mark-Simulacrum force-pushed the macro-parser-impvement branch from abc57c4 to 2189f57 Compare November 13, 2016 17:15

bors merged commit 2189f57 into rust-lang:master Nov 14, 2016

Mark-Simulacrum deleted the macro-parser-impvement branch December 17, 2016 04:23

Mark-Simulacrum mentioned this pull request May 16, 2017

Compiling rust-uinput takes almost 20 minutes #37074

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Macro parser performance improvements and refactoring#37701