Print `token::Interpolated` with token stream pretty printing. #125174

nnethercote · 2024-05-16T10:38:44Z

This is a step towards removing token::Interpolated (#124141). It unavoidably changes the output of the stringify! macro, generally for the better.

r? @petrochenkov

nnethercote · 2024-05-16T10:41:42Z

@bors try @rust-timer queue

…, r=<try> Print `token::Interpolated` with token stream pretty printing. r? `@ghost`

bors · 2024-05-16T10:42:53Z

⌛ Trying commit f7ce1ac with merge 53af78d...

nnethercote · 2024-05-17T06:45:59Z

Full details are in the individual commits.

@dtolnay has agreed to update the tests/ui/macros/stringify.rs test and move the AST pretty printing checks into a new test. That could be done before or after this PR is merged.

A crater run should be done before this merges, to make sure the stringify! changes don't cause problems for existing code.

Performance is unlikely to be affected, but let's confirm that:
@bors try @rust-timer queue

…, r=<try> Print `token::Interpolated` with token stream pretty printing. This is a step towards removing `token::Interpolated` (rust-lang#124141). It unavoidably changes the output of the `stringify!` macro, generally for the better. r? `@petrochenkov`

bors · 2024-05-17T06:47:11Z

⌛ Trying commit 9c693ad with merge e07dc21...

compiler/rustc_parse/src/parser/mod.rs

bors · 2024-05-17T08:24:10Z

☀️ Try build successful - checks-actions
Build commit: e07dc21 (e07dc21eff3c4c268d22684a47778eff58db7d17)

dtolnay · 2024-05-17T09:37:54Z

While I can see how this PR is a necessary step toward removing token::Interpolated, I think it could use some better explanation why removing token::Interpolated is a necessary step toward solving #67062 (it's a necessary stepping stone) before we would commit to this change.

It unavoidably changes the output of the stringify! macro, generally for the better.

This is the opposite of what I expect, and I consider stringify! changes to be a casualty of choosing this approach. We'll need to add a separate stringify_expr! macro to core. The AST pretty printer is always going to be more capable at stringifying expressions than the tokenstream pretty printer.

stringify! is used by the assert and dbg family of macros in the ecosystem (although apparently not by std's assert, which is a built-in and continues to use the AST pretty-printer). The prototypical usage is https://docs.rs/failure/0.1.8/failure/macro.ensure.html

#[macro_export]
macro_rules! ensure {
    ($cond:expr) => {
        if let false = $cond {
            return Err($crate::err_msg(format!("{}", stringify!($cond))));
        }
    };
}

Some changes picked up by https://github.com/dtolnay/anyhow's test suite from this PR:

- Error::msg::<<str as ToOwned>::Owned>.t(1) == 2
+ Error ::msg ::<< str as ToOwned >::Owned >.t (1) == 2

- f::<1>() != ()
+ f ::<1 >() != ()

- f::<-1>() != ()
+ f ::<- 1 >() != ()

- E::U::<> > E::U::<u8>
+ E ::U ::<> > E ::U ::<u8 >

- E::U::<u8> > E::U
+ E ::U ::<u8 > > E ::U

- b\"hmm\"[1] == b'c'
+ b\"hmm\" [1] == b'c'

- result? == 2
+ result ? == 2

- f as for<'a> fn() as usize * 0 != 0
+ f as for<'a > fn() as usize * 0 != 0

- &0 as &dyn EqDebug<i32, Assoc = bool> != &0
+ &0 as &dyn EqDebug <i32, Assoc = bool > != &0

- PhantomData as PhantomData<<i32 as ToOwned>::Owned> != PhantomData
+ PhantomData as PhantomData << i32 as ToOwned >::Owned > != PhantomData

- 0 as int!(...) != 0
+ 0 as int !(...) != 0

- 0 as int![...] != 0
+ 0 as int ![...] != 0

- 0 as int! { ... } != 0
+ 0 as int ! { ... } != 0

- if let -1..=1 = 0 { 0 } else { 1 } == 1
+ if let -1 ..=1 = 0 { 0 } else { 1 } == 1

rust-timer · 2024-05-17T09:39:05Z

Finished benchmarking commit (e07dc21): comparison URL.

Overall result: ❌ regressions - ACTION NEEDED

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.3%	[0.2%, 0.3%]	3
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	0.3%	[0.2%, 0.3%]	3

Max RSS (memory usage)

Results (primary 0.2%, secondary -5.0%)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.6%	[0.6%, 0.6%]	1
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-0.2%	[-0.2%, -0.2%]	1
Improvements ✅ (secondary)	-5.0%	[-5.0%, -5.0%]	1
All ❌✅ (primary)	0.2%	[-0.2%, 0.6%]	2

Cycles

Results (primary -0.3%, secondary 0.2%)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	3.3%	[3.3%, 3.3%]	1
Regressions ❌ (secondary)	2.4%	[2.1%, 2.6%]	3
Improvements ✅ (primary)	-1.5%	[-1.6%, -1.4%]	3
Improvements ✅ (secondary)	-1.5%	[-2.4%, -0.9%]	4
All ❌✅ (primary)	-0.3%	[-1.6%, 3.3%]	4

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 669.726s -> 668.949s (-0.12%)
Artifact size: 316.16 MiB -> 315.99 MiB (-0.06%)

nnethercote · 2024-05-17T10:15:48Z

The AST pretty printer is always going to be more capable at stringifying expressions than the tokenstream pretty printer.

We can discuss what "more capable" means, but there are cases visible in the stringify.rs diff where I consider the tokenstream pretty printer to do a better job.

original	AST pp	tokenstream pp	difference
`f :: < u8>( )`	`f::<u8>()`	`f :: < u8>()`	AST pp loses more whitespace
`f(true,)`	`f(true)`	`f(true,)`	AST loses the `,`
`for<> fn()`	`fn()`	`for<> fn()`	AST loses the `for<>`
`pub macro stringify() {}`	`pub macro stringify { () => {} }`	`pub macro stringify() {}`	AST inserts `() => {}`

Some changes picked up by https://github.com/dtolnay/anyhow's test suite from this PR:

How do I run that suite? I tried cargo test test_ensure. I can't reproduce the failures, and the failures you listed are surprising because the tokenstream pretty printer usually produces much nicer output than that.

dtolnay · 2024-05-17T10:24:19Z

How do I run that suite? I tried cargo test test_ensure.

cargo test --test test_ensure. I built rustc from 9c693ad (the current state of this PR) and ran cargo +stage2 test --test test_ensure using dtolnay/anyhow@fecdbbb to expose more of the failures simultaneously.

nnethercote · 2024-05-17T10:43:57Z

#[macro_export]
macro_rules! ensure {
    ($cond:expr) => {
        if let false = $cond {
            return Err($crate::err_msg(format!("{}", stringify!($cond))));
        }
    };
}

AFAICT this is the cfg(doc) version of that macro. The cfg(not(doc)) version is much more complex. I don't have time to look at this today, I'll take a look next week.

dtolnay · 2024-05-17T16:34:12Z

Minimal repro:

macro_rules! repro {
    ($foo:ident $colons:tt $bar:ident) => {
        stringify_expr!($foo $colons $bar)
    };
}

macro_rules! stringify_expr {
    ($expr:expr) => {
        stringify!($expr)
    };
}

fn main() {
    println!("{}", repro!(foo::bar));
}

- foo::bar
+ foo ::bar

nnethercote · 2024-05-20T08:35:00Z

@craterbot check

craterbot · 2024-05-20T08:35:07Z

👌 Experiment pr-125174 created and queued.
🤖 Automatically detected try build e07dc21
⚠️ Try build based on commit 9c693ad, but latest commit is 33d6e6f. Did you forget to make a new try build?
🔍 You can check out the queue and this experiment's details.

ℹ️ Crater is a tool to run experiments across parts of the Rust ecosystem. Learn more

craterbot · 2024-05-21T11:05:57Z

🚧 Experiment pr-125174 is now running

ℹ️ Crater is a tool to run experiments across parts of the Rust ecosystem. Learn more

…chenkov Tweak `Spacing` use Some clean-up precursors to rust-lang#125174. r? `@petrochenkov`

…chenkov Tweak `Spacing` use Some clean-up precursors to rust-lang#125174. r? ``@petrochenkov``

craterbot · 2024-05-23T08:05:55Z

🎉 Experiment pr-125174 is completed!
📊 192 regressed and 4 fixed (464004 total)
📰 Open the full report.

⚠️ If you notice any spurious failure please add them to the blacklist!
ℹ️ Crater is a tool to run experiments across parts of the Rust ecosystem. Learn more

Rollup merge of rust-lang#125316 - nnethercote:tweak-Spacing, r=petrochenkov Tweak `Spacing` use Some clean-up precursors to rust-lang#125174. r? ``@petrochenkov``

bors · 2024-05-23T09:13:29Z

☔ The latest upstream changes (presumably #125436) made this pull request unmergeable. Please resolve the merge conflicts.

dtolnay · 2024-05-23T09:48:00Z

michaelzoech/vkinfo: this one needs to be fixed.

error: proc-macro derive panicked
  --> src/vk.rs:62:14
   |
62 |     #[derive(Serialize)]
   |              ^^^^^^^^^
   |
   = help: message: called `Result::unwrap()` on an `Err` value: "failed to parse derive input: \"pubstruct InstanceCreateFlags { bits: u32, }\""

I skimmed a few of the rest and they all seemed like they might be the same issue.

dtolnay · 2024-05-23T16:06:41Z

Here is the issue: https://github.com/bitflags/bitflags/blob/1.0.1/src/lib.rs#L312

This is something I observed in anyhow also, with the new printer.
https://github.com/dtolnay/anyhow/blob/e6c730e8ec334eb3d7c6513940fd2dd6516ef7b6/src/ensure.rs#L750
I had to inserted spaces before ) and } to get the output I wanted, where I don't think I should have to.

IMO a punct immediately preceding a close delimiter with no space should be treated as Alone anyway, and the printer should never print space inside of () or [], and always print space inside of {}.

compiler/rustc_ast_pretty/src/pprust/state.rs

petrochenkov · 2024-05-23T18:23:34Z

I think it could use some better explanation why removing token::Interpolated is a necessary step toward solving #67062 (it's a necessary stepping stone) before we would commit to this change.

I don't think removing nonterminals is strictly required for addressing #67062.
It just greatly helps with testing by flooding the parser with Invisible-delimited groups, so if something goes wrong we notice it very soon.

I do feel that characterizing nonterminals as "legacy" or "slated for removal" in comments is premature.
They are not just used to implement some functionality that is (hopefully) can be implemented using invisible delimited groups, they are also an optimization allowing to avoid reparsing same pieces of code again and again when passing them through multiple nested macro calls (including recursive cases).
I'm not sure we'll be able to land the removal without introducing some kind of "TokenStream -> AST" cache, even if functionally invisible delimited groups work without too much breakage.

The AST pretty printer is always going to be more capable at stringifying expressions than the tokenstream pretty printer.

I'd expect large pieces of code like items (impls with nested functions, with large bodies with blocks) to be botched by token pretty-printing to a larger degree than expressions.
I'd actually expect expressions to do pretty well (unless they contain blocks with statements), if the decl macro issues are addressed.
Maybe if enum Spacing keeps some line break information it will help printing larger pieces of code better (but I'm not sure it it's worth doing).

For decl macros we have to be careful, because whitespace after tokens in rules doesn't necessarily match what you want in the final output.

From what I remember in decl macro output we just give up and print spaces (possibly prettified by space_between)?
We probably should do something better there given that the decl macro case causes majority of the problems for dtolnay.

nnethercote · 2024-05-24T03:35:06Z

I've looked at the crater results. There are two things going on.

Spacing

When I first created this PR on May 16 I had a commit "Add Spacing to mbe::TokenTree::Token". This was buggy, and could cause some pairs of tokens to be concatenated, like consecutive identifiers, leading to invalid output such as pubstruct. @dtolnay, this was the cause of some of the extra spaces you had to insert into ensure!. Once I realized this I removed the buggy commit, and made some other changes, and force-pushed on May 20. Only after doing that did I trigger the crater run.

But somehow that buggy commit ended up in the crater run: 3b977bb. The buggy commit doesn't show up in the PR's commits list. But that explains a lot of the failures such as nullptr-solutions.reflect, michaelzoech.vkinfo, and AlcaYezz.pong_rust. I was able to reproduce these failures locally with the buggy commit applied, and then they compiled successfully with the buggy commit removed.

So I think this cause of failures is solved, though I don't understand why the crater run included the buggy commit.

rental

rustc has a "pretty_printing_compatibility_hack" that causes slightly different token-handling behaviour when compiling versions of the rental crate prior to 0.5.6:

rust/compiler/rustc_expand/src/proc_macro.rs

Lines 123 to 133 in 78dd504

    
           let hack = crate::base::ann_pretty_printing_compatibility_hack(&item, &ecx.sess); 
        
           let input = if hack { 
        
               let nt = match item { 
        
                   Annotatable::Item(item) => token::NtItem(item), 
        
                   Annotatable::Stmt(stmt) => token::NtStmt(stmt), 
        
                   _ => unreachable!(), 
        
               }; 
        
               TokenStream::token_alone(token::Interpolated(Lrc::new(nt)), DUMMY_SP) 
        
           } else { 
        
               item.to_tokens() 
        
           };

The hack forces a matching item or statement to be put into an Interpolated token so that it gets printed with the AST pretty-printer, as opposed to the normal behaviour of converting it to a token stream that is then printed with the token stream pretty printer. This is done because the AST pretty-printer inserts an extra comma that rental relies on.

So this PR perfectly undoes the hack, because Interpolated tokens are now printed with the tokenstream pretty printer!
This accounts for all(?) of the other failures. Indeed, there was a previous attempt to remove the hack in #93275 and the crater run in that PR has a lot of failure overlap with the crater run in this PR.

So I think the best way to fix these failures is to finally remove the hack. It has been four years since it was added. I think that's better than trying to adapt the hack to work with the tokenstream pretty printer.

@petrochenkov, what do you think?

dtolnay · 2024-05-24T03:52:29Z

+1 from me for removing that hack targeting 4-year-old rental versions.

I don't understand why the crater run included the buggy commit.

Crater doesn't compile rustc. It uses an already-compiled rustc that got built by bors in the most recent "bors try".

In this PR there was a "bors try" in #125174 (comment). According to the bors response in #125174 (comment), the head of the PR at the time was commit 9c693ad. There was a force push from that commit to 33d6e6f right below #125174 (comment), followed by "craterbot check" in #125174 (comment). Craterbot uses the most recent try build, and says so in #125174 (comment): "🤖 Automatically detected try build e07dc21 ⚠️ Try build based on commit 9c693ad, but latest commit is 33d6e6f. Did you forget to make a new try build?"

nnethercote · 2024-05-24T04:24:31Z

Aha. Old try build, makes sense.

About rental: I have commented in #106060. The discussion there indicated people are being extremely cautious about even scaling back the hack, let alone removing it. Hmm. I've made a short argument, other more eloquent arguments in favour of removal would be appreciated if people have them...

Instead of using AST pretty printing. This is a step towards removing `token::Interpolated`, which will eventually (in rust-lang#124141) be replaced with a token stream within invisible delimiters. This changes (improves) the output of the `stringify!` macro in some cases. This is allowed. As the `stringify!` docs say: "Note that the expanded results of the input tokens may change in the future. You should be careful if you rely on the output." Test changes: - tests/ui/macros/stringify.rs: this used to test both token stream pretty printing and AST pretty printing via different ways of invoking of `stringify!` (i.e. `$expr` vs `$tt`). But those two different invocations now give the same result, which is a nice consistency improvement. This removes the need for all the `c2*` macros. The AST pretty printer now has more thorough testing thanks to rust-lang#125236. - tests/ui/proc-macro/*: minor improvements where small differences between `INPUT (DISPLAY)` output and `DEEP-RE-COLLECTED (DISPLAY)` output disappear.

petrochenkov · 2024-05-24T14:01:53Z

I'm also fine with removing the hack.

We still check for the `rental`/`allsorts-rental` crates. But now if they are detected we just emit a fatal error, instead of emitting a warning and providing alternative behaviour. The original "hack" implementing alternative behaviour was added in rust-lang#73345. The lint was added in rust-lang#83127. The tracking issue is rust-lang#83125. The direct motivation for the change is that providing the alternative behaviour is interfering with rust-lang#125174 and follow-on work.

nnethercote · 2024-05-27T03:01:19Z

#125596 converts the hack to a hard error, which is enough to unblock this PR.

We still check for the `rental`/`allsorts-rental` crates. But now if they are detected we just emit a fatal error, instead of emitting a warning and providing alternative behaviour. The original "hack" implementing alternative behaviour was added in rust-lang#73345. The lint was added in rust-lang#83127. The tracking issue is rust-lang#83125. The direct motivation for the change is that providing the alternative behaviour is interfering with rust-lang#125174 and follow-on work.

Convert `proc_macro_back_compat` lint to an unconditional error. We still check for the `rental`/`allsorts-rental` crates. But now if they are detected we just emit a fatal error, instead of emitting a warning and providing alternative behaviour. The original "hack" implementing alternative behaviour was added in rust-lang#73345. The lint was added in rust-lang#83127. The tracking issue is rust-lang#83125. The direct motivation for the change is that providing the alternative behaviour is interfering with rust-lang#125174 and follow-on work. r? `@estebank`

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels May 16, 2024

nnethercote marked this pull request as draft May 16, 2024 10:38

nnethercote force-pushed the less-ast-pretty-printing branch from a397a90 to f7ce1ac Compare May 16, 2024 10:40

This comment has been minimized.

Sign in to view

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label May 16, 2024

bors added a commit to rust-lang-ci/rust that referenced this pull request May 16, 2024

Auto merge of rust-lang#125174 - nnethercote:less-ast-pretty-printing…

53af78d

…, r=<try> Print `token::Interpolated` with token stream pretty printing. r? `@ghost`

nnethercote force-pushed the less-ast-pretty-printing branch 2 times, most recently from ca35330 to 9c693ad Compare May 17, 2024 06:41

nnethercote marked this pull request as ready for review May 17, 2024 06:42

rustbot assigned petrochenkov May 17, 2024

This comment has been minimized.

Sign in to view

nnethercote mentioned this pull request May 17, 2024

Remove Nonterminal and TokenKind::Interpolated #124141

Draft

Veykril reviewed May 17, 2024

View reviewed changes

compiler/rustc_parse/src/parser/mod.rs Outdated Show resolved Hide resolved

This comment has been minimized.

Sign in to view

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels May 17, 2024

dtolnay mentioned this pull request May 17, 2024

token::Interpolated dtolnay/anyhow#364

Closed

craterbot added S-waiting-on-crater Status: Waiting on a crater run to be completed. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels May 20, 2024

fmease added a commit to fmease/rust that referenced this pull request May 22, 2024

Rollup merge of rust-lang#125316 - nnethercote:tweak-Spacing, r=petro…

75383b3

…chenkov Tweak `Spacing` use Some clean-up precursors to rust-lang#125174. r? `@petrochenkov`

matthiaskrgr added a commit to matthiaskrgr/rust that referenced this pull request May 23, 2024

Rollup merge of rust-lang#125316 - nnethercote:tweak-Spacing, r=petro…

3c79f0c

…chenkov Tweak `Spacing` use Some clean-up precursors to rust-lang#125174. r? ``@petrochenkov``

craterbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-crater Status: Waiting on a crater run to be completed. labels May 23, 2024

petrochenkov reviewed May 23, 2024

View reviewed changes

compiler/rustc_ast_pretty/src/pprust/state.rs Outdated Show resolved Hide resolved

petrochenkov added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels May 23, 2024

nnethercote mentioned this pull request May 24, 2024

Remove proc-macro back-compat hack for rental #106060

Closed

nnethercote force-pushed the less-ast-pretty-printing branch from 33d6e6f to a322750 Compare May 24, 2024 05:14

nnethercote mentioned this pull request May 27, 2024

Convert proc_macro_back_compat lint to an unconditional error. #125596

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Print `token::Interpolated` with token stream pretty printing. #125174

Print `token::Interpolated` with token stream pretty printing. #125174

nnethercote commented May 16, 2024 •

edited

nnethercote commented May 16, 2024

This comment has been minimized.

bors commented May 16, 2024

nnethercote commented May 17, 2024

This comment has been minimized.

bors commented May 17, 2024

bors commented May 17, 2024

This comment has been minimized.

dtolnay commented May 17, 2024 •

edited

rust-timer commented May 17, 2024

nnethercote commented May 17, 2024

dtolnay commented May 17, 2024

nnethercote commented May 17, 2024

dtolnay commented May 17, 2024

nnethercote commented May 20, 2024

craterbot commented May 20, 2024

craterbot commented May 21, 2024

craterbot commented May 23, 2024

bors commented May 23, 2024

dtolnay commented May 23, 2024

dtolnay commented May 23, 2024

petrochenkov commented May 23, 2024

nnethercote commented May 24, 2024

dtolnay commented May 24, 2024

nnethercote commented May 24, 2024

petrochenkov commented May 24, 2024

nnethercote commented May 27, 2024

Print token::Interpolated with token stream pretty printing. #125174

Are you sure you want to change the base?

Print token::Interpolated with token stream pretty printing. #125174

Conversation

nnethercote commented May 16, 2024 • edited

nnethercote commented May 16, 2024

This comment has been minimized.

bors commented May 16, 2024

nnethercote commented May 17, 2024

This comment has been minimized.

bors commented May 17, 2024

bors commented May 17, 2024

This comment has been minimized.

dtolnay commented May 17, 2024 • edited

rust-timer commented May 17, 2024

Overall result: ❌ regressions - ACTION NEEDED

Instruction count

Max RSS (memory usage)

Cycles

Binary size

nnethercote commented May 17, 2024

dtolnay commented May 17, 2024

nnethercote commented May 17, 2024

dtolnay commented May 17, 2024

nnethercote commented May 20, 2024

craterbot commented May 20, 2024

craterbot commented May 21, 2024

craterbot commented May 23, 2024

bors commented May 23, 2024

dtolnay commented May 23, 2024

dtolnay commented May 23, 2024

petrochenkov commented May 23, 2024

nnethercote commented May 24, 2024

Spacing

rental

dtolnay commented May 24, 2024

nnethercote commented May 24, 2024

petrochenkov commented May 24, 2024

nnethercote commented May 27, 2024

Print `token::Interpolated` with token stream pretty printing. #125174

Print `token::Interpolated` with token stream pretty printing. #125174

nnethercote commented May 16, 2024 •

edited

dtolnay commented May 17, 2024 •

edited