RFC: Raw, Non-Nesting Comment Syntax

daniel-pfeiffer · January 11, 2024, 5:05pm

Added thoughts about whether to nest raw in classical block comments in OP.

Added regex to the examples answer.

zackw · January 17, 2024, 2:13am

I was specifically proposing that line comments should nest inside block comments, not anything else. I don't see why you would want comment delimiters to be active inside strings or vice versa; quite the reverse, there's probably any number of existing programs that do things like

let block_comment_start = "/*";
let block_comment_end = "*/";

and

/* The apostrophe in this comment doesn't indicate
   a character literal or lifetime */

(deleted and reposted with proper reply tagging)

tczajka · January 17, 2024, 11:07am

zackw:

I was specifically proposing that line comments should nest inside block comments, not anything else. I don't see why you would want comment delimiters to be active inside strings or vice versa; quite the reverse, there's probably any number of existing programs that do things like
let block_comment_start = "/*";
let block_comment_end = "*/";

The purpose of nesting block comments inside block comments is presumably so that you can comment out a block of Rust code containing such line comments.

For the same reason you might want commenting out a block of Rust code containing string literals to work:

/* commented out for now
let block_comment_end = "*/";
*/

pitaj · January 17, 2024, 2:26pm

Here's an alternative. What if we made the number of * meaningful for block comments?

Them you can add as many as you need to comment out the block in question. Usually this would be just one or two:

/**
let x = "something */";
**/

This would obviously be a breaking syntax change but it could be done across an edition.

bjorn3 · January 18, 2024, 4:43am

/** **/ is already a doc comment.

pitaj · January 18, 2024, 2:57pm

True, you would have to change block doc comments to something else, maybe /*! or something.

Edit: oh wait that would be an inner comment.

Nemo157 · January 18, 2024, 2:58pm

/*! is the equivalent of #![doc already too.

daniel-pfeiffer · January 18, 2024, 7:20pm

Here's an alternative. What if we made the number of * meaningful for block comments?

…

True, you would have to change block doc comments to something else, maybe /*! or something.

I don't mind alternate solutions. But they shouldn't redefine existing syntax. All of /**, /*! as well as /* or */ after /* already have a meaning. Anything else inside of a comment is just that and shouldn't suddenly get meaning.

That's why I proposed new, currently invalid syntax (hence backwards compatible) on the outside of comments. And it aligns with comparable issues in strings. Except in strings it's only comfort, as you can always mask inner characters. Whereas in comments all the examples above are currently impossible to nest. So they cause surprising problems, when commenting out code.

zackw · January 20, 2024, 5:58pm

OK, I see how you got there from what I said and I agree this is potentially just as troublesome, or more so, as line comments containing block comment boundary markers. I also agree we definitely don't want to start looking for strings nested inside block comments -- not only would that make parsing too complicated, I'm certain it would break existing code that has unpaired " characters inside comments. " has multiple uses in natural language text, some of them naturally unpaired -- ditto marks, leading " on multi-paragraph quotations, etc.

I'm still not convinced that looking for line comments inside block comments is a mistake. It seems inconsistent to me that we don't do that now but we do look for block comments inside block comments. (Honestly, I had no idea Rust allowed block comments to nest in the first place! After 25 years of C it would never have occurred to me to try it.)

What if dedicated syntax for commenting out syntactically correct code that looks like an item? Actually, don't we have this right now in the form of

// corrected, see next two posts
#[cfg(any())] mod comment_out {
  // ...
}

?

CAD97 · January 20, 2024, 10:25pm

#[cfg(false)] is an error (you can't use a keyword), but #[cfg(FALSE)] is fairly common to see and #[cfg(any())] is actually unsatisfiable (and also somewhat common). (I personally think true/false should be permitted with the obvious meaning in cfg tests.)

zackw · January 21, 2024, 2:42am

Yeah, after I wrote that I looked at the spec for cfg expressions and realized that it would need to be #[cfg(any())] at present. I agree with you that true and false should be allowed with the obvious meaning.

Still, my larger point stands: #[cfg(any())] mod x { ... } could be used here, basically as #if 0 ... #endif is used in C.

daniel-pfeiffer · January 24, 2024, 6:46pm

Ok, that's a way of ignoring a single item. But it doesn't deal with the woes (and impossibility) of commenting out arbitrary stretches of code. That's what RFC this is about. When juggling wiith related functions, I have even started a comment in the middle of one, up to the corresponding point in the other – though rarely so, admittedly.

system · April 23, 2024, 6:47pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Doc comments are not real comments language design	5	1035	August 15, 2021
Macro expansion optionally in natural order language design	6	631	May 6, 2024
Markdowns as comments documentation	5	1040	January 5, 2023
Not Explicit language design	19	2606	March 25, 2019
Pre-RFC: documentation comments after code language design	8	1277	March 25, 2019

RFC: Raw, Non-Nesting Comment Syntax

Related topics