Blanket tags for #[doc] code blocks

jsgf · April 25, 2021, 9:22pm

I've been doing some work with a tool which takes a language-independent schema spec, with doc comments, and generates Rust code with corresponding #[doc] attributes on the relevent items.

The problem is that the original source documentation is inconsistently formatted, but markdownish enough. It has indented sections which may or may not be intended to be code blocks, but if they are they're almost certainly not Rust.

This poses two problems:

it generates warnings for text which isn't Rust but is being parsed as Rust
it allows arbitrary code to be injected as doctest into a generated file which is otherwise derived from a non-Turing-complete schema (ie, someone could jokingly put a "nuke the world" code example in a comment which ends up being executed in tests)

I'd like a way to solve both of these by blanket ignoring all code blocks in the doc comments.

In principle I could pre-parse the text as markdown and convert all the apparent code blocks into ```text blocks, but it would be simpler to generate something like:

#[doc(codeblock="text")]
#[doc = "..."]

I can solve 2) with

#[cfg_attr(not(doctest), doc = ...)]

but it doesn't stop cargo doc from complaining about malformed Rust code (which I could suppress by adding doc to the cfg_attr at the cost of losing all the documentation...).

The codeblock="xxx" form is also generic and allows arbitrary code block attributes to be applied.

Or is there some way of handling this with the current set of rustdoc mechanisms?

josh · April 25, 2021, 9:49pm

Having a mechanism to force all code blocks in documentation to be treated as a certain type by default seems reasonable; that would make sense either on the item with the #[doc], or at the top level of a module applying to all documentation in the module.

I don't think any such mechanism exists today, but adding one makes sense.

Also, if you want to give examples of code that includes backquotes, you just need to use more than three backquotes in the surrounding delimiters. For instance, to write:

```text
this is a text block
```

you can write:

````
```text
this is a text block
```
````

CAD97 · April 25, 2021, 10:58pm

With a great markdown engine, you can do the same for inline code blocks as well, as ```` ```text ```` renders as ```text. Unfortunately, many sub-great existing markdown engines do actually support `` inline code blocks, but choke on ```, always treating it as a full code block.

Typing this post and counting ` was interesting.

unrendered

With a great markdown engine, you can do the same for inline code blocks as well, as ````` ```` ```text ```` ````` renders as ```` ```text ````. Unfortunately, many sub-great existing markdown engines do actually support ``` `` ``` inline code blocks, but choke on ```` ``` ````, always treating it as a full code block.

Typing this post and counting `` ` `` was interesting.

<details><summary>unrendered</summary>

<!-- error infinite recursion -->

</details>

ehuss · April 26, 2021, 1:56pm

Issue rustdoc: module flag to prevent code blocks without a lang item from counting as rust · Issue #59867 · rust-lang/rust · GitHub may be relevant to this, as indented code blocks occasionally trips up people using things like bindgen.

jsgf · April 26, 2021, 5:39pm

PR rustdoc - implement #[doc(codeblock_attr = ...)] by jsgf · Pull Request #84597 · rust-lang/rust · GitHub

system · July 25, 2021, 5:40pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Pre-RFC: Documentation Test Names tools and infrastructure	5	747	January 7, 2021
Rustdoc feature request documentation	3	847	November 16, 2021
Separating documentation and code documentation	8	1326	December 28, 2020
How do I modify `rustdoc` (the tool run by `cargo doc`)	12	845	July 4, 2020
Generating a documentation for tests documentation	0	265	August 27, 2024

Blanket tags for #[doc] code blocks

Related topics