Add HtmlMod #101

brandonchinn178 · 2023-02-03T08:40:03Z

This allows hooking into the rendering of Html. So to use ghc-syntax-highlighting on Haskell code blocks:

let mods =
      mempty
        { onCodeBlock = \origImpl info t ->
            let lang = T.takeWhile (not . isSpace) info in
            if lang `elem` ["haskell", "hs"]
              then htmlRaw $ myRenderFunction $ tokenizeHaskell t
              else origImpl info t
        }
runHtmlMod mods <$> commonmark fp text :: Either ParseError (Html ())

jgm · 2023-02-03T17:49:10Z

I have an idea for a lighter-weight solution to the problem.

PR #102 adds a function transform that will apply a transformation to an HTML tree.

So for this application you'd just need something like

highlightHaskell :: Html a -> Html a
highlightHaskell = transform f
 where
  f (HtmlElement BlockElement "pre" [("class", "language-haskell")]
      (Just (HtmlElement InlineElement "code" codeAttr
       (Just (HtmlText t)))))
       = HtmlRaw (myRenderFunction $ tokenizeHaskell t)
  f x = x

I prefer this to adding a new module, a new type, and the need to add instances of the new type for extensions. That's a lot of additional complexity. What do you think?

brandonchinn178 · 2023-02-03T18:13:31Z

I was just thinking the same. It's good as a quick fix, but it does require you to know the internal details of how Html implements the IsBlock interface.

What do you think about having an intermediate representation

type Nodes = [Nodes]

data Node
  = NodeParagraph Nodes
  | NodeImage Text Text Nodes
  | ...

? That way one could transform the Markdown AST however they want, then run a nodesToHtml :: Nodes -> Text function to render HTML text? Or even fromNodes :: IsBlock il b => Nodes -> b.

I'm asking this because I'm also hitting a second thing where I'd like to do other, more complex transformations, like converting one code block depending on a previous code block

jgm · 2023-02-03T20:13:45Z

That amounts to defining an AST and adding IsBlock and IsInline instances for it, plus a renderer from that AST type to Html. The architecture of commonmark-hs is meant to enable this, but I didn't use an AST in core for two reasons:

performance is likely better going direct to HTML and not constructing the intermediate structure
difficult to see how to make the AST extensible to handle extensions

jgm · 2023-02-03T20:14:04Z

If you don't mind depending on pandoc for rendering, you could just use commonmark-pandoc.

jgm · 2023-02-03T20:14:40Z

Or use commonmark-pandoc (which only depends on pandoc-types) but write your own custom HTML renderer for Pandoc so you don't need to depend on pandoc.

brandonchinn178 · 2023-02-03T20:55:14Z

It wouldnt change existing behavior, so current uses of commonmark ... :: Html () would still go direct Text -> Html. It would just optionally enable someone to do Text -> Nodes -> Html, if someone wants to inspect or transform the AST in the middle.

I have an idea for handling extensions well; will put up a new PR when I finish

jgm · 2023-02-04T05:37:40Z

New idea: what if we generalize transform (from PR #102):

transformM :: Monad m => (Html a -> m (Html a)) -> Html a -> m (Html a)

This would be trivial. Then you could use a State monad to keep track of what you've done in the previous code block, for example.

brandonchinn178 · 2023-02-04T06:18:58Z

Sure, but it still seems a bit too level. Also the biggest issue is that none of the Html constructors are exported, which is what started all of this

jgm · 2023-02-04T18:19:07Z

Yes, if we did this we'd need to export the constructors too. Not sure what you mean by "too level."

I'm open to suggestions, but I am interested in keeping the core library simple, and adding an AST type plus a renderer from the AST to HTML, which would have to be kept in sync with the direct path to HTML, does add quite a bit of complexity. I'm curious about the idea for extensible AST.

brandonchinn178 · 2023-02-04T18:21:45Z

Sorry, I meant "too low level". My idea does get a bit complex, so I don't have any expectations of getting it merged, but I'll put it up for educational purposes 😛

jgm · 2023-02-04T19:52:04Z

I updated my PR to export transformM and the constructors.
Still not sure about merging it.
I'll have a look at your alternative when I have a chance.

Add HtmlMod

870e921

brandonchinn178 closed this Feb 4, 2023

brandonchinn178 deleted the html-mod branch February 4, 2023 19:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add HtmlMod #101

Add HtmlMod #101

Uh oh!

brandonchinn178 commented Feb 3, 2023

Uh oh!

jgm commented Feb 3, 2023 •

edited

Loading

Uh oh!

brandonchinn178 commented Feb 3, 2023 •

edited

Loading

Uh oh!

jgm commented Feb 3, 2023

Uh oh!

jgm commented Feb 3, 2023

Uh oh!

jgm commented Feb 3, 2023

Uh oh!

brandonchinn178 commented Feb 3, 2023

Uh oh!

jgm commented Feb 4, 2023

Uh oh!

brandonchinn178 commented Feb 4, 2023

Uh oh!

jgm commented Feb 4, 2023

Uh oh!

brandonchinn178 commented Feb 4, 2023

Uh oh!

jgm commented Feb 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Add HtmlMod #101

Add HtmlMod #101

Uh oh!

Conversation

brandonchinn178 commented Feb 3, 2023

Uh oh!

jgm commented Feb 3, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brandonchinn178 commented Feb 3, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jgm commented Feb 3, 2023

Uh oh!

jgm commented Feb 3, 2023

Uh oh!

jgm commented Feb 3, 2023

Uh oh!

brandonchinn178 commented Feb 3, 2023

Uh oh!

jgm commented Feb 4, 2023

Uh oh!

brandonchinn178 commented Feb 4, 2023

Uh oh!

jgm commented Feb 4, 2023

Uh oh!

brandonchinn178 commented Feb 4, 2023

Uh oh!

jgm commented Feb 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jgm commented Feb 3, 2023 •

edited

Loading

brandonchinn178 commented Feb 3, 2023 •

edited

Loading