Option to allow inline HTML #11

neocotic · 2012-12-12T12:20:35Z

When enabled (via the options), the parser should only clean the specified HTML elements (i.e. remove all of their attributes) instead of replacing them with Markdown. When "cleaning" an element, the parser will also "clean" all of its children instead of converting them into Markdown.

This functionality will be extremely useful when using tables.

The option should allow users to specify an array of tags:

md html, inline: ['p', 'table']

In this example it would "clean" all <p> and <table> tags, along with all their children.
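As a rough illustration of the "cleaning" described above, here is a minimal sketch. It is not the library's implementation: the node shape (`tag`, `attrs`, `children`), the `clean` and `toMarkdown` functions, and the stand-in conversion rule for `<strong>` are all hypothetical, and text is not escaped.

```javascript
// Hypothetical node shape (an assumption, not the library's internals):
// a string is a text node, anything else is { tag, attrs, children }.

// Re-emit an element and all of its descendants as HTML, dropping every attribute.
function clean(node) {
  if (typeof node === 'string') return node;
  const inner = node.children.map(clean).join('');
  return `<${node.tag}>${inner}</${node.tag}>`;
}

// Stand-in for the real converter: it only knows <strong>, and otherwise
// renders an element's contents. Tags listed in `inline` are cleaned instead.
function toMarkdown(node, inline) {
  if (typeof node === 'string') return node;
  if (inline.includes(node.tag)) return clean(node);
  const inner = node.children.map((child) => toMarkdown(child, inline)).join('');
  return node.tag === 'strong' ? `**${inner}**` : inner;
}

const tree = {
  tag: 'div', attrs: { id: 'main' }, children: [
    { tag: 'strong', attrs: {}, children: ['Totals'] },
    { tag: 'table', attrs: { class: 'wide' }, children: [
      { tag: 'tr', attrs: {}, children: [
        { tag: 'td', attrs: { style: 'color:red' }, children: [
          { tag: 'strong', attrs: {}, children: ['42'] },
        ] },
      ] },
    ] },
  ],
};

const output = toMarkdown(tree, ['p', 'table']);
console.log(output);
// **Totals**<table><tr><td><strong>42</strong></td></tr></table>
```

Note that the `<strong>` outside the table becomes Markdown, while the one inside the table stays as HTML because children of a cleaned element are cleaned too.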


neocotic · 2013-05-02T13:47:58Z

This won't be included in the next release as it'll require a lot of work to ensure "cleaned" elements (and especially their children) are formatted as you'd expect (e.g. indentation).

jaredly · 2013-08-30T05:08:30Z

I would appreciate having inline html that can't be converted to markdown left intact.

toddb · 2013-12-19T19:10:34Z

+1 @jaredly about leaving raw

Currently, I don't want to change all tables, only some (due to size). I am wondering if the inline option could be CSS-selector based?

neocotic · 2013-12-19T19:19:13Z

@jaredly A lot of elements that can't be converted are just removed (e.g. <object>) or have their contents rendered (e.g. <div>). This is best in the majority of cases as lots of these elements could exist if this tool was used to parse a full page's HTML. This option will be used to provide exceptions instead of changing all functionality. Perhaps another - separate - option could be added for what you're describing, if I'm understanding you correctly.

@toddb Great idea! I'll definitely consider that when looking at implementing this.
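To make the CSS-selector idea concrete, here is a small sketch of how such matching might look. Everything in it is an assumption: the `matches` and `shouldKeepInline` helpers and the node shape are hypothetical, and only `tag`, `#id`, `.class` and simple combinations such as `table.wide` are supported. In a browser DOM, the standard `Element.matches()` method would cover full selector syntax.

```javascript
// Match a node against one simple selector: "tag", "#id", ".class", or "tag.class".
// Assumes node.tag is lowercase and node.attrs is a plain object.
function matches(node, selector) {
  const m = /^([a-z][a-z0-9]*)?(?:#([\w-]+))?(?:\.([\w-]+))?$/i.exec(selector);
  if (!selector || !m) throw new Error(`Unsupported selector: ${selector}`);
  const [, tag, id, cls] = m;
  const attrs = node.attrs || {};
  const classes = (attrs.class || '').split(/\s+/);
  return (!tag || node.tag === tag.toLowerCase()) &&
    (!id || attrs.id === id) &&
    (!cls || classes.includes(cls));
}

// An element is kept as inline HTML if any configured selector matches it.
function shouldKeepInline(node, selectors) {
  return selectors.some((selector) => matches(node, selector));
}

const big = { tag: 'table', attrs: { class: 'report wide' }, children: [] };
const small = { tag: 'table', attrs: {}, children: [] };

console.log(shouldKeepInline(big, ['table.wide']));   // true
console.log(shouldKeepInline(small, ['table.wide'])); // false
```

With something like this, only the tables marked `wide` would be left as HTML and the rest would still be converted.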

toddb · 2013-12-19T19:33:10Z

It does seem to me that the first (experimental) versions of this feature could be 'buyer beware': if you include raw HTML, you should have cleaned it first. In practice, this is what I am doing with tidy and some custom cleaning because my HTML is so raw. (I am doing a one-off migration of content and suspect this isn't your key use case.)

katowulf · 2013-12-19T21:17:32Z

Fwiw, I'm currently using this lib because it strips all HTML which can't be converted. I'm using it to convert Word docs into Markdown, so it's a long process with lots of crazy tags.

So if an option is added to keep inline HTML, I would really appreciate this being optional.

Also, thanks so much for this lib! 🌟

ghost assigned neocotic Dec 12, 2012

neocotic mentioned this issue Dec 19, 2013

allow for table syntax like on github, plus with tests #40

Closed

bergie added a commit to bergie/html.md that referenced this issue Aug 4, 2014

Test for allowed/disallowed HTML, refs neocotic#11

6e0f4f1

bergie added a commit to bergie/html.md that referenced this issue Aug 4, 2014

Initial support for allowing inline IFRAMEs, refs neocotic#11

ff11e27

neocotic removed the CoffeeScript label Nov 18, 2015
