Docx to HTML - How to customize the conversion
Posted: Thu Sep 22, 2011 8:46 am by airefuerte
Does anybody know how to customize the conversion? In my case I need to exclude a few HTML tags in order to support the EDGAR standard
http://www.sec.gov/info/edgar/ednews/edhtml.htm (something like HTML 4.0, but without a few tags).
Regards
Re: Docx to HTML - How to customize the conversion
Posted: Thu Sep 22, 2011 11:03 pm by jason
It depends on what you want to do, but start by looking at src/main/java/org/docx4j/convert/out/html/docx2xhtmlNG2.xslt
You can also customise the conversion without altering the XSLT: include content controls in your docx, and extend SdtTagHandler to specify what to do with a content control matching a given tag.
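If all you need is to drop a handful of tags from the generated output, one more option (not one of the two approaches above, just a rough sketch) is to post-process the XHTML with a small identity transform. The code below uses only the JDK's javax.xml.transform classes and no docx4j API. The class name StripTags and the element names span and iframe are placeholders I made up for illustration; they are not the real EDGAR exclusion list, so substitute whatever the EDGAR document says is disallowed. It also assumes the converter output is well-formed XML.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Hypothetical helper, not part of docx4j.
class StripTags {

    // Identity transform plus one override: elements whose local name is in
    // the list are unwrapped (the tag is dropped, its children are kept).
    // local-name() is used so the match works whether or not the elements
    // are in the XHTML namespace. "span" and "iframe" are placeholders.
    static final String XSLT =
        "<xsl:stylesheet version='1.0' xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>"
      + "<xsl:output method='xml' omit-xml-declaration='yes'/>"
      + "<xsl:template match='@*|node()'>"
      + "<xsl:copy><xsl:apply-templates select='@*|node()'/></xsl:copy>"
      + "</xsl:template>"
      + "<xsl:template match=\"*[local-name()='span' or local-name()='iframe']\">"
      + "<xsl:apply-templates select='node()'/>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    // Applies the stylesheet to a well-formed XHTML string and returns the result.
    static String strip(String xhtml) throws Exception {
        Transformer t = TransformerFactory.newInstance()
            .newTransformer(new StreamSource(new StringReader(XSLT)));
        StringWriter out = new StringWriter();
        t.transform(new StreamSource(new StringReader(xhtml)), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        // The span wrapper is removed; its text content stays inside the paragraph.
        System.out.println(strip("<p>Hello <span class=\"x\">world</span></p>"));
    }
}
```

The same override template could instead be added to your own copy of the XSLT mentioned above, if you would rather do it in a single pass. To delete an element together with its content, leave the body of the override template empty.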