@timhutton Improved it to work as a Markdown input to Pandoc: https://pastebin.com/NiFUrHgU
Pandoc command: `pandoc --from markdown_github --to html5 --standalone output2.md --output output2.html --toc`
Top-level
@timhutton Improved it to work as a Markdown input to Pandoc: https://pastebin.com/NiFUrHgU 1 comment
|
@dheadshot @timhutton Ran both parser.py and Pandoc on my extracted Twitter archive (which is ~3.8GB by the way). output.md is 18.9M, output.html is 40.5M.
Note that I joined Twitter in June 2009, which should explain the size.