I wrote up a few notes about Alibaba Cloud’s impressive Apache 2 licensed Qwen2-VL vision LLM, which seems to handle tasks like handwriting OCR particularly well
I had to link to the Internet Archive copies of their blog posts because their GitHub organization (which hosted their blog via GitHub pages) mysteriously vanished without a trace some time in the last 24 hours!
Good news: the disappearance is confirmed to be accidental, hopefully they’ll be back soon once GitHub unflag their account https://twitter.com/justinlin610/status/1831489518467477529
@simon the text extraction is impressive, but there’s at least one error: “my sample” instead of “very small.”