@simon have you tried wllama?
@prem_k ollama? Yeah it's pretty good, they're very on top of adding new models

@simon no, wllama is the WASM binding for llama.cpp, and it runs inference on the GGUF files of models within the browser itself. It's an alternative to Transformers.js (ONNX files) and WebLLM (bin shards).

@prem_k oh fantastic! I've played with https://github.com/mlc-ai/web-llm but I didn't know about the llama.cpp port, that's awesome

@simon #WASM is such an interesting development for web apps that can run locally in the browser, even when offline. On a slightly related note, MotherDuck's 1.5-tier architecture powered by WASM is pretty cool too, especially when you're able to join between tables in your browser and in your cloud in a single SQL query. Wonder what else WASM will bring.

@prem_k I love how easy WASM makes it to run tools like Tesseract - I built this OCR tool using Tesseract.js and PDF.js and it works really well https://tools.simonwillison.net/ocr

@simon wow! Didn't know about tesseract.js. This could potentially remove the need for RPA
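
For anyone who wants to try this, here is a minimal sketch of the wllama flow described above: fetch a GGUF file over HTTP and run a completion entirely in the browser. The config keys, option names, and model URL are assumptions based on wllama's README and may differ between versions, so check the current docs.

```typescript
import { Wllama } from '@wllama/wllama';

// Map of WASM assets wllama needs; in a real app these paths come from
// your bundler or a CDN. Exact keys may differ between wllama versions.
const CONFIG_PATHS = {
  'single-thread/wllama.wasm': '/wllama/single-thread/wllama.wasm',
  'multi-thread/wllama.wasm': '/wllama/multi-thread/wllama.wasm',
};

const wllama = new Wllama(CONFIG_PATHS);

// Placeholder URL: any small GGUF model hosted with CORS enabled.
await wllama.loadModelFromUrl(
  'https://example.com/models/tinyllama-1.1b.Q4_K_M.gguf'
);

// Run a completion in the browser tab; nPredict caps generated tokens.
const output = await wllama.createCompletion('What is WebAssembly?', {
  nPredict: 64,
});
console.log(output);
```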
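
A rough sketch of the MotherDuck 1.5-tier idea mentioned above: DuckDB running in the browser via @duckdb/duckdb-wasm, with a single SQL statement joining a table that lives in the tab against one that lives in the cloud. The bootstrap follows the duckdb-wasm README; the ATTACH step and the my_cloud_db.users table are hypothetical stand-ins for MotherDuck's actual client, which is what really bridges the cloud side.

```typescript
import * as duckdb from '@duckdb/duckdb-wasm';

// Standard duckdb-wasm bootstrap: pick a bundle and spin up its worker.
const bundle = await duckdb.selectBundle(duckdb.getJsDelivrBundles());
const workerUrl = URL.createObjectURL(
  new Blob([`importScripts("${bundle.mainWorker!}");`], { type: 'text/javascript' })
);
const db = new duckdb.AsyncDuckDB(new duckdb.ConsoleLogger(), new Worker(workerUrl));
await db.instantiate(bundle.mainModule, bundle.pthreadWorker);
const conn = await db.connect();

// A table that exists only in this browser tab.
await conn.query(`
  CREATE TABLE local_events AS
  SELECT * FROM (VALUES (1, 'click'), (2, 'view'), (1, 'view')) AS t(user_id, action)
`);

// Hypothetical: MotherDuck's client attaches the cloud database so the
// join below can span browser and cloud. Plain duckdb-wasm cannot do this.
// await conn.query(`ATTACH 'md:my_cloud_db'`);

// One SQL statement joining the in-browser table with a cloud table.
const result = await conn.query(`
  SELECT e.user_id, u.plan, count(*) AS events
  FROM local_events e
  JOIN my_cloud_db.users u USING (user_id)
  GROUP BY 1, 2
`);
console.log(result.toArray());
```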
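
And the Tesseract.js plus PDF.js combination boils down to this pipeline: render a PDF page to a canvas with PDF.js, then hand the canvas to Tesseract.js for OCR. This is a sketch of the general approach, not the actual code behind tools.simonwillison.net/ocr; the worker path and scale factor are illustrative.

```typescript
import * as pdfjsLib from 'pdfjs-dist';
import Tesseract from 'tesseract.js';

// PDF.js needs to know where its worker script lives; this path is an
// assumption that depends on your bundler setup.
pdfjsLib.GlobalWorkerOptions.workerSrc = '/pdfjs/pdf.worker.min.js';

// Render the first page of a PDF to a canvas, then OCR the canvas.
async function ocrFirstPage(pdfUrl: string): Promise<string> {
  const pdf = await pdfjsLib.getDocument(pdfUrl).promise;
  const page = await pdf.getPage(1);

  // Upscale 2x: OCR accuracy improves with higher-resolution input.
  const viewport = page.getViewport({ scale: 2 });
  const canvas = document.createElement('canvas');
  canvas.width = viewport.width;
  canvas.height = viewport.height;
  await page.render({ canvasContext: canvas.getContext('2d')!, viewport }).promise;

  // Tesseract.js accepts a canvas directly; 'eng' selects the English model.
  const { data } = await Tesseract.recognize(canvas, 'eng');
  return data.text;
}
```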