Email or username:

Password:

Forgot your password?
Top-level
Sci-Fi Girl

@ErictheCerise

Maybe add search.marginalia.nu/ to the list?

Their focus is on finding small, old and obscure websites. 😎

And I'll have to look at the ones in your list that are new to me!

@Pajo_16 @amin @nixCraft

10 comments
Amin Hollon 🇺🇸🇲🇾🇮🇳🇦🇫

@5ciFiGirl @ErictheCerise @Pajo_16 @nixCraft

Yeah, Marginalia 1000% deserves the spot more than me. I took a ton of inspiration from their work and they've actually been around for significantly longer than this recent wave of launches. :)

Sci-Fi Girl

@amin

Cool! Having more options is definitely better!! 😎

@ErictheCerise @Pajo_16 @nixCraft

Amin Hollon 🇺🇸🇲🇾🇮🇳🇦🇫

@5ciFiGirl @ErictheCerise @Pajo_16 @nixCraft

Yep!

It's very similar to Clew in goals (promoting personal, non-commercial websites) and even uses the same ranking function at heart (BM25F) but I did make a number of changes in methodology, for example:

- Most of my webpage discovery is centered around RSS feeds (which is both a great mature technology and means sites with RSS feeds [often personal sites] are gonna be better-treated by the crawler)
- Marginalia still indexes big sites like Wikipedia and StackExchange while I specifically blacklist them from the crawler (helps emphasize small sites and saves significant resources for the crawler; I may do some kind of integration in the future but for now I have bangs if you wanna search them)
- Marginalia does warn about javascript, ads, etc., but I don't think it affects pages' rankings, while I penalize ads and trackers
- I'm really proud of my brand new page weight indicators, which I haven't seen anything like in other search engines before. :)

All that said Clew is definitely still very beta. XD

@5ciFiGirl @ErictheCerise @Pajo_16 @nixCraft

Yep!

It's very similar to Clew in goals (promoting personal, non-commercial websites) and even uses the same ranking function at heart (BM25F) but I did make a number of changes in methodology, for example:

- Most of my webpage discovery is centered around RSS feeds (which is both a great mature technology and means sites with RSS feeds [often personal sites] are gonna be better-treated by the crawler)
- Marginalia still indexes big sites like Wikipedia...

Andrew Zonenberg

@amin @5ciFiGirl @ErictheCerise @Pajo_16 @nixCraft You down rank pages with ads and trackers? If only this was more common...

Amin Hollon 🇺🇸🇲🇾🇮🇳🇦🇫

@azonenberg @5ciFiGirl @ErictheCerise @Pajo_16 @nixCraft

Right??? XD

But all the mainstream search engines are mostly by companies that sell advertising and tracking services so it's not likely in them.

I've found it really effective at fighting SEO, though; if people are trying to hack the system to get you on their site, they probably have ads or tracking. ;)

Eric the Cerise

@5ciFiGirl

No 'maybe' about it, that's an awesome one, and new for me. Check out his 'About' page ( marginalia.nu/marginalia-searc ) ... I want to have his babies.

@Pajo_16 @amin @nixCraft

Amin Hollon 🇺🇸🇲🇾🇮🇳🇦🇫

@5ciFiGirl @ErictheCerise @Pajo_16 @nixCraft

A ton of people link to his "The Small Website Discoverability Crisis" when justifying their own search engines (which is great) but I also find it kinda hilarious that they often don't seem to realize he has his own search engine too. XD

Go Up