Notes & highlights
Published: · 32:23
What happens when private search monopolies lock away our shared knowledge?
Discover how AI, embeddings, and Aaron Swartz’s vision can set it free.
Join Kush at the Vancouver AI Community Meetup as he traces search from 200 BC scroll catalogs to today’s embedding-driven engines.
------
In this bada$$ Vancouver AI Community Meetup talk, Kush unpacks:
The hidden history of search – from ancient scroll catalogs to today’s high-dimensional embeddings.
Why monopolies broke the public web – how private indexes (Google, closed APIs) stole our shared knowledge.
Aaron Swartz’s Open Access legacy – lessons from the Guerilla Open Access Manifesto and his fight for information freedom.
How AI & embeddings can reclaim the web – practical demos of semantic search and a roadmap for truly public indexing.
The evolution of search – from manual taxonomy in the Library of Alexandria to Boolean operators, TF-IDF, and BM25.
Why exact keyword matching still fails you, even with typo correction and edit distance.
How semantic search & embeddings map meaning into high-dimensional “latent space” for smarter retrieval.
A live demo showing how concepts cluster by genre—dark comedy, sci-fi, action—and why small datasets hide the magic of PCA visualizations.
The hidden crisis of modern search: monopolized private indexes, closed APIs (Reddit, Twitter), and the decline of open communities.
A call to action inspired by Aaron Swartz’s Open Access manifesto: we must build and defend truly public indexes.
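To make the keyword-matching point concrete, here is a minimal Python sketch of Levenshtein edit distance. It is not code from the talk: the function name, the query, and the candidate words are all illustrative assumptions. It shows that typo tolerance ranks words by spelling, not by meaning.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn `a` into `b`."""
    # prev[j] holds the distance between the processed prefix of `a`
    # and the first j characters of `b`.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or match)
            ))
        prev = curr
    return prev[-1]


# Hypothetical query and candidates, chosen only for illustration.
query = "car"
candidates = ["cat", "cart", "automobile"]

# Rank candidates the way a typo-tolerant keyword matcher might.
ranked = sorted(candidates, key=lambda word: levenshtein(query, word))
for word in ranked:
    print(word, levenshtein(query, word))
```

"cat" and "cart" land one edit away from "car", while "automobile", the only candidate that means the same thing, ranks last. That gap between spelling and meaning is what embedding-based search is meant to close.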
👍 If you found this helpful, like and subscribe!
🔔 Hit the bell to catch more Vancouver AI community deep dives.
📍 Chapters:
00:00 – Intro & Meet Kush
01:20 – Manual Taxonomies & Ancient Catalogs
03:45 – Digital String Matching & Levenshtein Distance
05:30 – Boolean Search & TF-IDF Evolution
07:50 – Enter Semantic Search & Embeddings
10:15 – Visualizing Latent Space with PCA
12:00 – Why Search Still Sucks Today
14:10 – Closed APIs & Private Index Monopolies
16:00 – Aaron Swartz & the Fight for Open Access
18:00 – Q&A and Next Steps