Essential Reading For Engineering Leaders

- Max Bernstein

Search
Guide

tl;dr: A guide to building a simple search engine from scratch using word embeddings (word2vec). It explains embedding words into vectors, computing cosine similarity to rank results, and includes implementations of terminal and web-based interfaces, along with thoughts on evaluation and improvements.

featured in #618

Static Search Trees: 40x Faster Than Binary Search

- Ragnar Groot Koerkamp

Search

tl;dr: “In this post, we will implement a static search tree for high-throughput searching of sorted data. We’ll mostly take the code presented there as a starting point, and optimize it to its limits.”

featured in #580

How To Improve Search Without Looking At Queries Or Results

Search
ML

tl;dr: “Canva celebrated the milestone of 200M monthly active users (MAUs). Our customers have over 30 billion designs on Canva and create almost 300 new designs every second. With this growth rate, the ability for Canva Community members to effectively search for and find their designs, as well as those shared to them by team members, is becoming an increasingly challenging and essential problem to solve.”

featured in #569

Postgres As A Search Engine

- Eric Zakariasson

PostgreSQL
Search

tl;dr: “While semantic search is trendy, good old lexical search is still the backbone. Semantic techniques can improve results, but they work best when added to a solid text-based search foundation. In this post, we’ll explore how to use Postgres to create a robust search engine.”

featured in #544

How Google Handles JavaScript Throughout The Indexing Process

JavaScript
Search

tl;dr: “Understanding how search engines crawl, render, and index web pages is crucial for optimizing sites for search engines. Over the years, as search engines like Google change their processes, it’s tough to keep track of what works and doesn’t—especially with client-side JavaScript.” This post looks at how rendering has evolved and the real-world impact on modern web apps.

featured in #538

6 Hard Problems Scaling Vector Search

- Louis Brandy

Search

tl;dr: Vector search is increasingly used in applications implementing recommendations, image search and natural language processing. It can also be difficult to scale. Here are some questions to ask before you run into these problems.

featured in #518

I Accidentally Built A Meme Search Engine

- Harper Reed

Search
AI

tl;dr: “I built a meme search engine using siglip / CLIP and vector encoding images. It was fun and I learned a lot. I have been building a lot of applied AI tools for a while. One of the components that always seemed the most magical has always been vector embeddings. Word2Vec and the like have straight blown my mind. It is like magic.” Harper describes his journey and shares the results.

featured in #508

How Levels.fyi Built Scalable Search With PostgreSQL

- Tanishq Singh

Search
PostgreSQL

tl;dr: The post outlines how Levels.fyi built a scalable fuzzy search solution using PostgreSQL that handles over 10 million search queries per month with p99 query performance under 20ms, and walks through the key steps.

featured in #504

A Search Engine In 80 Lines Of Python

- Alex Molas

Python
Search

tl;dr: “Ever heard of the “Small Website Discoverability Crisis”? The problem it’s basically that small websites, ones like this one, are impossible to be found using Google or any other search engine. My mission? Making those tiny websites great again. In this post I will walk you through the journey of building a search engine from scratch using Python. This implementation doesn’t pretend to be a production-ready search engine, just a usable toy example showing how a search engine works under the hood.”

featured in #487

The Largest Money-Printing UI Element Ever Made

- Jim Nelson

Trends
Search

tl;dr: “The largest source of money flowing into the world of programming languages comes from Google paying to be the default search engine... Google took in $283bn in revenue in one year. Of that, $49bn went towards “traffic acquisition costs” which includes Google paying other browsers for the preference of being the default search engine.”

featured in #473
