mirza.town

24/06/2026

Map. Cheese. Cheese Map!

The need to visualize everything might be consuming me.

While Türkiye might not be the biggest cheese producer or match France’s vast variety, it’s still important to map our cultural treasures.

I scraped Wikipedia for text and images of the cheeses. Then I used the Gemini API’s embedding model to compare the cheeses to one another and obtain a similarity matrix. It didn’t work exactly as intended because regional names overlap so much, but it’s still a good starting point. Currently, I use regex to determine which province a cheese is from and which ingredients are used. The texture is also collected that way.
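For the curious, here is a minimal sketch of what regex-based extraction along these lines could look like. The province list, milk keywords, texture words, and sample sentence are all made up for illustration; they are assumptions, not the actual patterns behind the map.

```python
import re

# Hypothetical lookup lists; the real ones are not shown in this post.
PROVINCES = ["Erzincan", "Kars", "Van", "İzmir", "Hatay"]
TEXTURES = ["soft", "semi-hard", "hard", "crumbly", "stringy"]

# One alternation per field. Longer texture words go first so that
# "semi-hard" is preferred over the shorter "hard".
PROVINCE_RE = re.compile(r"\b(" + "|".join(map(re.escape, PROVINCES)) + r")\b")
TEXTURE_RE = re.compile(
    r"\b(" + "|".join(map(re.escape, sorted(TEXTURES, key=len, reverse=True))) + r")\b",
    re.IGNORECASE,
)
# Matches phrases such as "sheep's milk" or "goat milk" and captures the animal.
MILK_RE = re.compile(r"\b(cow|sheep|goat)(?:'s|’s)?\s+milk\b", re.IGNORECASE)


def extract_fields(text):
    """Return the provinces, milk types, and textures mentioned in text."""
    return {
        "provinces": sorted(set(PROVINCE_RE.findall(text))),
        "milks": sorted({m.lower() for m in MILK_RE.findall(text)}),
        "textures": sorted({t.lower() for t in TEXTURE_RE.findall(text)}),
    }


# Illustrative sentence, not a quote from Wikipedia.
sample = "A crumbly cheese made from sheep's milk, associated with Erzincan."
fields = extract_fields(sample)
print(fields)
```

A keyword approach like this only finds what the lists already contain, and it cannot tell whether a province is mentioned as the origin or just in passing.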

In the future, I want to use an LLM to generate structured 250- to 500-word summaries from grounded data so these embeddings work even better. Right now, some cheeses lack information, while the ones with long summaries tend to score too similarly.
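As a reference for how such similarity scores can be computed, here is a small sketch that builds a similarity matrix from embedding vectors. It assumes cosine similarity as the metric, which is a common choice but not something stated above, and it uses tiny made-up vectors with placeholder names instead of real API output.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def similarity_matrix(embeddings):
    """Return (names, matrix), where matrix[i][j] compares names[i] and names[j]."""
    names = list(embeddings)
    matrix = [
        [cosine_similarity(embeddings[a], embeddings[b]) for b in names]
        for a in names
    ]
    return names, matrix


# Toy 3-dimensional vectors; real embeddings have far more dimensions.
toy = {
    "cheese_a": [0.9, 0.1, 0.0],
    "cheese_b": [0.8, 0.2, 0.1],
    "cheese_c": [0.0, 0.1, 0.9],
}
names, matrix = similarity_matrix(toy)
for name, row in zip(names, matrix):
    print(name, [round(score, 3) for score in row])
```

Each row shows how one cheese compares to all the others, with the diagonal being a cheese compared to itself.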

Anyway, you can check it out here or just below:

Update: I’ve now generated summaries using Google’s search tool to find the texture, taste, ingredients, production steps, and origin for every single cheese in my dataset. My goal was to create a balanced summary that encapsulates every aspect of a cheese without overemphasizing any single trait. If you’re curious about the UMAP version of the embeddings, check it out here or right here:

Did I miss your hometown’s famous cheese?

Just send me an angry email. :^)


Other Mapping & Data Projects

Here is a short summary of the other projects I’ve been working on in the meantime:

Real-time Turkish Railways map

Turkish railways ticket availability statistics

Turkish cuisine taste map

Turkish parliament interruptions scoreboard

Türkiye population density map

Türkiye population density map detail