Map
%3 Content creation workflow--Daily open-source software guide Content creation workflow--Caddy Daily open-source software guide--Decentralisation Daily open-source software guide--Software alternatives Caddy--Daily open-source software guide Caddy--Decentralisation MacBook Air M4 Adventures--2025's Devlog MacBook Air M4 Adventures--NixOS 2025's Devlog--NixOS 2025's Devlog--Contact 2025's Devlog--ReqSnake 2025's Devlog--2023's Devlog NixOS--Daily open-source software guide NixOS--Software alternatives Bookmarks--Registry-based search engine Bookmarks--Website experience Registry-based search engine--Website experience Registry-based search engine--Decentralisation Website experience--Content creation workflow Website experience--Decentralisation Audio & DSP--Sonification Audio & DSP--Custom synth Sonification--Custom synth Sonification--Making music on Linux Custom synth--Luthier Custom synth--Custom sequencer Luthier--Audio & DSP Luthier--Sonification Custom sequencer--Luthier Custom sequencer--Sounds & Melodies 2024's Devlog--About 2024's Devlog--Contact About--2025's Devlog About--2023's Devlog Contact--About Contact--Exocortex Piano--Music Transcribing Piano--Sounds & Melodies Music Transcribing--Sounds & Melodies Music Transcribing--Rust Sounds & Melodies--Audio & DSP Sounds & Melodies--Rust Flying Shooting Game--2025's Devlog Flying Shooting Game--2022's Devlog 2022's Devlog--2025's Devlog 2022's Devlog--The Waiting room Colour scheme--System configuration Colour scheme--Model-Based Systems Engineering System configuration--NixOS System configuration--Software alternatives Model-Based Systems Engineering--MacBook Air M4 Adventures Model-Based Systems Engineering--ReqSnake GPU Synth--Sonification GPU Synth--Luthier Excellent Words--How to think Excellent Words--Reading How to think--The word *Bączek* How to think--The Waiting room Reading--Movies/shows Reading--Cosmic Horror ReqSnake--MacBook Air M4 Adventures ReqSnake--About Exocortex--Website experience Exocortex--Map Map--Bookmarks Map--Website experience 2023's Devlog--2024's Devlog 2023's Devlog--The Waiting room Travel--The word *Bączek* Travel--The Ławka Initiative The word *Bączek*--Excellent Words The word *Bączek*--Scratchpad The Ławka Initiative--The word *Bączek* The Ławka Initiative--Paintings Decentralisation--Content creation workflow Decentralisation--Bookmarks Paintings--Travel Paintings--The word *Bączek* The Waiting room--2025's Devlog The Waiting room--2024's Devlog Making music on Linux--Audio & DSP Making music on Linux--GPU Synth Scratchpad--2024's Devlog Scratchpad--Contact Movies/shows--2022's Devlog Movies/shows--Cosmic Horror Cosmic Horror--2022's Devlog Cosmic Horror--The Waiting room Rust--Audio & DSP Rust--Sonification Software alternatives--Caddy Software alternatives--MacBook Air M4 Adventures Listening--Movies/shows Listening--Cosmic Horror Storybook--Website experience Storybook--Map Content creation workflow Content creation workflow Daily open-source software guide Daily open- source software guide Caddy Caddy MacBook Air M4 Adventures MacBook Air M4 Adventures 2025's Devlog 2025's Devlog NixOS NixOS Bookmarks Bookmarks Registry-based search engine Registry-based search engine Website experience Website experience Audio & DSP Audio & DSP Sonification Sonification Custom synth Custom synth Luthier Luthier Custom sequencer Custom sequencer 2024's Devlog 2024's Devlog About About Contact Contact Piano Piano Music Transcribing Music Transcribing Sounds & Melodies Sounds & Melodies Flying Shooting Game Flying Shooting Game 2022's Devlog 2022's Devlog Colour scheme Colour scheme System configuration System configuration Model-Based Systems Engineering Model-Based Systems Engineering ! GPU Synth GPU Synth Excellent Words Excellent Words ! How to think How to think Reading Reading ReqSnake ReqSnake Exocortex Exocortex Map Map 2023's Devlog 2023's Devlog Travel Travel The word *Bączek* The word *Bączek* The Ławka Initiative The Ławka Initiative Decentralisation Decentralisation Paintings Paintings The Waiting room The Waiting room Making music on Linux Making music on Linux Scratchpad Scratchpad Movies/shows Movies/shows Cosmic Horror Cosmic Horror Rust Rust Software alternatives Software alternatives Listening Listening Storybook Storybook
You can click on each node, they are links!

This is an automatically generated graph containing all pages on my website, along with the connections calculated using sentence embeddings. If you're interested, you can read the source code.

How is this thing generated?

Explained non-technically

  1. Using an AI-esque tool, I'm generating a mathematical representation of what each page on my site contains, in terms of contents
  2. I'm laying out each page on a graph, so that it is placed close to pages with similar contents and far away from pages with different contents. E.x. programming-related stuff will be grouped together, far away from something travel-related.
  3. I'm drawing links between pages which are the closest. This also generates the "related posts" section at the bottom of each page. The drawn links only serve aesthetic purposes.
  4. Posts are colored depending on their relatedness to 3 topics:
    • More red: art-related
    • More green: computers-related
    • More blue: music-related
    • I'm working on better coloring algorithms based on various gradients

The gory technical details

  1. All of the posts are fed through an embeddings generator, I'm using the Sentence Transformers Python library.
  2. The embeddings are passed to UMAP, a dimensionality reduction algorithm, which takes in multi-dimensional embeddings and projects it down to a 2D representation, which can be drawn as a graph. The projection is done so that the high-level "structure" of the data is preserved (at least that's what the UMAP paper states, I'm not data scientist to argue with the experts).
  3. I'm connecting each post with its top 2 nearest posts (using more clutters up the map).
  4. Coloring is done via calculating cosine similarity between the post content embeddings and embeddings of simple tag-based sentences, such as "music, melodies" or "art, beauty". Currently the gradient is dead-simple, similarity directly affects the R/G/B channel.
  5. graphviz renders the graphs and outputs them as SVGs.

A much better description of a similiar idea on Simon Willison's blog.

Future plans

  1. Make this thing look more "map-alike", whatever that might mean.
  2. Experiment with text clusterisation & dimensionality reduction algorithms, such as:
    • tSNE
    • K-means clustering
    • UMAP
    • Latent Dirichlet allocation
    • DBSCAN
  3. Add #tags. Automatically assign posts to categories with cosine distances.
  4. Introduce color-coding and other visual markers, allowing viewers to make sense of the data based on different metrics:
    • Post tags
    • Links to/from other posts
    • Links outside (to the netsphere)
    • Other connections generated by NLP
  5. Check out KagiSearch/vectordb
  6. Color gradients with Python
  7. Circos