Map

Contact--2024's Devlog Contact--Website experience 2024's Devlog--2025's Devlog 2024's Devlog--Flying Shooting Game Website experience--2024's Devlog Website experience--Scratchpad Cosmic Horror--Reading Cosmic Horror--Movies/shows Reading--The Waiting room Reading--How to think Movies/shows--Reading Movies/shows--Listening Daily open-source software guide--Content creation workflow Daily open-source software guide--2025's Devlog Content creation workflow--Software alternatives Content creation workflow--NixOs 2025's Devlog--2024's Devlog 2025's Devlog--The Waiting room 2025's Devlog--ReqSnake 2025's Devlog--2022's Devlog Excellent Words--Reading Excellent Words--The word *Bączek* The word *Bączek*--Paintings The word *Bączek*--The Ławka Initiative Piano--Sounds & Melodies Piano--Music Transcribing Sounds & Melodies--Making music on Linux Sounds & Melodies--Custom sequencer Music Transcribing--Sounds & Melodies Music Transcribing--Custom sequencer Flying Shooting Game--2025's Devlog Flying Shooting Game--Scratchpad Scratchpad--2024's Devlog Scratchpad--2025's Devlog Map--Website experience Map--Exocortex Exocortex--Website experience Exocortex--Travel The Waiting room--Movies/shows The Waiting room--2023's Devlog 2023's Devlog--2025's Devlog 2023's Devlog--2022's Devlog Luthier--Making music on Linux Luthier--GPU Synth Making music on Linux--GPU Synth Making music on Linux--Custom sequencer GPU Synth--Resources on audio & DSP GPU Synth--Custom synth ReqSnake--2024's Devlog ReqSnake--About Resources on audio & DSP--Making music on Linux Resources on audio & DSP--Custom sequencer Custom sequencer--GPU Synth Custom sequencer--Custom synth Custom synth--Making music on Linux Custom synth--Resources on audio & DSP Listening--Piano Listening--2022's Devlog Rust--Luthier Rust--Making music on Linux Travel--Map Travel--Paintings How to think--Scratchpad How to think--The Waiting room Paintings--Excellent Words Paintings--How to think The Ławka Initiative--Travel The Ławka Initiative--Paintings Bookmarks--Website experience Bookmarks--Registry-based search engine Registry-based search engine--Website experience Registry-based search engine--Map Decentralisation--Bookmarks Decentralisation--Caddy Caddy--Daily open-source software guide Caddy--Content creation workflow About--2025's Devlog About--2022's Devlog Software alternatives--Daily open-source software guide Software alternatives--Colour scheme NixOs--Software alternatives NixOs--Colour scheme 2022's Devlog--Movies/shows 2022's Devlog--The Waiting room System configuration--NixOs System configuration--Colour scheme Colour scheme--ReqSnake Colour scheme--About Contact Contact 2024's Devlog 2024's Devlog Website experience Website experience Cosmic Horror Cosmic Horror Reading Reading Movies/shows Movies/shows Daily open-source software guide Daily open- source software guide Content creation workflow Content creation workflow 2025's Devlog 2025's Devlog Excellent Words Excellent Words The word *Bączek* The word *Bączek* Piano Piano Sounds & Melodies Sounds & Melodies Music Transcribing Music Transcribing Flying Shooting Game Flying Shooting Game ! Scratchpad Scratchpad ! Map Map Exocortex Exocortex The Waiting room The Waiting room 2023's Devlog 2023's Devlog Luthier Luthier Making music on Linux Making music on Linux GPU Synth GPU Synth ReqSnake ReqSnake Resources on audio & DSP Resources on audio & DSP Custom sequencer Custom sequencer Custom synth Custom synth Listening Listening Rust Rust Travel Travel How to think How to think Paintings Paintings The Ławka Initiative The Ławka Initiative Bookmarks Bookmarks Registry-based search engine Registry-based search engine Decentralisation Decentralisation Caddy Caddy About About Software alternatives Software alternatives NixOs NixOs 2022's Devlog 2022's Devlog System configuration System configuration Colour scheme Colour scheme
You can click on each node, they are links!

This is an automatically generated graph containing all pages on my website, along with the connections calculated using sentence embeddings. If you're interested, you can read the source code.

How is this thing generated?

Explained non-technically

  1. Using an AI-esque tool, I'm generating a mathematical representation of what each page on my site contains, in terms of contents
  2. I'm laying out each page on a graph, so that it is placed close to pages with similar contents and far away from pages with different contents. E.x. programming-related stuff will be grouped together, far away from something travel-related.
  3. I'm drawing links between pages which are the closest. This also generates the "related posts" section at the bottom of each page. The drawn links only serve aesthetic purposes.
  4. Posts are colored depending on their relatedness to 3 topics:
    • More red: art-related
    • More green: computers-related
    • More blue: music-related
    • I'm working on better coloring algorithms based on various gradients

The gory technical details

  1. All of the posts are fed through an embeddings generator, I'm using the Sentence Transformers Python library.
  2. The embeddings are passed to UMAP, a dimensionality reduction algorithm, which takes in multi-dimensional embeddings and projects it down to a 2D representation, which can be drawn as a graph. The projection is done so that the high-level "structure" of the data is preserved (at least that's what the UMAP paper states, I'm not data scientist to argue with the experts).
  3. I'm connecting each post with its top 2 nearest posts (using more clutters up the map).
  4. Coloring is done via calculating cosine similarity between the post content embeddings and embeddings of simple tag-based sentences, such as "music, melodies" or "art, beauty". Currently the gradient is dead-simple, similarity directly affects the R/G/B channel.
  5. graphviz renders the graphs and outputs them as SVGs.

A much better description of a similiar idea on Simon Willison's blog.

Future plans

  1. Make this thing look more "map-alike", whatever that might mean.
  2. Experiment with text clusterisation & dimensionality reduction algorithms, such as:
    • tSNE
    • K-means clustering
    • UMAP
    • Latent Dirichlet allocation
    • DBSCAN
  3. Add #tags. Automatically assign posts to categories with cosine distances.
  4. Introduce color-coding and other visual markers, allowing viewers to make sense of the data based on different metrics:
    • Post tags
    • Links to/from other posts
    • Links outside (to the netsphere)
    • Other connections generated by NLP
  5. Check out KagiSearch/vectordb
  6. Color gradients with Python
  7. Circos