[OC] 70,000 images of clothes sorted by visual similarity into a 3D point cloud

This visualization represents the Fashion-MNIST dataset, which consists of 70,000 grayscale images across 10 distinct clothing categories (T-shirts, trousers, sneakers, etc.).

I trained a Convolutional Neural Network (CNN) to recognize these items. Instead of just looking at the final classification, I extracted the internal 512-dimensional vector produced by the convolution layers. This vector represents the "features" the AI sees.

To visualize this, I used dimensionality reduction algorithms (t-SNE and UMAP) to project those 512 dimensions down into a 3D cloud. The result is that items the AI finds visually similar drift together, creating natural clusters.

It’s interesting to see how the classes corresponding to Shirt, T-shirt, Pullover, and Coat form overlapping clusters in the latent space due to their visual similarity, whereas footwear classes such as Sneaker and Boot form distinct, dense clusters that are well separated. High-cut sneakers and some boots lie between the two clusters, forming a transition zone.

Take a look at it here: bulovic.at/fmnist

^Reposting because I didn’t include the source and tools last time.

Posted by BeginningDept

View 13 Comments

13 Comments

BeginningDept on December 20, 2025 7:41 pm

Visualization Tool: plotly. Data Source: Fashion-MNIST. Analysis: python.
GoodPlantain3865 on December 20, 2025 7:48 pm

this is beautiful! had no idea you could make 3D graph w/ plotly!
Bendoair on December 20, 2025 7:49 pm

Now this is beautiful data
dbg96 on December 20, 2025 7:54 pm

dude, didn’t you post this like yesterday?
ErosHD on December 20, 2025 7:54 pm

Is this the dataset Sheldon and Penny were creating in that one episode to develop an app to identify shoes?
Yongtre100 on December 20, 2025 7:57 pm

Why does it look like Fr*nce

Also damn that’s awesome.
Neuro-Byte on December 20, 2025 8:44 pm

Given how distinct the clusters are, I wonder if you could achieve the same level of performance with much higher efficiency with a CNN that uses only a 3-dimensional vector.
YakzitNood on December 20, 2025 9:02 pm

Me and gemini are discussing in depth your post

We looked at a visualization of a Convolutional Neural Network (CNN) trained on the Fashion-MNIST dataset.
The Process: The AI takes an image of clothing, processes it through layers to detect edges/textures, and converts it into a 512-dimensional vector (an embedding).
The Visualization: Using t-SNE, these 512 dimensions are projected into 3D space. The result is “clouds” of data where all Trousers cluster together and all Sandals cluster together, proving the AI understands the categories.
The “Crystal” Metaphor: We discussed how the data points form a “crystalline structure.”
The Insight: This is a perfect metaphor for Linear Algebra. Each image is a Vector (a line shooting from zero).
The Structure: The “lines” connecting them are the mathematical relationships (angles and distances) that hold each data point in its specific place relative to the others. The “crystal” is the rigid mathematical logic (manifold) the AI has learned to separate the items

I’m just venturing into ai and have had at most a basic understanding of calculus…

Right now I’m working through in my mind how vectors can create crystal shapes versus golf course putting green shapes, and how they are the same and different. Since i can’t post a direct link to my discussion with gemini, as it includes a Google link shortner, I’d love to dm it you so you can see my nuanced conversation…

Ty so much for the graph
vincenzodelavegas on December 20, 2025 9:03 pm

I actually think this is VERY cool. Imagine looking for a type of shoes online and being offered all those that really look like it, it’s save so much time and effort.
peter303_ on December 20, 2025 9:36 pm

I dont think my closet is that crowded. But getting there.
AppropriateCover7972 on December 20, 2025 10:01 pm

You are a saint for sharing the source. Amazing 😍
FreshPitch6026 on December 20, 2025 10:12 pm

What a surprise. Clothes for a handful body parts can be clustered in a handful clusters. Lol
AUG-mason-UAG on December 20, 2025 10:40 pm

https://preview.redd.it/x5so2ghqsf8g1.jpeg?width=434&format=pjpg&auto=webp&s=5a639e32a2fb688dfdddecb730d2f814e3d48040

Strange bag

Tags

[OC] 70,000 images of clothes sorted by visual similarity into a 3D point cloud

13 Comments