Sadly your browser is not compatible with this site

You can use Google Chrome, Mozilla Firefox or Microsoft Internet Explorer (v9 or above) to access the PopCha Movie Network

Movie Network
This is a subset of the movie network produced from data in the mobile PopCha! app. Movies relate to each other as they get picked by PopCha! users (note that movie details are not used to build the graph). For additional information, check the FAQ.
Where does this come from?
The PopCha! mobile app lets users search for movies, view their details, rate them, add them to bags to build personalized movie lists and receive notifications when they are available in theaters, TV or Video on Demand platforms. Its recommendation engine allows proposing users movies they might be interested in. PopCha! users have so far currently rated, added to bags or otherwise showed interest for more than 11,000 movies
And how was this concrete "movie network" made?
This is how we did it:
  1. We selected the movies with the top ratings (the most highly valued movies for PopCha! users). Those were our initial seed, and are the nodes with a red border in the graph. As you can see, they are mostly the "usual suspects", but with also a few non-conventional movies added in.
  2. Then we used our recommendation engine to find the movies most similar to them. Not by looking at their genre, director or cast, but by checking how users interact with them. That is, two movies are considered similar just by checking if users rate them equally, or usually appear together in their bags. Those movies make up the second level in our network, marked with a grey border.
  3. Finally, we rounded up the graph by adding a third level, computed as the movies most similar to the ones in the second level
Ok, but how was the graph actually computed?
For those of you technically oriented, the links between movies were extracted as item-item similarity scores computed in our Collaborative Filtering engine, using all events produced by users of the PopCha! Android/iOS apps. Those links were then fed to a force-directed graph layout in D3.js.
No movie metadata (title, director, cast, whatever) was used or harmed in any way during the production of this graph.
What do the graph elements mean?
Each circle is a movie; it is linked to the set of movies most similar to it according to our engine (and therefore to PopCha! users). The size of the circle represents how it is valued by PopCha! users; the set of 25 most valued movies is highlighted with a red border.
When a movie is selected (hovering or clicking) it is shown as purple in the network. The set of movies linked to it (the ones more similar to it) are also highlighted, this time in blue.
The width of the link between two movies reflect how similar they are (in terms of user response to them); this similarity translates to the force attracting the nodes (so that two very similar movies will tend to stay close, within the constraints imposed by the rest of the forces from the other nodes).
The force-directed layout is an iterative algorithm; each time the page is loaded the rendering starts anew, and converges towards a final stable state. There is some randomness in it, so that no two final states are equal (but relative positioning of movies, and the formed clusters, should be similar).
What is the resulting structure?
The graph is heavily connected: there are connections between many of the movies in the network (note that this is only a partial view of the overall network: in the full graph there are many more connections, and also to the movies that here appear to be at the edges).
There is a strong central cluster containing some of the most important movies (it is so dense that it cannot be clearly seen unless you zoom in). Some of them are deeply connected, check for instance Seven or Forrest Gump. Within that central cluster we can also identify thematic subclusters, such as the Tolkien saga.
Outside the graph core we can also find other areas with semantic meaning (e.g. the set of Star Wars movies, or a group of classic Hitchcock thrillers). There is also a small Western cluster (mostly Clint Eastwood movies) and a Tarantino group.
As a curiosity, there seems to be an exaggerated importance given to movies that are seemingly minor, but that share title with more significant ones (i.e. there is a second Godfather or a second Gladiator in the graph, try to spot them!). These might stem from user confusion when selecting the movie they actually wanted.
Credits
Movie statistics by the PopCha! team
Graph computation & visualization by Paulo Villegas
Movie details from The Movie Database