By Ben Pearson/Nov. 2, 2018 3:30 pm EST

The studio is looking for data-driven connections here: if audiences have previously seen action films with male leads, are they more likely to see another action film with a male lead? Using the trailer for Logan, Merlin discovered that these were the most common labels:

But they’re not just concerned with what appears in the trailers – they’re also looking at when it appears, and for how long:

After analyzing the data from the Logan trailer and after Logan was released in theaters, the study produced a table that showed the actual top 20 moviegoer audiences (on the left) next to Merlin’s predictions (on the right):

Temporal sequencing (for example, a long shot of an object versus intermittent short shots) can convey information about the movie type, the movie plot, the roles of the main characters, and the filmmakers’ cinematographic choices. When combined with historical customer data, sequencing analysis can be used to create predictions of customer behavior.

Translation: Merlin wasn’t exactly spot-on, but it was pretty close. It knew that Logan audiences would also be interested in those other films, but it just couldn’t correctly predict how interested they were in seeing them. Sounds like there’s still some more work to be done here, but having access to these very specific data points can point the studio in different directions when it comes to marketing and even green lighting certain projects:

Will studios be able to rely on machine learning to predict audience behavior? Maybe one day, but it looks like we’re not there yet.

The impact of these tools on 20th Century Fox’s marketing and data teams is significant. Instead of depending solely on high level audience survey results, the team can now deploy more precise instruments to determine customer intent. The insights are at least two orders of magnitude more detailed than the previous sets of analytics the studio had been relying on.