“She’s learning where she fits on the food chain, and I’m not sure you want her to figure that out” [Owen Grady, Jurassic World]
Three databases were used to answer the research questions :
For this question, we need to retrieve gender and age of speakers. This is done with the wikidata parquets. Then, an analysis of these variable can be realized using aggregation, and then statistic and/or distribution.
For this question we hypothesize that for a given film, the more people who talk about the film the more memorable it will be.
Then when the film is made by many men, it will be more memorable.
We wonder if it’s possible to predict movie’s ratings regarding several features.