Stochastic Query Optimization and Bias Characterization for Large Scale Text Search

Legendary is a leading film production company, with 43 feature films released, 6 films currently in production, and $13 billion in box office revenues in 2015. Identifying the correct search terms to find social media posts about an entity or concept is a highly challenging task, because many terms are ambiguous. For instance, the word "Fargo" may refer to a place (in North Dakota), a TV show, a movie, or a bank (Wells Fargo). The student team analyzed 4 million tweets to produce a text-query generation and optimization system. The system constructs a search index query from combinations of text tokens constrained to simple logical operators. The resulting query returns a highly pure set of text documents relevant to a property, such as a film, and the system also provides a characterization of the query bias.
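The idea of searching over token combinations for a high-purity query can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the team's actual system: the six-document toy corpus, the query form (any "include" token present AND no "exclude" token present), the purity-times-recall objective, and the simple stochastic hill climbing used as the optimizer. The function names `matches`, `score`, and `optimize` are hypothetical.

```python
import random

# Toy labeled corpus, invented for illustration: (text, is_relevant_to_the_show_or_film).
DOCS = [
    ("watching fargo season two tonight", True),
    ("fargo the movie is a coen brothers classic", True),
    ("wells fargo bank raised my fees", False),
    ("driving through fargo north dakota", False),
    ("new fargo episode was great", True),
    ("wells fargo stock is up", False),
]


def matches(query, text):
    """A query is (include, exclude): match if any include token is
    present AND no exclude token is present."""
    include, exclude = query
    tokens = set(text.split())
    return bool(tokens & include) and not (tokens & exclude)


def score(query, docs):
    """Return (purity, recall) of the documents the query retrieves.
    Purity is the fraction of retrieved documents that are relevant."""
    hits = [relevant for text, relevant in docs if matches(query, text)]
    total_relevant = sum(relevant for _, relevant in docs)
    if not hits or total_relevant == 0:
        return 0.0, 0.0
    return sum(hits) / len(hits), sum(hits) / total_relevant


def optimize(docs, seed_token, steps=200, rng_seed=0):
    """Stochastic hill climbing (an assumed optimizer): repeatedly toggle a
    random token in the exclude set and keep the change only if
    purity * recall strictly improves."""
    rng = random.Random(rng_seed)
    vocab = sorted({t for text, _ in docs for t in text.split()} - {seed_token})
    best = (frozenset({seed_token}), frozenset())
    best_score = score(best, docs)
    for _ in range(steps):
        include, exclude = best
        token = rng.choice(vocab)
        candidate = (include, exclude ^ {token})  # toggle membership
        cand_score = score(candidate, docs)
        if cand_score[0] * cand_score[1] > best_score[0] * best_score[1]:
            best, best_score = candidate, cand_score
    return best, best_score


if __name__ == "__main__":
    query, (purity, recall) = optimize(DOCS, "fargo")
    print("exclude tokens:", sorted(query[1]))
    print("purity:", purity, "recall:", recall)
```

In this toy setting the bare query `fargo` retrieves all six documents, only half of which are relevant; excluding tokens such as `wells` and `north` removes the bank and place senses. A rough notion of query bias could be read off the same numbers, for example by checking which relevant documents an exclusion also discards, though how the original system characterizes bias is not specified here.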