2016

Pipeline for Identifying Potentially Hazardous Asteroids

Potentially hazardous objects (PHOs) are currently defined by parameters that measure an object's potential to make threateningly close approaches to the Earth. To be considered a PHO, an object generally has an Earth minimum orbit intersection distance (MOID) of 0.05 AU or less and an absolute magnitude (H) of 22.0 or brighter (a rough indicator of large size). This project, a collaboration with the Harvard-Smithsonian Center for Astrophysics, aims to develop the full pipeline, which includes data management, algorithmic development and probabilistic predictions...
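As a rough illustration of the two thresholds quoted above, the check can be sketched in a few lines of Python. This is only a minimal sketch, not the project's pipeline: the function name, constant names and sample objects are all hypothetical, and a real pipeline would also have to handle measurement uncertainty in both quantities.

```python
# Thresholds quoted in the text above.
MOID_LIMIT_AU = 0.05   # Earth minimum orbit intersection distance, in AU
H_LIMIT = 22.0         # absolute magnitude; smaller values mean brighter


def is_potentially_hazardous(moid_au: float, abs_magnitude: float) -> bool:
    """Return True when both PHO thresholds are met (hypothetical helper)."""
    return moid_au <= MOID_LIMIT_AU and abs_magnitude <= H_LIMIT


# Hypothetical objects, invented for illustration only: (MOID in AU, H).
candidates = {
    "object_a": (0.03, 19.5),   # close and bright
    "object_b": (0.03, 24.0),   # close but too faint
    "object_c": (0.20, 18.0),   # bright but never close enough
}

flagged = [
    name
    for name, (moid, h) in candidates.items()
    if is_potentially_hazardous(moid, h)
]
print(flagged)
```

Note that "brighter" corresponds to a numerically smaller H, which is why the magnitude test uses `<=`.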


Dynamic Factor Selection for Determining Market Exposure

Market exposure is a key concept in quantitative finance. It is classically measured by estimating a beta coefficient in a linear regression of an asset's returns on the market's returns, where beta (the exposure) expresses how strongly the asset's returns move with the market. Returns with low exposure to the market are desirable, as they are less affected by market downturns. This exposure modeling can be generalized to multiple factors, and the exposures to those factors are used to determine whether a strategy or asset is sufficiently protected from changes in certain risk factors, and to purchase hedges that cancel out this risk exposure.
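The classical single-factor case mentioned above can be sketched as an ordinary least squares fit of the model asset return = alpha + beta × market return + noise. The snippet below is a minimal sketch under that assumption only; the function name and the return series are hypothetical, and it does not represent the dynamic factor selection method developed in the project.

```python
def estimate_beta(asset_returns, market_returns):
    """Return (beta, alpha) from an OLS fit of asset returns on market returns."""
    n = len(asset_returns)
    if n != len(market_returns) or n < 2:
        raise ValueError("need two equal-length series with at least 2 points")

    mean_a = sum(asset_returns) / n
    mean_m = sum(market_returns) / n

    # Slope = covariance(asset, market) / variance(market).
    cov = sum((a - mean_a) * (m - mean_m)
              for a, m in zip(asset_returns, market_returns))
    var = sum((m - mean_m) ** 2 for m in market_returns)
    if var == 0:
        raise ValueError("market returns have zero variance")

    beta = cov / var
    alpha = mean_a - beta * mean_m
    return beta, alpha


# Hypothetical per-period returns, invented for illustration only.
market = [0.01, -0.02, 0.03, 0.00, -0.01]
asset = [0.5 * r + 0.001 for r in market]  # built to have an exposure of 0.5

beta, alpha = estimate_beta(asset, market)
print(round(beta, 6), round(alpha, 6))
```

A beta near zero would indicate returns that barely move with the market. With several factors, the same idea becomes a multiple regression with one coefficient per factor.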

Partner: Delaney Granizo-...


Stochastic Query Optimization and Bias Characterization for Large Scale Text Search

Legendary is a leading film production company, with 43 feature films released, 6 films currently in production and $13 billion in box office revenues in 2015. Identifying the correct search terms to find social media posts about an entity or concept is a highly challenging task. For instance, the word Fargo may refer to a place (in North Dakota), a TV show, a movie, or a bank (Wells Fargo). The student team analyzed 4 million tweets to produce a text-query generation & optimization system. The search index query, constructed from combinations of text tokens constrained to simple...


Restaurant Photo Classification Algorithm and Business Viability Tool

TripAdvisor users write reviews and upload photos from their restaurant visits. These photos can be categorized and analyzed to reveal information about the restaurant's menu, dishes, pricing, etc. The first step in this analysis is the classification of photos into simple, broad groups: food, drinks, menus, and inside and outside photos of the establishment. The students' goal for this project was to build an image classifier using Convolutional Neural Networks and images acquired by the students themselves.

Nester for Design

Nester is a platform where companies can find the best designs for a project, using Kaggle. Kaggle is a platform that hosts machine learning competitions where companies and researchers post data and pose challenges. Data scientists from all over the world compete to answer the questions and to produce the best results, in effect, crowdsourcing the most efficient technique or solution to the questions.

Through Nester, companies post brief design challenges. Designers then propose solutions and vote for other people's projects. Experts refine projects. Companies give feedback...


Negotiation Tool for Airbnb

Airbnb is a global marketplace for apartment rentals that reaches 190 countries and 34,000 cities. On Airbnb, individuals post rental offers and rent their own apartments to other individuals, creating a market parallel to traditional hotel-based offerings. We propose to integrate data from Airbnb with data from other sources, including open data, census information, real estate, information about the district, about the house interiors, social sources such as Instagram and Twitter, etc., so as to develop a new scoring system for Airbnb offers, similar to the hotel star...
