Welcome to the Big Data Cup 2022

Our goal for the 2022 Big Data Cup is to identify burgeoning analysts in the hockey analytics field and provide them with opportunities, while also pushing forward women’s hockey and research in the broader sphere.

Good luck to all the Big Data Cup-ers!

2022 Big Data Cup

Data Sets

The data sets available for use have been crafted specifically for the Big Data Cup. They are not in their original format; they are a small portion of Stathletes’ data, translated for public consumption. They focus on two key areas of the game that have grown in prominence over the past year:

  1. Women’s hockey play by play: (2018 and 2022 Women’s Hockey Tournament sample and NCAA games)

    Interest in women’s sports has increased during the pandemic. By providing access to data about the game played at its highest level, we hope participants work with this robust data set from international tournaments.

  2. Tracking Data: (2022 Women’s Hockey Tournament sample)

    Player tracking is being introduced across many professional sports leagues. Tracking data generated from the broadcast camera is an easy way to develop advanced statistics for leagues that may have fewer resources.

Access the data here.

NOTE: As stated in the legal agreement, this data cannot be re-sold and is intended to be used for research purposes only.

2022 Timeline and Key Dates

Mentoring Opportunities

We want to foster each participant’s ability to evaluate data from both a process and a technical perspective. To assist with this, we will be scheduling Office Hours for the Big Data Cup (specific times to be announced in April). Subject matter experts in both sport and analysis will be available to answer your questions and provide feedback. Check back for details on ways to sign up.

Interested in participating?

Anyone interested is encouraged to apply and data will be provided publicly to advance hockey research.

There will be 2 categories for participants:

  1. High school & undergraduate - All participants must provide proof of enrollment at the high school or undergraduate level.
  2. Open - This category includes graduate students and anyone interested in hockey research.

Teams can have 1 to 4 participants.

Finalists will be selected* during the week of May 30th and will have the opportunity to present their findings to our panel of NHL executives.

Prizes will be awarded to top qualifiers.

Participation in the Big Data Cup competition is open to any individual regardless of background, experience, previous analysis, or public work.

*Evaluation criteria include, but are not limited to: a demonstrated ability to create actionable insights (for a general manager or head coach working in hockey, and not just research); generating creative ideas, which may mean borrowing and applying ideas from other sports, leveraging domain knowledge, and/or filling gaps created by limitations of public data; a performative understanding of how to work with large data sets.

Submissions

Final submissions are due May 15, 2022 and should be a maximum of 6 pages, including figures (size limit 10GB on submission).

Submissions can be emailed to: bigdatacup@stathletes.com with subject line: Big Data Cup 2022.

Please note that email size is limited to 25MB. To send larger submissions (up to 10GB), use Dropbox, Google Drive, or another file-sharing service and include the link in your submission email.