Group 317-5: Stephen Ling, Lewis Clay Ballard, Hongwei Tian, Chester Zhang

Introduction

Hockey is a sport in which two teams play against each other by trying to maneuver a ball or a puck into the opponent’s goal using a hockey stick. We chose to focus on hockey because we were all interested in sports and it is easy to obtain a comprehensive data frame of different sports leagues. In this project, we plan to answer the questions: is there a trend in the birthdays of NHL players? Will taking more shots in a competition increase the goal percentage? What is the relationship between penalty minutes of NHL players and their age? What are the body features (Weight, Height, BMI) of NHL players in different positions? Does the experience in the league relate to the goal percentage of NHL players?

Based on our knowledge of Hockey (before analyzing the data), we expect there is a trend in birthdays of NHL players that the number of players increases when birthdays of players become closer to January in a year; taking more shots in a competition is associated with the goal percentage; the younger players tend to receive more penalty minutes; there are some outstanding body features of NHL players at different positions; finally, experience in the league is associated with the goal percentage of NHL players.

In general, there are significant trends in the features (birthday, BMI, height, weight) of NHL players, and some factors have a strong correlation with NHL players’ performances, but some factors do not.

Background

Data

Source of Data

Background Information

To help better understand our analysis of data, we would like to illustrate some terms in background information part.

Unusual Influencing Factors

Focuses

Analysis

Trend in Birthdays of NHL Players

Body Features of NHL Players

Factors Influencing Goal Percentage

Penalty Minutes

Hypothesis Test

\[ H_0: {p}_{\text{experienced}} = {p}_{\text{not experienced}} \\ H_a: {p}_{\text{experienced}} \neq {p}_{\text{not experienced}} \]

Statistics Values of Hypothesis Test
est goal_sum n_experienced n_not_experienced Attempt p_pool se_pool
-0.0035607 192282 1175564 733724 1909288 0.1007087 0.0004494

Confidence Interval

\[ \text{SE}(\hat{p}_1 - \hat{p}_2) = \sqrt{ \frac{p_1(1-p_1)}{n_1} + \frac{p_2(1-p_2)}{n_2} } \]

Statistics Values for Confidence Interval
group n shots goals
Less Experienced Players 11986 733724 75501
More Experienced Players 12816 1175564 116781

Discussion

Trend in Birthdays

Body Features of NHL Players

Goal Percentage vs. Shooting Rate (Shots Taken per Second)

Penalty Minute vs. Players’ Age

Hypothesis Test

Confidence Interval

New Data

References

  1. The Data Frame We Used.
  2. Some Coding Ideas We Used.
  3. An Introduction and Background Information for Hockey Knowledge.
  4. The History of the NHL.
  5. The Definition and Meaning of BMI.
  6. The Rules of Hockey.
  7. The NHL Selection Rules.
  8. The Effect of Age on Players

  1. https://www.kaggle.com/alexbenzik/nhl-players-statistics↩︎

  2. https://r4ds.had.co.nz/↩︎

  3. https://www.britannica.com/topic/National-Hockey-League↩︎

  4. https://en.wikipedia.org/wiki/Hockey↩︎

  5. https://cdc.gov/healthyweight/assessing/bmi/index.html↩︎

  6. https://cms.nhl.bamgrid.com/images/assets/binary/319997074/binary-file/file.pdf↩︎

  7. https://en.wikipedia.org/wiki/NHL_Entry_Draft↩︎

  8. https://www.degruyter.com/journal/key/JQAS/html↩︎