(You should start with the “.Rmd” file that produced what you’re reading: hw4.Rmd. But this HTML version is easier to read.)

STAT 303 Homework 2

Name: Kayley Seow Email: kseow@wisc.edu

You should replace all occurrences of “…” with your

code. If you’d like to replace a single “…” with more

than one line of code (using your own variables), you may

certainly do that. Please don’t remove all my comments, as they

help orient us when we’re reading your code.

My solution to this homework uses information from the first two

handouts and the Vector portion of the third handout.

We’ll grade your homework by running this “.R” file via

source(“hw2.R”)

(in a directory containing the data files) and reading your code.

Name: Kayley Seow

Email: kseow@wisc.edu

rm(list = ls()) # Remove all objects defined in workspace.

Download WI_donations and NY_donations from the class website. (Do

this outside of R. They are donations from Wisconsin and New York to

the 2012 presidential candidates. I got them from files at

www.fec.gov/disclosurep/PDownload.do, a Federal Election Commission

website.)

Read these two files into two vectors, one for each state.

WID<-scan(“WI_tiny.txt”) NYD<-scan(“NY_tiny.txt”)

WID<-scan(“WI_donations.txt”) NYD<-scan(“NY_donations.txt”)

Find the sum of donations from WI and the sum of donations from NY.

WI.sum = sum(WID) NY.sum = sum(NYD) cat(sep=““,”WI sum=“, WI.sum,”“) cat(sep=”“,”NY sum=“, NY.sum,”“)

Find the sum of small donations from each state. I’ve set small=100

(below). Let’s say “small donation” are those such that

-100 <= donation <= 100.

(Note that there are negative donations, which are refunds.)

small = 100 WI.sum.small = sum(WID[(WID >= -100) & (WID <= 100)]) NY.sum.small = sum(NYD[(NYD >= -100) & (NYD <= 100)]) cat(sep=““,”WI.sum.small=“, WI.sum.small,”“) cat(sep=”“,”NY.sum.small=“, NY.sum.small,”“)

Find the sum of big donations from each state. Let’s say “big

donations” are those such that

donation < -100 or donation > 100.

WI.sum.big = sum(WID[(WID > 100) | (WID < -100)]) NY.sum.big = sum(NYD[(NYD > 100) | (NYD < -100)]) cat(sep=““,”WI.sum.big=“, WI.sum.big,”“) cat(sep=”“,”NY.sum.big=“, NY.sum.big,”“)

Find the ratio of the sum of small donations to the sum of all

donations from each state.

WI.small.ratio = WI.sum.small/WI.sum NY.small.ratio = NY.sum.small/NY.sum cat(sep=““,”WI.small.ratio=“, WI.small.ratio,”“) cat(sep=”“,”NY.small.ratio=“, NY.small.ratio,”“)

Find the mean positive donation from each state. (Do not include

zero or negative donations.)

WI.mean.pos = mean(WI.positive) NY.mean.pos = mean(NY.positive) cat(sep=““,”WI.mean.pos=“, WI.mean.pos,”“) cat(sep=”“,”NY.mean.pos=“, NY.mean.pos,”“)

Find the median positive donation from each state. (Do not include

zero or negative donations.)

WI.median.pos = median(WI.positive) NY.median.pos = median(NY.positive) cat(sep=““,”WI.median.pos=“, WI.median.pos,”“) cat(sep=”“,”NY.median.pos=“, NY.median.pos,”“)

Find the largest and second-largest positive donation from each

state. Write code, including a cat() statement or two, to produce

formatted output exactly like this (except that the numbers should

be correct, not 0):

WI.largest=0, WI.second.largest=0

NY.largest=0, NY.second.largest=0

…

Note that your code should work on the current data set, and also on

a new data set. For example, if I ask for a sum, don’t use “17”,

even if the sum is 17, because 17 will probably be wrong for a new

data set. Instead, use sum(…), since this second solution will be

correct even for a new data set.

NY_tiny, which are fake data files for which you can check your

computations by hand. Get your code working on them, and then switch

to the real data files specified above. Your submitted code should

use the real files, “WI_donations” and “NY_donations”.

STAT 303 Homework 2

You should replace all occurrences of “…” with your

code. If you’d like to replace a single “…” with more

than one line of code (using your own variables), you may

certainly do that. Please don’t remove all my comments, as they

help orient us when we’re reading your code.

My solution to this homework uses information from the first two

handouts and the Vector portion of the third handout.

We’ll grade your homework by running this “.R” file via

source(“hw2.R”)

(in a directory containing the data files) and reading your code.

Name: Kayley Seow

Email: kseow@wisc.edu

Download WI_donations and NY_donations from the class website. (Do

this outside of R. They are donations from Wisconsin and New York to

the 2012 presidential candidates. I got them from files at

www.fec.gov/disclosurep/PDownload.do, a Federal Election Commission

website.)

Read these two files into two vectors, one for each state.

Find the sum of donations from WI and the sum of donations from NY.

Find the sum of small donations from each state. I’ve set small=100

(below). Let’s say “small donation” are those such that

-100 <= donation <= 100.

(Note that there are negative donations, which are refunds.)

Find the sum of big donations from each state. Let’s say “big

donations” are those such that

donation < -100 or donation > 100.

Find the ratio of the sum of small donations to the sum of all

donations from each state.

Find the mean positive donation from each state. (Do not include

zero or negative donations.)

Find the median positive donation from each state. (Do not include

zero or negative donations.)

Find the largest and second-largest positive donation from each

state. Write code, including a cat() statement or two, to produce

formatted output exactly like this (except that the numbers should

be correct, not 0):

WI.largest=0, WI.second.largest=0

NY.largest=0, NY.second.largest=0

…

Note that your code should work on the current data set, and also on

a new data set. For example, if I ask for a sum, don’t use “17”,

even if the sum is 17, because 17 will probably be wrong for a new

data set. Instead, use sum(…), since this second solution will be

correct even for a new data set.

I recommend checking that your code works on the files WI_tiny and

NY_tiny, which are fake data files for which you can check your

computations by hand. Get your code working on them, and then switch

to the real data files specified above. Your submitted code should

use the real files, “WI_donations” and “NY_donations”.