R

Blog posts for R programming.

11 Comments Posted on July 30, 2019July 29, 2019 R, Statistics and Data Science

Grades Aren’t Normal

Introduction

This article is also available in PDF form.

A while back someone posted on Reddit about the grading policies of their academic department. Specifically, the department chair made a statement claiming that grades should be Normally distributed with a C average. I responded, claiming that no statistician would ever take the idea that grades follow a Normal distribution seriously. Some asked for context, and I wrote a long response explaining my position. I repeat that argument here, and also give some R code demonstrations showing what curving grades does. Continue reading →

4 Comments Posted on July 24, 2019July 23, 2019 Economics and Finance, R, Research, Statistics and Data Science

CPAT and the Rényi-Type Statistic; End-of-Sample Change Point Detection in R

This article is also available in PDF form.

Introduction

I started my first research project as a graduate student when I was only in the MSTAT program at the University of Utah, at the very end of 2015 (or very beginning of 2016; not sure exactly when) with my current advisor, Lajos Horváth. While I am disappointed it took this long, I am glad to say that the project is finished and I am finally published.

Continue reading →

Leave a comment Posted on May 20, 2019June 8, 2019 Python, R, Statistics and Data Science

What is the probability that in a box of a dozen donuts picked from 14 flavors there’s no more than 3 flavors in the box?

Problem

Dave’s Donuts offers 14 flavors of donuts (consider the supply of each flavor as being unlimited). The “grab bag” box consists of flavors randomly selected to be in the box, each flavor equally likely for each one of the dozen donuts. What is the probability that at most three flavors are in the grab bag box of a dozen?

Continue reading →

Leave a comment Posted on February 25, 2019February 25, 2019 Arkham Horror LCG, R, Statistics and Data Science

Introducing Rank Data Analysis with Arkham Horror Data

Introduction

Last week I analyzed player rankings of the Arkham Horror LCG classes. This week I explain what I did in the data analysis. As I mentioned, this is the first time that I attempted inference with rank data, and I discovered how rich the subject is. A lot of the tools for the analysis I had to write myself, so you now have the code I didn’t have access to when I started.

Continue reading →

2 Comments Posted on February 4, 2019February 4, 2019 R, Research

Organizing R Research Projects: CPAT, A Case Study

Introduction

Months ago, I asked a question to the community: how should I organize my R research projects? After writing that post, doing some reading, then putting a plan in practice, I now have my own answer.

Continue reading →

4 Comments Posted on January 28, 2019January 27, 2019 Economics and Finance, R, Research, Statistics and Data Science

Problems in Estimating GARCH Parameters in R (Part 2; rugarch)

Introduction

Now here is a blog post that has been sitting on the shelf far longer than it should have. Over a year ago I wrote an article about problems I was having when estimating the parameters of a GARCH(1,1) model in R. I documented the behavior of parameter estimates (with a focus on $\beta$ ) and perceived pathological behavior when those estimates are computed using fGarch. I called for help from the R community, including sending out the blog post over the R Finance mailing list.

Continue reading →

2 Comments Posted on December 3, 2018February 11, 2019 Arkham Horror LCG, R, Statistics and Data Science

Making a Profit with Henry Wan in Arkham Horror: The Card Game

Introduction

The Forgotten Age cycle of Arkham Horror is at a close and Fantasy Flight Games already announced the next cycle, The Circle Undone. Not only that, they’ve announced two mythos packs at a rate that… surprised me. A new cycle announcement and two mythos pack announcements in less than two months? Am I the only one who finds the new pace of announcements surprising? Perhaps that means they want to get product out at a faster pace?

Continue reading →

3 Comments Posted on November 19, 2018November 19, 2018 Economics and Finance, R, Statistics and Data Science

The Distribution of Time Between Recessions: Revisited (with MCHT)

Introduction

These past few weeks I’ve been writing about a new package I created, MCHT. Those blog posts were basically tutorials demonstrating how to use the package. (Read the first in the series here.) I’m done for now explaining the technical details of the package. Now I’m going to use the package for purpose I initially had: exploring the distribution of time separating U.S. economic recessions.

Continue reading →

Leave a comment Posted on November 12, 2018October 8, 2018 R, Statistics and Data Science

Time Series and MCHT

Introduction

Over the past few weeks I’ve published articles about my new package, MCHT, starting with an introduction, a further technical discussion, demonstrating maximized Monte Carlo (MMC) hypothesis testing, bootstrap hypothesis testing, and last week I showed how to handle multi-sample and multivariate data. This is the final article where I explain the capabilities of the package. I show how MCHT can handle time series data.

Continue reading →

Leave a comment Posted on November 5, 2018October 8, 2018 R, Statistics and Data Science

Beyond Univariate, Single-Sample Data with MCHT

Introduction

I’ve spent the past few weeks writing about MCHT, my new package for Monte Carlo and bootstrap hypothesis testing. After discussing how to use MCHT safely, I discussed how to use it for maximized Monte Carlo (MMC) testing, then bootstrap testing. One may think I’ve said all I want to say about the package, but in truth, I’ve only barely passed the halfway point!

Continue reading →

Curtis Miller's Personal Website

Curtis Miller's Personal Website

Curtis Miller's personal website, with resume, portfolio, blog, etc.

R

Grades Aren’t Normal

Introduction

CPAT and the Rényi-Type Statistic; End-of-Sample Change Point Detection in R

Introduction

What is the probability that in a box of a dozen donuts picked from 14 flavors there’s no more than 3 flavors in the box?

Problem

Introducing Rank Data Analysis with Arkham Horror Data

Introduction

Organizing R Research Projects: CPAT, A Case Study

Introduction

Problems in Estimating GARCH Parameters in R (Part 2; rugarch)

Introduction

Making a Profit with Henry Wan in Arkham Horror: The Card Game

Introduction

The Distribution of Time Between Recessions: Revisited (with MCHT)

Introduction

Time Series and MCHT

Introduction

Beyond Univariate, Single-Sample Data with MCHT

Introduction