1. Optional: Implement your own Metropolis algorithm

See the bottom of this page for some scaffolding code to get you started. Fill in the appropriate places with the code needed to make the algorithm work. Alternatively, there is an implementation provided on the GitHub repository.

2. German Tank Problem

Recall the German tank problem presented in lecture. Use the following captured serial numbers:

s = c(147, 126, 183, 88, 9, 203, 16, 10, 112, 205)

Your goal is to estimate a single parameter, \(N\), the highest possible serial number (indicating the number of tanks actually produced).

  a. What likelihood function is appropriate? Can you write this as an equation? The likelihood function should be \(\Pr(s \mid N)\).
  b. Translate this likelihood function into R code, and plot the function for varying values of \(N\). (A sketch of this and the remaining parts appears after this list.)
  c. Translate a and b above into a Stan statement for the model block. It will look something like s ~ ...
  d. Add a prior for \(N\) to your Stan program. What prior is reasonable? Bonus: Write a prior and posterior function in R, and plot them as in part b.
  e. Finish the Stan program, then use it to get the MAP estimate for \(N\) using the optimizing() function. What's the MAP estimate?
    • Hint: You will need to use the vector datatype, which we haven’t seen yet. Look it up in the Stan manual to see if you can understand how and where to use it.
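
A minimal sketch covering parts b through e. It treats the captured serials as independent uniform draws on \([0, N]\) (a simplification; sampling without replacement would give a combinatorial likelihood instead) and uses an exponential prior purely as an illustration; the prior choice in part d is yours to justify.

s = c(147, 126, 183, 88, 9, 203, 16, 10, 112, 205)

# part b: the likelihood is zero (-Inf on the log scale) whenever
# N < max(s), and proportional to N^(-k) otherwise
log_lik = function(N, s) {
    if (N < max(s)) return(-Inf)
    -length(s) * log(N)
}
N_grid = seq(max(s), 600)
plot(N_grid, exp(sapply(N_grid, log_lik, s = s)), type = "l",
    xlab = "N", ylab = "likelihood")

# parts c-e: a Stan program as a string; note the vector datatype from
# the hint appearing in the data block
library(rstan)
tank_code = "
data {
    int<lower=1> k;         // number of captured serials
    vector<lower=0>[k] s;   // the serial numbers themselves
}
parameters {
    real<lower=max(s)> N;   // total tanks produced; continuous for now
}
model {
    s ~ uniform(0, N);      // the likelihood from parts a and b
    N ~ exponential(0.001); // an assumed weak prior; revisit in part d
}
"
tank_model = stan_model(model_code = tank_code)
fit_map = optimizing(tank_model, data = list(k = length(s), s = s))
fit_map$par  # the MAP estimate of N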

3. Sampling the posterior

Use sampling() to get 5000 samples from the posterior distribution (a minimal sketch appears below). Alternatively, if you finished part 1 above, try this with your own sampler.
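
A minimal sketch, assuming tank_model and the data list from the sketch in exercise 2; four chains with 1250 post-warmup iterations each give the requested 5000 samples:

fit = sampling(tank_model, data = list(k = length(s), s = s),
    chains = 4, iter = 2500, warmup = 1250)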

  a. Evaluate your samples using mcmc_trace() and mcmc_hist() from the bayesplot package (or implement your own versions). You might need to use as.array or as.matrix to convert the samples from Stan into something that bayesplot can use (see the sketch after this list). Compare the histogram of samples to the posterior density plot you made in 2d.
  b. What summary statistics can we get from the samples? How do your estimates of central tendency (mean, median, etc.) compare with the MAP? What metrics of dispersion might be useful? Can you imagine how you might calculate a credible interval (i.e., a Bayesian confidence interval)?
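
A sketch of both parts, assuming fit is the stanfit object from the sketch above:

library(bayesplot)

# part a: bayesplot wants an array (or matrix) of draws, not a stanfit object
draws = as.array(fit)
mcmc_trace(draws, pars = "N")
mcmc_hist(draws, pars = "N")

# part b: summary statistics computed directly from the samples
samples = as.matrix(fit)[, "N"]
mean(samples)
median(samples)
sd(samples)
quantile(samples, c(0.025, 0.975))  # one way to form a 95% credible interval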

Bonus: discrete uniform parameters

You have probably produced a model in exercise 2 that treats \(N\) as a continuous variable, resulting in estimates that say something like “1457.3 tanks were produced.” This is of course impossible: \(N\) and \(s\) are both discrete. Can you design a model that respects this constraint? How do the results differ?
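
One possible direction, sketched as a direct grid evaluation in R: Stan cannot declare integer parameters, so evaluating the posterior over a grid of integer \(N\) values is a simple way to respect the discreteness (the exponential prior and the upper cutoff here are again only assumptions):

N_int = max(s):2000                   # integer candidates; 2000 is an arbitrary cutoff
log_post = -length(s) * log(N_int) + dexp(N_int, rate = 0.001, log = TRUE)
post = exp(log_post - max(log_post))  # back to the probability scale, safely
post = post / sum(post)               # normalise over the grid
sum(N_int * post)                     # posterior mean under the discrete model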

Scaffolding for implementing the Metropolis algorithm

#' Simple single-parameter metropolis algorithm
#' @param target Target function (returning log unnormalised posterior density);
#'  this function should take the parameter as its first argument and a data list as its second
#' @param initial Initial value of the parameter
#' @param data Data to pass to the target
#' @param iter Number of iterations
#' @param scale Scale for the proposal distribution; defaults to 1
#' 
#' @return A list, with three components: 'chain' is the Markov chain, 'scale' 
#'      is the scale parameter used, and 'accept' is the acceptance rate
metropolis = function(target, initial, data, iter = 5000, scale = 1) {


    ##### OPTIONAL
    ## here, you can run an adaptation phase to set the scale. 
    ## The steps should be a repeat of everything below
    ## The only addition: if you accept the proposal: scale = scale * 1.1 (or some other constant)
    ## if you reject: scale = scale / 1.1
    ## At the end of adaptation, discard the chain; you can't use those samples

    # set up the Markov chain
    # here we preallocate a vector to hold the state of the chain
    chain = numeric(iter)

    # it is important to keep track of how many times we accept the proposals
    # the acceptance rate is an important diagnostic
    accept = 0


    # the first step in the chain gets initial values
    chain[1] = initial

    for(i in 2:iter) {

        ## STEPS FOR THE ALGORITHM
        ## 1. generate a proposal for chain[i]
        ##     this proposal should be drawn from a proposal distribution centred around
        ##     chain[i-1] and using the scale to determine how wide the distribution is
        ##
        ## 2. Compute the acceptance probability of the proposal
        ##     remember that this is the ratio of the probabilities from the target distribution
        ##     target(proposal, data)/target(chain[i-1], data)
        ##     If your target returns a log probability (it should), then you need to convert
        ##     from log-scale to probability scale
        ##     
        ## 3. Do a Bernoulli trial - on a success, accept the proposal; on a failure, reject it
        ## 
        ## 4. Save the result; if you accepted, chain[i] gets the proposal. 
        ##    If not, chain[i] will be the same as chain[i-1]. Don't forget to track acceptances. 
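
        ## A sketch of one possible implementation of steps 1-4, assuming a
        ## normal random-walk proposal (the repository version may differ):
        proposal = rnorm(1, mean = chain[i - 1], sd = scale)
        log_ratio = target(proposal, data) - target(chain[i - 1], data)
        if (runif(1) < exp(log_ratio)) {
            chain[i] = proposal      # accept: the chain moves
            accept = accept + 1
        } else {
            chain[i] = chain[i - 1]  # reject: the chain stays put
        }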

    }


    # note: only iter - 1 proposals are made, since the first state is fixed
    return(list(chain = chain, accept = accept / (iter - 1), scale = scale))
}



log_posterior = function(params, data) {
    ## fill in a log posterior for the problem you are working on here
}

# fill in initial values, data, and your guess at the scale
## fit = metropolis(log_posterior, initial = , data = , scale = )
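
# As an illustration, a hypothetical filled-in log posterior for the German
# tank problem, matching the uniform likelihood and the assumed exponential
# prior from exercise 2; 'data' is a list with the serials in element 's'
log_posterior_tanks = function(params, data) {
    N = params
    if (N < max(data$s)) return(-Inf)  # zero likelihood below the largest serial
    sum(dunif(data$s, min = 0, max = N, log = TRUE)) +
        dexp(N, rate = 0.001, log = TRUE)
}

fit = metropolis(log_posterior_tanks, initial = max(s) + 1,
    data = list(s = s), scale = 50)  # scale = 50 is just a first guess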