2021bootcamp/fitting-basic/paramfit1_soln.py at master · capprogram/2021bootcamp · GitHub

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
"""
Code for Tutorial on Parameter Estimation by Maximum Likelihood Fitting
Modified by Kathleen Eckert from an activity written by Sheila Kannappan,
then further modified by Sheila Kannappan November 2019.
"""

import numpy as np
import matplotlib.pyplot as plt
import numpy.random as npr

# Generate fake data set to start with
alphatrue=2. # slope
betatrue=5.  # intercept
errs=2.5 # sigma (amplitude of errors)

narr=50 # number of data points
xvals = np.arange(narr) + 1. # xvals range from 1-51
yvals = alphatrue*xvals + betatrue + npr.normal(0,errs,narr) # yvals
# What aspect of a real data set does npr.normal emulate here?
'''
It's emulating random measurement errors with amplitude sigma=errs.
'''
# What assumption is made here in the unweighted least squares approach?
'''
The code is assuming that these errors have the same amplitude for all
data points, which is key to the unweighted least squares approach.
'''

# Plot fake data
plt.figure(1)
plt.clf()
plt.plot(xvals,yvals,'b*',markersize=10)
plt.xlabel("x-values")
plt.ylabel("y-values")

# Determine slope & y-intercept using least squares analytic solution

alphaest=(np.mean(xvals)*np.mean(yvals)-np.mean(xvals*yvals)) / \
   (np.mean(xvals)**2 -np.mean(xvals**2)) #  from derivation
betaest= np.mean(yvals) - alphaest * np.mean(xvals) # calculate estimate of y-intercept from derivation

# Why must we use alphaest rather than alphatrue above?
'''
These formulae represent our best *estimates* of the slope and intercept from
the *data*. We do not a priori know the true slope when working with real
data, so we cannot use our knowledge of it in this analysis.
'''

# The MLE values of the slope and y-intercept are equivalent to the "least
# squares" fit results.
print("analytic MLE slope = %0.7f" %alphaest)
print("analytic MLE y-intercept = %0.7f" %betaest)

# Overplot the MLE ("best fit") solution
yfitvals=xvals*alphaest+betaest
plt.plot(xvals,yfitvals,'r')

# Compute analytic uncertainties on slope and y-intercept

alphaunc = np.sqrt(np.sum((yvals - (alphaest*xvals+betaest))**2) / ((narr-2.)*(np.sum((xvals-np.mean(xvals))**2))))
betaunc = np.sqrt((np.sum((yvals - (alphaest*xvals+betaest))**2) / (narr-2.)) * ((1./narr) + (np.mean(xvals)**2)/np.sum((xvals-np.mean(xvals))**2)) )

print("analytic MLE uncertainty on alpha is %0.7f" % (alphaunc))
print("analytic MLE uncertainty on beta is %0.7f" % (betaunc))

print("fractional uncertainty on alpha is %0.7f" % (alphaunc/alphaest))
print("fractional uncertainty on beta is %0.7f" % (betaunc/betaest))
# Which parameter has larger fractional uncertainty?
'''
The intercept beta has much larger fractional uncertainty.
'''

# Most MLE problems do not have analytic solutions.
# Use `np.polyfit` to compute the slope and y-offset for the same fake
# data using numerical maximum likelihood estimation.

# third parameter is order of fit, 1 for linear
pfit = np.polyfit(xvals, yvals, 1) # returns coeff. of highest order term first

print("               ") # whitespace for readability
print("np.polyfit MLE slope = %0.7f" %pfit[0])
print("np.polyfit MLE y-intercept = %0.7f" %pfit[1])

# Do you get the same result as in analytic case?
'''
Yes, the result is the same.
'''

# calculate and print parameter uncertainties from the diagonal terms
# of the covariance matrix, which is the inverse of the Hessian matrix
# and can be computed in np.polyfit by setting cov='True'

pfit,covp = np.polyfit(xvals, yvals, 1, cov='True') # returns coeff. of highest power first

print("slope is %0.7f +- %0.7f" % (pfit[0], np.sqrt(covp[0,0])))
print("intercept is %0.7f +- %0.7f" % (pfit[1], np.sqrt(covp[1,1])))

# How are the errors related to the terms of the covariance matrix?
'''
The error estimates come from the square root of the corresponding diagonal terms.
'''
# Are the uncertainties the same as in the analytic solution?
'''
Depending on your version of numpy, they may be identical or may agree closely with the numerical uncertainties being slightly larger..
'''

# Try changing N to 10 or 100 in the code above. Print out the fractional
# difference in the analytically and numerically derived uncertainties.

fracdiffalpha = (np.sqrt(covp[0,0]) - alphaunc)/alphaunc
fracdiffbeta = (np.sqrt(covp[1,1]) - betaunc)/betaunc

print("fractional difference in uncertainty for slope %0.7f and intercept %0.7f" % (fracdiffalpha,fracdiffbeta))

# What happens to the uncertainties if you increase/decrease the number of points used in the fit (try N=100, N=10) ?
'''
The uncertainties get larger for N=10, smaller for N=100.
'''
# What happens to the percentage difference between the analytical and numerical methods for computing the uncertanties if you increase/decrease the number of points (try N=100, N=10)?
'''
The answer depends on your version of numpy. Some versions give the exact same answer as the analytic calculation, while others show a small discrepancy. If there is a discrepancy, the percentage difference between methods gets larger for N=10, smaller for N=100.
'''