BL40A2011 Introduction to Cyber-Physical Systems

Answer:

1. Imports & setup

import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import binom
import warnings

warnings.filterwarnings('ignore')
np.random.seed(42)
plt.rcParams["figure.figsize"] = (16, 7)


2. Generate two Binomial arrays (10000 samples each)

snapshots = 10000

n1, p1 = 100, 0.3
X1 = np.random.binomial(n=n1, p=p1, size=snapshots)

n2, p2 = 10000, 0.6
X2 = np.random.binomial(n=n2, p=p2, size=snapshots)

X1.shape, X2.shape


3. Case (a): Empirical histogram vs analytical PMF (n=100, p=0.3)

k1 = np.arange(0, n1 + 1)
pmf1 = binom.pmf(k1, n1, p1)

plt.figure(figsize=(16, 7))
bins1 = np.arange(-0.5, n1 + 1.5, 1)
plt.hist(X1, bins=bins1, rwidth=0.7, density=True, label='Empirical histogram (NumPy samples)')
plt.step(k1, pmf1, where='mid', label='Analytical PMF (scipy.stats.binom)')
plt.xlim([0, n1])
plt.title('Binomial distribution: Empirical histogram and analytical PMF (n=100, p=0.3)', size=18)
plt.xlabel('Number of successes $k$', size=18)
plt.ylabel('$P(X=k)$', size=18)
plt.grid(True)
plt.legend(fontsize=14)
plt.show()


Note for Case (b)

For n=10000, plotting the full support k=0,…,10000 is visually uninformative because almost all probability mass is concentrated near the mean np=6000. Therefore, the plot for Case (b) displays only the range of k values observed in the 10000 samples, which provides a clearer comparison between the empirical histogram and the analytical PMF.
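The concentration claim in this note can be checked numerically. The following is a minimal sketch using only the standard library; the helper `binom_log_pmf` and the window of 4 standard deviations are illustrative choices, not part of the original notebook.

```python
import math

def binom_log_pmf(k, n, p):
    """Log of the Binomial(n, p) PMF at k, using log-gamma to avoid overflow."""
    log_comb = math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
    return log_comb + k * math.log(p) + (n - k) * math.log(1 - p)

n, p = 10000, 0.6
mean = n * p
std = math.sqrt(n * p * (1 - p))

# Window of +/- 4 standard deviations around the mean (the factor 4 is an assumption).
lo = math.floor(mean - 4 * std)
hi = math.ceil(mean + 4 * std)

# Total probability mass of the exact PMF inside the window.
mass = sum(math.exp(binom_log_pmf(k, n, p)) for k in range(lo, hi + 1))

print(f"mean = {mean:.0f}, std = {std:.2f}")
print(f"window [{lo}, {hi}] covers {hi - lo + 1} of {n + 1} support points")
print(f"probability mass inside window = {mass:.6f}")
```

Under these assumptions, a few hundred values of k out of 10001 carry almost all of the probability, which is why restricting the x-axis to the observed range loses essentially nothing.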

4. Case (b): Empirical histogram vs analytical PMF (n=10000, p=0.6)

k2_min, k2_max = X2.min(), X2.max()
k2 = np.arange(k2_min, k2_max + 1)
pmf2 = binom.pmf(k2, n2, p2)

plt.figure(figsize=(16, 7))
bins2 = np.arange(k2_min - 0.5, k2_max + 1.5, 1)
plt.hist(X2, bins=bins2, rwidth=0.7, density=True, label='Empirical histogram (NumPy samples)')
plt.step(k2, pmf2, where='mid', label='Analytical PMF (scipy.stats.binom)')
plt.xlim([k2_min, k2_max])
plt.title('Binomial distribution: Empirical histogram and analytical PMF (n=10000, p=0.6)', size=18)
plt.xlabel('Number of successes $k$', size=18)
plt.ylabel('$P(X=k)$', size=18)
plt.grid(True)
plt.legend(fontsize=14)
plt.show()

(k2_min, k2_max)


5. Brief numerical check (empirical mean/variance vs theoretical)

emp_mean_1, emp_var_1 = X1.mean(), X1.var()
the_mean_1, the_var_1 = n1 * p1, n1 * p1 * (1 - p1)

emp_mean_2, emp_var_2 = X2.mean(), X2.var()
the_mean_2, the_var_2 = n2 * p2, n2 * p2 * (1 - p2)

{
    "case_a": {"emp_mean": emp_mean_1, "the_mean": the_mean_1, "emp_var": emp_var_1, "the_var": the_var_1},
    "case_b": {"emp_mean": emp_mean_2, "the_mean": the_mean_2, "emp_var": emp_var_2, "the_var": the_var_2},
}
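To judge how close the empirical means should be to np, it may help to compute the standard error of the sample mean, sqrt(np(1-p)/N) for N samples. The sketch below is an illustrative addition; the helper `mean_tolerance` and the 3-sigma band are assumptions, not part of the original check.

```python
import math

def mean_tolerance(n, p, num_samples, z=3.0):
    """Return the theoretical mean, variance, and a z-sigma band for the sample mean."""
    mean = n * p
    var = n * p * (1 - p)
    std_err = math.sqrt(var / num_samples)  # standard error of the sample mean
    return mean, var, z * std_err

for label, n, p in [("case_a", 100, 0.3), ("case_b", 10000, 0.6)]:
    mean, var, tol = mean_tolerance(n, p, num_samples=10000)
    print(f"{label}: mean = {mean:.1f}, var = {var:.1f}, "
          f"sample mean expected within +/- {tol:.3f}")
```

An empirical mean that falls well outside such a band would suggest a sampling or parameter error rather than ordinary random fluctuation.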


Conclusion

In both cases, the empirical histograms from 10000 simulated samples closely match the analytical PMFs computed with scipy.stats.binom. This confirms that the NumPy-generated samples follow the Binomial distribution. Case (b) is much more concentrated around np due to the large n, so the observed spread of k values is relatively narrow compared with the full support.

a) Which data process, P_X or P_Y, has a greater level (Definition 4.4)? Why?

Answer: They have the same level, typically Level 1.

Reason: By Definition 4.4, Level 1 processes are symbolic processes involving data directly obtained from physical reality. Both X (global average temperature increase) and Y (temperature observed in city A) originate from physical temperature measurements. Even though X is an average and Y is local, both are still derived from direct physical measurements, hence both correspond to Level 1 data processes.

b) Let Z be the random variable answering: “Is global warming true?” What is the relation between H(Z), H(Z|X), and H(Z|Y)?

Answer (guaranteed):

H(Z∣X) ≤ H(Z),  H(Z∣Y) ≤ H(Z).

Reason: By Proposition 4.2, conditioning on additional knowledge cannot increase uncertainty. Observing X (or Y) can only provide information about Z, so each conditional entropy is at most the unconditional entropy, with equality exactly when Z is independent of the observed variable.
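As a small numerical illustration of this inequality, the sketch below computes H(Z) and H(Z|X) for a toy joint distribution. The probabilities in `joint_zx` and the binarization of X are made up for illustration and are not taken from the exercise.

```python
import math
from collections import defaultdict

def entropy(dist):
    """Shannon entropy in bits of a dict mapping outcome -> probability."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)

def conditional_entropy(joint):
    """H(Z|X) in bits for a joint pmf given as {(z, x): probability}."""
    p_x = defaultdict(float)
    for (_, x), p in joint.items():
        p_x[x] += p
    # H(Z|X) = -sum over (z, x) of p(z, x) * log2(p(z, x) / p(x))
    return -sum(p * math.log2(p / p_x[x]) for (_, x), p in joint.items() if p > 0)

# Hypothetical joint pmf: Z = 1 if "global warming is true", X = 1 if the
# global average increase is "high". These numbers are assumptions.
joint_zx = {(1, 1): 0.45, (1, 0): 0.05, (0, 1): 0.10, (0, 0): 0.40}

# Marginal pmf of Z.
p_z = defaultdict(float)
for (z, _), p in joint_zx.items():
    p_z[z] += p

h_z = entropy(p_z)
h_z_given_x = conditional_entropy(joint_zx)
print(f"H(Z) = {h_z:.4f} bits, H(Z|X) = {h_z_given_x:.4f} bits")
```

In this toy case Z is a fair bit before observing X, and the conditional entropy comes out smaller, as the proposition requires.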

Typical (context-based) strengthening: Since X is a global-average indicator more directly linked to the global-warming claim than a single-city record Y, one would typically expect:

H(Z∣X) ≤ H(Z∣Y),

equivalently I(Z;X) ≥ I(Z;Y). (This comparison relies on the assumption that X is more informative about Z than Y is.)

c) Let W be the random variable answering: “How was the weather of city A during last winter?” What is the relation between H(W), H(W|X), and H(W|Y)?

Answer (guaranteed):

H(W∣X) ≤ H(W),  H(W∣Y) ≤ H(W).

Reason: Again by Proposition 4.2, conditioning on knowledge cannot increase uncertainty.

Typical (context-based) strengthening: Because W is specifically about city A’s winter weather, it is usually more directly related to the local temperature information Y than to the global average X. Thus one typically expects:

H(W∣Y) ≤ H(W∣X),

equivalently I(W;Y) ≥ I(W;X). (This comparison also depends on the assumed relevance of each variable to W.)

d) What can be said about the mutual information I(Z;X), I(Z;Y), I(W;X), I(W;Y), and I(X;Y)?

Using Eq. (4.3), for any variables A and B,

I(A;B) = H(A) − H(A∣B) = H(B) − H(B∣A) ≥ 0.

Therefore:

I(Z;X) = H(Z) − H(Z∣X) ≥ 0,  I(Z;Y) = H(Z) − H(Z∣Y) ≥ 0,

I(W;X) = H(W) − H(W∣X) ≥ 0,  I(W;Y) = H(W) − H(W∣Y) ≥ 0.
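The symmetry and non-negativity of mutual information can be verified on any small joint pmf. The following sketch evaluates I(A;B) in three ways: from H(A) − H(A|B), from H(B) − H(B|A), and from the direct sum of p(a,b) log2[p(a,b)/(p(a)p(b))]. The joint pmf `joint_ab` is an arbitrary illustrative example, and the function names are hypothetical.

```python
import math

def marginals(joint):
    """Return the two marginal pmfs of a joint pmf {(a, b): probability}."""
    p_a, p_b = {}, {}
    for (a, b), p in joint.items():
        p_a[a] = p_a.get(a, 0.0) + p
        p_b[b] = p_b.get(b, 0.0) + p
    return p_a, p_b

def entropy(dist):
    """Shannon entropy in bits of a dict mapping outcome -> probability."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)

def mutual_information_three_ways(joint):
    p_a, p_b = marginals(joint)
    h_a, h_b, h_ab = entropy(p_a), entropy(p_b), entropy(joint)
    h_a_given_b = h_ab - h_b  # chain rule: H(A,B) = H(B) + H(A|B)
    h_b_given_a = h_ab - h_a  # chain rule: H(A,B) = H(A) + H(B|A)
    via_a = h_a - h_a_given_b
    via_b = h_b - h_b_given_a
    # Direct definition as a sum over the joint pmf.
    direct = sum(p * math.log2(p / (p_a[a] * p_b[b]))
                 for (a, b), p in joint.items() if p > 0)
    return via_a, via_b, direct

# Arbitrary 2 x 3 joint pmf (assumed values that sum to 1).
joint_ab = {(0, 0): 0.30, (0, 1): 0.15, (0, 2): 0.05,
            (1, 0): 0.10, (1, 1): 0.15, (1, 2): 0.25}

via_a, via_b, direct = mutual_information_three_ways(joint_ab)
print(f"H(A)-H(A|B) = {via_a:.6f}, H(B)-H(B|A) = {via_b:.6f}, direct = {direct:.6f}")
```

All three values agree up to floating-point error and are non-negative, which matches the identity quoted from Eq. (4.3).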

Typical (context-based) ordering:

X is a global indicator and is usually more informative for Z than a single-city event Y:

I(Z;X) ≥ I(Z;Y).

Y is local and is usually more informative for W than the global average X:

I(W;Y) ≥ I(W;X).

For I(X;Y):

If X and Y were independent, then I(X;Y) = 0. In realistic climate contexts, X (global trend) and Y (local temperatures) are generally not fully independent, but local variability can be large, so I(X;Y) may be positive yet not necessarily large.
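The two regimes for I(X;Y) described here can be illustrated with a toy example: a product (independent) joint pmf gives zero mutual information, while a slightly perturbed joint pmf with the same marginals gives a small positive value. The category labels and probabilities below are assumptions chosen only for illustration.

```python
import math

def mutual_information(joint):
    """I(X;Y) in bits for a joint pmf {(x, y): probability}."""
    p_x, p_y = {}, {}
    for (x, y), p in joint.items():
        p_x[x] = p_x.get(x, 0.0) + p
        p_y[y] = p_y.get(y, 0.0) + p
    return sum(p * math.log2(p / (p_x[x] * p_y[y]))
               for (x, y), p in joint.items() if p > 0)

# Hypothetical marginals: global trend X and city-A winter Y.
p_x = {"warm": 0.7, "cool": 0.3}
p_y = {"mild": 0.5, "cold": 0.5}

# Independent case: the joint pmf is the product of the marginals.
independent = {(x, y): px * py for x, px in p_x.items() for y, py in p_y.items()}

# Weakly dependent case: same marginals, with a little mass shifted.
weak = {("warm", "mild"): 0.38, ("warm", "cold"): 0.32,
        ("cool", "mild"): 0.12, ("cool", "cold"): 0.18}

mi_independent = mutual_information(independent)
mi_weak = mutual_information(weak)
print(f"independent: I(X;Y) = {mi_independent:.6f} bits")
print(f"weakly dependent: I(X;Y) = {mi_weak:.6f} bits")
```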

---

e) What is the problem with the argument used by negationists?

Answer: The argument uses a local, short-term observation (Y: “city A was extremely cold”) to deny a global, long-term claim (Z: “global warming is true”), which is a mismatch of scope and evidence.

Information-theoretic interpretation: A single-city cold event Y typically provides limited information about the global claim Z; that is, I(Z;Y) is small compared with the information carried by global aggregated indicators such as X. Hence, observing Y does not substantially reduce the uncertainty about Z, and it is not a sound basis to reject the global-warming hypothesis. The argument also often reflects selective reasoning (focusing on one extreme local data point rather than the overall process or trend).
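A toy model can make the gap between I(Z;X) and I(Z;Y) concrete. Assume, purely for illustration, that Z is a fair bit and that each observation is a noisy copy of Z that is flipped with some probability: 0.05 for the global indicator X and 0.45 for the single-city observation Y. For such a binary symmetric channel the mutual information is 1 − h(flip probability), where h is the binary entropy function. The flip probabilities are assumptions, not estimates from climate data.

```python
import math

def binary_entropy(q):
    """Binary entropy h(q) in bits."""
    if q in (0.0, 1.0):
        return 0.0
    return -q * math.log2(q) - (1 - q) * math.log2(1 - q)

def info_about_fair_bit(flip_prob):
    """I(Z;O) in bits when Z is a fair bit and O is Z flipped with probability flip_prob."""
    return 1.0 - binary_entropy(flip_prob)

# Assumed noise levels: X tracks Z closely, Y is dominated by local variability.
i_zx = info_about_fair_bit(0.05)
i_zy = info_about_fair_bit(0.45)

print(f"I(Z;X) = {i_zx:.4f} bits, so H(Z|X) = {1 - i_zx:.4f} bits")
print(f"I(Z;Y) = {i_zy:.4f} bits, so H(Z|Y) = {1 - i_zy:.4f} bits")
```

Under these assumed noise levels, observing Y leaves almost the full bit of uncertainty about Z, whereas observing X removes most of it.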

---
