Explore the fundamental limits of data compression and transmission through interactive simulations of Entropy, Source Coding, and Channel Capacity.
Upon completion of this lab, students will be able to:
Calculate the self-information and entropy of a discrete memoryless source based on symbol probabilities.
Understand the Shannon Source Coding Theorem and design efficient codes using Huffman coding principles.
Analyze the effect of noise on binary transmission and calculate the theoretical channel capacity limit.
Information theory, founded by Claude Shannon, quantifies information. The information content I(x) of an event x with probability P(x) is defined as:
Entropy (H) is the average information content of a source. It represents the uncertainty or randomness.
The theorem states that the average code length L must be greater than or equal to the entropy H(X) for lossless compression.
Huffman coding is a greedy algorithm used to construct an optimal prefix code that approaches this entropy limit.
The Shannon-Hartley theorem defines the maximum rate at which information can be transmitted over a noisy channel without error.
Where C is capacity in bits/sec, B is bandwidth, and SNR is Signal-to-Noise Ratio. For a Binary Symmetric Channel (BSC) with error probability p:
Navigate to Experiment 1.
Observe the generated codes.
Navigate to Experiment 2.
Visualize how probability distribution affects entropy and code efficiency.
Adjust probabilities for symbols A, B, C, D. (Auto-normalized)
Simulate data transmission over a noisy channel and observe the relationship between Error Probability (p) and Mutual Information.
C = 1 - H(p)
A well-structured lab report is essential for documenting your experimental work and demonstrating your understanding of information theory concepts. Follow these guidelines to prepare your report:
Provide a brief overview (150-200 words) summarizing the objectives, key methods used (entropy calculation, Huffman coding, BSC simulation), and main findings regarding source coding efficiency and channel capacity.
Describe the step-by-step methodology:
Present your findings with appropriate tables, graphs, and analysis:
List all textbooks, online resources, and research papers consulted. Use standard citation format (IEEE or APA).
Include screenshots of simulation results, sample calculations, or additional data tables.
Content Accuracy (40%): Correct calculations, proper use of formulas, accurate data recording
Analysis & Discussion (30%): Depth of understanding, proper interpretation of results, connection to theory
Organization & Presentation (20%): Logical structure, clear graphs/tables, proper formatting
Conclusion Quality (10%): Summary of findings, insight into practical applications