The Transistor Revolution
On December 23, 1947, at Bell Labs, William Shockley, John Bardeen, and Walter Brattain demonstrated the first working transistor. This tiny device—a few centimeters of germanium, gold contacts, and a paper clip—would become the foundation of the digital age.
Today, your smartphone contains approximately 15 billion transistors. A modern data center GPU like the NVIDIA H100 packs 80 billion transistors onto a chip smaller than your thumbnail. Each transistor switches on and off billions of times per second, performing the computations that power our digital world.
But what exactly is a transistor? How does a piece of silicon "think"? Let's go from atoms to architecture.
A transistor is a semiconductor device that acts as an electrically-controlled switch or amplifier. By applying a small voltage to one terminal, you control a larger current flow between two other terminals. This simple principle enables all digital computation.
Semiconductor Physics
To understand transistors, we must first understand semiconductors—materials that conduct electricity better than insulators but worse than metals.
Band Theory: Why Materials Conduct
In solid materials, electrons occupy energy bands. Two bands matter most:
- Valence Band: The highest energy band that's fully occupied at absolute zero
- Conduction Band: The next higher band where electrons can move freely
The band gap is the energy difference between these bands:
Material Band Gap (eV) Classification
─────────────────────────────────────────────────
Copper (Cu) 0.0 Conductor
Silicon (Si) 1.12 Semiconductor
Germanium (Ge) 0.67 Semiconductor
Gallium Arsenide 1.43 Semiconductor
Diamond (C) 5.5 Insulator
Silicon Dioxide 9.0 Insulator
// At room temperature (kT ≈ 0.026 eV), thermal energy can
// excite electrons across small band gaps but not large onesSilicon: The Goldilocks Element
Silicon (atomic number 14) has four valence electrons. In a crystal, each silicon atom shares electrons with four neighbors, forming covalent bonds in a diamond cubic structure.
At absolute zero, all electrons are locked in bonds—silicon is an insulator. But at room temperature, thermal energy (~0.026 eV) occasionally kicks an electron free, leaving behind a "hole"—a positive charge carrier.
For silicon at 300K: n_i ≈ 1.5 × 10¹⁰ carriers/cm³. This sounds like a lot, but compare it to copper's 8.5 × 10²² free electrons/cm³. Pure silicon is a poor conductor.
Doping: Engineering Conductivity
The magic happens when we intentionally add impurities—a process called doping.
N-TYPE DOPING (Donor Impurities)
─────────────────────────────────
Add elements with 5 valence electrons:
- Phosphorus (P)
- Arsenic (As)
- Antimony (Sb)
Result: Extra electron per dopant atom
Majority carriers: Electrons (negative)
Minority carriers: Holes
P-TYPE DOPING (Acceptor Impurities)
─────────────────────────────────
Add elements with 3 valence electrons:
- Boron (B)
- Gallium (Ga)
- Indium (In)
Result: Missing electron (hole) per dopant atom
Majority carriers: Holes (positive)
Minority carriers: ElectronsTypical doping concentrations range from 10¹⁵ to 10²⁰ atoms/cm³—one impurity per million to one per ten thousand silicon atoms. This dramatically increases conductivity while maintaining semiconductor properties.
The P-N Junction
When P-type and N-type semiconductors meet, something remarkable happens at the interface.
Junction Formation
At the instant of contact:
- Electrons from N-region diffuse into P-region
- Holes from P-region diffuse into N-region
- Diffusing carriers recombine, leaving behind fixed ions
- These ions create an electric field opposing further diffusion
Equilibrium forms a depletion region—a zone stripped of mobile carriers, with a built-in voltage (about 0.7V for silicon).
Forward and Reverse Bias
Forward Bias (positive to P, negative to N): External voltage opposes the built-in field, shrinking the depletion region. Above ~0.7V, current flows exponentially:
Where V_T = kT/q ≈ 26mV at room temperature, and n is the ideality factor (1-2).
Reverse Bias (negative to P, positive to N): External voltage reinforces the built-in field, widening the depletion region. Only a tiny leakage current flows (until breakdown).
Bipolar Junction Transistors
The Bipolar Junction Transistor (BJT) sandwiches one semiconductor type between two of the other, creating either NPN or PNP structures.
NPN Transistor Operation
Collector (C)
│
┌──────┴──────┐
│ N-type │ (lightly doped, wide)
│ region │
└──────┬──────┘
┌──────┴──────┐
│ P-type │ (thin base)
│ base │◄── Base (B)
└──────┬──────┘
┌──────┴──────┐
│ N-type │ (heavily doped)
│ region │
└──────┬──────┘
│
Emitter (E)The key insight: the base is very thin (less than 1 micrometer) and lightly doped.
How It Works
- Forward bias the base-emitter junction (V_BE ≈ 0.7V): Electrons flood from emitter into base
- Reverse bias the collector-base junction: Creates a strong electric field pulling electrons toward collector
- Most electrons transit the thin base without recombining: They're swept into the collector
- Small base current controls large collector current: Current gain β = I_C/I_B ≈ 100-300
I_E = I_C + I_B
I_C = β × I_B
I_C = α × I_E, where α = β/(β+1) ≈ 0.99
Region B-E Junction C-B Junction Use Case
───────────────────────────────────────────────────────
Cutoff Reverse Reverse Switch OFF
Active Forward Reverse Amplifier
Saturation Forward Forward Switch ON
Reverse Reverse Forward (Rarely used)
// For digital logic, we toggle between Cutoff and Saturation
// For analog amplification, we bias in the Active regionTransistor Simulator
COMING SOONThis interactive tool is being developed. Check back soon for a fully functional simulation!
MOSFETs: The Modern Workhorse
While BJTs were revolutionary, the Metal-Oxide-Semiconductor Field-Effect Transistor (MOSFET) dominates modern electronics. Your CPU uses MOSFETs almost exclusively.
MOSFET Structure
How MOSFETs Work
Unlike BJTs which are current-controlled, MOSFETs are voltage-controlled.
- Gate voltage = 0: Two back-to-back P-N junctions block current between source and drain
- Apply positive gate voltage: Electric field repels holes from the substrate surface
- Above threshold voltage (V_th): Electrons accumulate at the surface, forming a conductive inversion layer (channel)
- Current flows: Source and drain are now connected by an N-type channel
Type Substrate Source/Drain Channel Turns ON when
─────────────────────────────────────────────────────────────
NMOS P-type N+ Electrons V_GS > V_th (positive)
PMOS N-type P+ Holes V_GS < V_th (negative)
// PMOS is slower (lower hole mobility) but essential for CMOS
// CMOS = Complementary MOS: uses both NMOS and PMOSWhy MOSFETs Dominate Digital Logic
- Near-zero gate current: Voltage control means almost no power to switch states
- High input impedance: 10¹² ohms or more
- Scalability: Can be made extremely small (down to ~3nm today)
- CMOS efficiency: No static power consumption in complementary circuits
- Manufacturing: Self-aligned gate process is highly manufacturable
From Transistors to Logic Gates
Now comes the conceptual leap: using transistors as switches to compute.
The CMOS Inverter
The simplest logic gate, built with one NMOS and one PMOS transistor:
V_DD (+)
│
─┴─
│ │ PMOS
│ ○─┤ ← Gate connected to input
│ │
─┬─
├───────○ Output
─┴─
│ │ NMOS
│ ─┤ ← Gate connected to input
│ │
─┬─
│
GND
Input = LOW (0V):
- NMOS OFF (V_GS < V_th)
- PMOS ON (V_GS < V_th, remember it's negative for PMOS)
- Output pulled to V_DD → HIGH
Input = HIGH (V_DD):
- NMOS ON
- PMOS OFF
- Output pulled to GND → LOW
Result: Output = NOT(Input)In steady state, one transistor is always OFF, blocking the path from V_DD to GND. Current only flows during switching transitions. This is why CMOS circuits consume power proportional to switching frequency: P = CV²f
NAND Gate: The Universal Gate
V_DD
┌───┴───┐
─┴─ ─┴─
│ │ │ │
A ──○│ │ │ │○── B (PMOS in parallel)
│ │ │ │
─┬─ ─┬─
└───┬───┘
│
├──────────○ Output (Y)
│
─┴─
│ │
A ──│ │ (NMOS in series)
│ │
─┬─
─┴─
│ │
B ──│ │
│ │
─┬─
│
GND
Truth Table:
A B │ Y
────────┼────
0 0 │ 1 (Both PMOS ON, NMOS path broken)
0 1 │ 1 (Left PMOS ON)
1 0 │ 1 (Right PMOS ON)
1 1 │ 0 (Both NMOS ON, output to GND)
Y = NOT(A AND B) = NAND(A, B)NAND is "universal" because any logic function can be built using only NAND gates:
// NOT from NAND
NOT(A) = NAND(A, A)
// AND from NAND
AND(A, B) = NOT(NAND(A, B)) = NAND(NAND(A,B), NAND(A,B))
// OR from NAND
OR(A, B) = NAND(NOT(A), NOT(B)) = NAND(NAND(A,A), NAND(B,B))
// XOR from NAND
XOR(A, B) = NAND(NAND(A, NAND(A,B)), NAND(B, NAND(A,B)))From Gates to Processors
Transistors
↓
Logic Gates (AND, OR, NOT, XOR, NAND, NOR)
↓
Combinational Logic (Adders, Multiplexers, Decoders)
↓
Sequential Logic (Flip-flops, Registers, Counters)
↓
Functional Units (ALU, Register File, Control Unit)
↓
Processor Core
↓
Multi-core CPU / GPU
↓
System-on-Chip (SoC)Moore's Law and Scaling
In 1965, Gordon Moore observed that transistor density doubles approximately every two years. This observation, known as Moore's Law, has held remarkably well for nearly 60 years.
Scaling Trends
Year Process Node Transistors/Chip Example
──────────────────────────────────────────────────────
1971 10 μm 2,300 Intel 4004
1978 3 μm 29,000 Intel 8086
1989 1 μm 1,200,000 Intel 486
1999 180 nm 9,500,000 Pentium III
2006 65 nm 291,000,000 Core 2 Duo
2012 22 nm 1,400,000,000 Core i7 (Ivy Bridge)
2020 7 nm 50,000,000,000 Apple M1
2023 3 nm 114,000,000,000 Apple M3 Max
// Note: "nm" nodes are now marketing terms
// Actual transistor dimensions don't match node namesPhysical Limits
As transistors shrink, we encounter fundamental physics limits:
- Quantum tunneling: Below ~5nm, electrons tunnel through barriers probabilistically
- Gate oxide limits: SiO₂ below ~1nm allows excessive leakage; solved with high-κ dielectrics
- Variability: With billions of transistors, statistical variation becomes significant
- Power density: "Dennard scaling" broke down ~2006; frequency stopped climbing
Modern Solutions
FinFET (3D Tri-Gate)
- Channel wrapped around by gate on 3 sides
- Better electrostatic control
- Used: 22nm to 5nm nodes
GAA (Gate-All-Around) / Nanosheet
- Gate completely surrounds channel
- Multiple stacked nanosheets
- Used: 3nm and below (Samsung, Intel)
CFET (Complementary FET)
- NMOS and PMOS stacked vertically
- 2x density improvement
- Expected: 2nm and below
Alternative Computing
- Chiplets and 3D stacking
- Photonic interconnects
- Neuromorphic chips
- Quantum computing (different paradigm entirely)Conclusion: The Elegance of the Transistor
From a lump of silicon sand to billions of precisely controlled switches—the transistor story is one of humanity's greatest engineering achievements.
What makes it remarkable:
- Physical elegance: Exploiting quantum mechanics and solid-state physics
- Scalability: The same principles work from millimeters to nanometers
- Universality: Simple switches compose into arbitrary computation
- Economics: Mass production makes each transistor virtually free
Every computation, every pixel, every bit of data you've ever processed—it all comes down to transistors switching on and off, billions of times per second, doing exactly what the laws of physics and the ingenuity of engineers designed them to do.