Opening subject page...
Loading your content
Fossils and DNA sequences reveal that all living organisms share deep evolutionary connections.
For centuries, naturalists noticed that certain organisms looked strikingly similar even when living on different continents. Fossils of marine creatures found on mountaintops and the skeletons of whales that contained tiny, seemingly useless leg bones hinted that life had a connected history. The question was whether these patterns reflected a shared origin or were merely coincidental. Common ancestry — the idea that all species descended from shared ancestors — slowly emerged as the most powerful explanation for the unity of life.
From Cuvier's fossil comparisons to modern genome sequencing, two independent lines of evidence — the fossil record and molecular biology — converge on the same conclusion. How exactly do these two sources of data support common ancestry? That is the central question of this lesson.
Before examining evidence, it helps to define several foundational ideas. These principles connect three dimensions of NGSS learning: the Disciplinary Core Idea that genetic information provides evidence of evolution (LS4.A), the Science and Engineering Practice of analyzing and interpreting data, and the Crosscutting Concept of patterns. Together, they form a framework for understanding why fossils and molecules tell the same evolutionary story.
The fossil record provides a physical timeline of evolutionary change. When paleontologists examine rock layers, or strata, they observe a clear pattern: simpler organisms appear in the oldest rocks, while more complex or modern forms appear in younger layers. This ordering is consistent across the globe. The diagram below illustrates how fossils in successive rock layers document the transition from ancient fish to modern tetrapods — a transition supported by extraordinary transitional fossils like Tiktaalik.
Notice how the diagram reveals a pattern — one of the NGSS crosscutting concepts. Fossils do not appear randomly in the rock layers. Instead, they form a logical progression from fully aquatic fish to land-dwelling tetrapods. Each species in the sequence shares features with the ones above and below it, consistent with gradual modification over millions of years. Transitional fossils like Tiktaalik are especially powerful because scientists predicted where to find them before they were discovered.
Fossils tell us about anatomy and where organisms lived, but molecular evidence reveals evolutionary relationships at the level of DNA, RNA, and proteins. All known life on Earth uses the same genetic code — DNA composed of the nucleotides adenine, thymine, guanine, and cytosine. This universality is itself evidence of common ancestry. If organisms had independent origins, there would be no reason for them all to use the same molecular language.
When scientists align comparable DNA or protein sequences from two species, they can count the number of differences. The central idea is straightforward: species that diverged recently will have fewer differences because there has been less time for mutations to accumulate. Species that split long ago will have more differences. For example, the gene for cytochrome c, a protein essential for cellular respiration, differs by only one amino acid between humans and chimpanzees but by approximately 44 amino acids between humans and yeast.
If mutations accumulate at a roughly constant rate in a particular gene, the number of differences between two species can serve as a molecular clock. Scientists calibrate the clock using fossils of known age. Once calibrated, they can estimate divergence times even for lineages that have poor fossil records.
One of the most compelling arguments for common ancestry is that phylogenetic trees built from fossil data consistently match trees built from molecular data. This convergence of independent evidence is powerful. If evolution were false, there would be no reason for anatomical comparisons and DNA comparisons to produce the same branching pattern of relationships. The diagram below shows a simplified phylogenetic tree for five vertebrate groups, constructed from molecular sequence data, with fossil divergence dates indicated at each branching point.
In the diagram above, the branching pattern is the critical feature. Birds and mammals share a more recent common ancestor (approximately 310 Ma) than either shares with fish (approximately 530 Ma). This hierarchical nesting — where groups within groups share progressively more recent ancestors — is exactly what common ancestry predicts. The DNA similarity percentages in the inset reinforce this pattern: humans share 98.7% of their DNA with chimpanzees but only about 60% with fish. Both the fossil record and molecular data independently produce the same tree topology.
| Species Comparison | Cytochrome c Amino Acid Differences | Approximate DNA Similarity (%) | Estimated Divergence (Ma) |
|---|---|---|---|
| Human vs. Chimpanzee | 0 | 98.7 | ~6–7 |
| Human vs. Dog | 11 | 84 | ~95 |
| Human vs. Chicken | 13 | 65 | ~310 |
| Human vs. Frog | 18 | 55 | ~360 |
| Human vs. Yeast | 44 | ~26 | ~1,500 |
Let's apply the molecular clock equation to estimate when two species diverged, using amino acid sequence data from the protein cytochrome c. This example walks through the process step by step.
Both fossil and molecular evidence are powerful, but each has inherent strengths and limitations. Understanding these helps us appreciate why combining multiple lines of evidence produces the strongest scientific arguments. This kind of evaluation reflects the NGSS Science and Engineering Practice of engaging in argument from evidence.
| Criterion | Fossil Evidence | Molecular Evidence |
|---|---|---|
| What it reveals | Physical anatomy, body size, habitat, and when organisms lived | Genetic relatedness, divergence times, and evolutionary rates at the molecular level |
| Key strength | Direct evidence of what past organisms looked like and when they existed; provides absolute dates via radiometric dating | Can compare any organisms with DNA (including those without fossils); highly quantitative and reproducible |
| Key limitation | Fossilization is rare; record is incomplete — especially for soft-bodied organisms. Gaps can make transitions appear sudden. | Assumes a roughly constant mutation rate, which varies between genes and lineages. Cannot directly reveal physical appearance. |
| Availability | Limited to organisms that were preserved under specific geological conditions | Available for all living species and some recently extinct ones (ancient DNA is recoverable from permafrost, amber, etc.) |
| Best for | Documenting major transitions (e.g., water-to-land); establishing the order of appearance of body plans | Resolving relationships among living species; estimating divergence times for lineages with poor fossil records |
The evidence for common ancestry examined in this lesson provides the foundation for more advanced topics in evolutionary biology. At the college and graduate level, scientists use sophisticated computational methods to build phylogenetic trees from thousands of genes simultaneously, a field called phylogenomics. They also study how evolution shapes genomes over time through processes like horizontal gene transfer and gene duplication, which complicate simple tree-like models of descent.
| Topic | This Lesson (HS Level) | Advanced Biology |
|---|---|---|
| Phylogenetic trees | Trees show branching relationships based on shared traits and DNA similarity | Maximum likelihood and Bayesian methods build trees from statistical models of molecular evolution |
| Molecular clock | Assumes a roughly constant mutation rate to estimate divergence times | Relaxed clock models allow rate variation across lineages; calibrated with multiple fossil constraints |
| Homology | Compares anatomical structures and single genes across species | Comparative genomics analyzes entire genomes, identifying synteny (conserved gene order) and regulatory element evolution |
| Evidence scope | Fossil record plus DNA/protein sequences | Adds developmental biology (evo-devo), biogeography, paleogenomics (ancient DNA), and epigenomics |
As you progress in biology, you will encounter cases where molecular data and fossil data initially seem to disagree — for example, when convergent evolution produces similar body forms in unrelated lineages (like the wings of bats and birds), but DNA clearly shows they evolved independently. These apparent conflicts are resolved through more rigorous analysis and demonstrate why using multiple independent lines of evidence is a cornerstone of scientific reasoning.
Two independent lines of evidence powerfully support the idea of common ancestry. The fossil record preserves a chronological sequence of life forms in Earth's rock layers, with transitional fossils like Tiktaalik showing intermediate traits between ancestral and descendant groups. Molecular evidence — including the universal genetic code, DNA and protein sequence comparisons, and molecular clocks — reveals that closely related species share more genetic similarity than distant relatives, exactly as predicted by common descent.
The most compelling feature of these two evidence types is their convergence: phylogenetic trees built from fossil anatomy match trees built from DNA sequences. Each line of evidence has limitations — fossils are incomplete and molecular clocks assume roughly constant mutation rates — but together they form an overwhelming case. Understanding how to evaluate and integrate these multiple lines of evidence reflects the NGSS practices of analyzing and interpreting data and engaging in argument from evidence, while the crosscutting concept of patterns unites the entire investigation.