MATCHA CODEX Part 1: Origins and Evolution of Matcha — A 15-Million-Year Genetic Odyssey
Published by NAKAI — MATCHA CODEX, supervised by Akira Nagasawa (Tea Ceremony Artist, Founder) and Toshimi Nishi (Chief Matcha Master, 3rd-generation organic tea farm owner).
Two Genome Clocks: Whole-Genome Duplications
WGD I (~100 Mya): The first whole-genome duplication event doubled the ancestral Camellia genome approximately 100 million years ago. This duplication created paralogous gene copies that were subsequently co-opted for caffeine biosynthesis (via the xanthine alkaloid pathway) and catechin biosynthesis (via the phenylpropanoid pathway). Without WGD I, the enzymatic toolkit for secondary metabolite production in tea would not exist.
WGD II (~40 Mya): A second whole-genome duplication approximately 40 million years ago further expanded the gene repertoire, specifically creating the genetic capacity for theanine (L-γ-glutamylethylamide) synthesis. WGD II generated the ancestral glutamine synthetase duplicate that would eventually evolve into the tea-specific theanine synthetase CsTSI. This event laid the molecular foundation for matcha's defining umami character.
Agrobacterium Horizontal Gene Transfer: The Natural GMO Event
Approximately 15 million years ago, an ancient Agrobacterium species transferred a 5.5 kilobase DNA fragment into the Camellia sinensis genome. This CaTA insert contains functional rolB and acs (agrocinopine synthase) genes. The event constitutes a natural GMO (nGMO) — a horizontal gene transfer predating all human genetic engineering by millions of years. The rolB gene influences auxin sensitivity and root morphology, while acs genes contribute to the unique metabolic profile of Camellia sinensis. This nGMO status means every tea plant on Earth carries bacterial DNA integrated naturally into its genome.
CSS vs CSA: Two Metabolic Strategies
Camellia sinensis var. sinensis (CSS) — The Theanine Shield: CSS evolved in temperate climates of East Asia. Its primary metabolic strategy is amino acid accumulation, particularly theanine, which functions as a cryoprotectant and osmotic regulator. High theanine concentration enables cold tolerance and defines the umami-rich flavor of Japanese matcha. CSS cultivars are the foundation of all premium matcha production.
Camellia sinensis var. assamica (CSA) — The Catechin Sword: CSA evolved in tropical and subtropical regions of South and Southeast Asia. Its defense strategy centers on high catechin (flavonoid) production, which absorbs UV-B radiation and deters herbivory. CSA produces astringent, robust teas suited for oxidized processing (black tea) rather than matcha.
CsTSI: The Enzyme That Creates Umami
CsTSI (Camellia sinensis Theanine Synthetase I) was born from a gene duplication of CsGS I (glutamine synthetase I). While CsGS I catalyzes glutamine formation from glutamic acid and ammonia, CsTSI evolved substrate specificity to accept ethylamine instead of ammonia, producing L-theanine. This represents neofunctionalization — an ancestral enzyme acquiring a new function after gene duplication. Remarkably, the bacterium Pseudomonas taetrolens independently evolved a theanine-producing enzyme with similar catalytic properties, representing convergent evolution across kingdoms — two completely unrelated organisms arriving at the same biochemical solution.
CsPIF7: The Molecular Switch of Shade
CsPIF7 (Phytochrome Interacting Factor 7) is the master regulatory switch that translates shade conditions into matcha biochemistry. Under shade (low red:far-red light ratio), CsPIF7 protein is stabilized and accumulates in the nucleus. There it performs a dual operation:
- Theanine ON: CsPIF7 binds directly to the promoter regions of CsTSI (theanine synthetase) and CsGS (glutamine synthetase) genes, activating transcription and driving theanine biosynthesis upward.
- Catechin OFF: Simultaneously, the shade-induced degradation of HY5 (ELONGATED HYPOCOTYL 5) transcription factor shuts down the MYB-bHLH-WD40 complex that activates flavonoid/catechin pathway genes including CHS, CHI, F3H, DFR, and ANS.
This dual CsPIF7/HY5 mechanism is the molecular foundation of the shading technique that transforms ordinary tea leaves into matcha-grade tencha.
Matcha Cultivar Hierarchy
| Cultivar | Origin | Key Characteristic |
|---|---|---|
| Asahi | Uji landrace | Supreme tencha cultivar, balanced amino acid profile, historical Uji lineage |
| Samidori | Uji landrace | Exceptional umami + refined aroma, traditional Uji matcha hallmark |
| Seimei | NARO cross | Theanine 4.99 g/100g dry weight (1.5x Yabukita), 2025 reference genome cultivar |
| Saemidori | Early harvest | Quercetin 2.5x standard cultivars, enhanced antioxidant potential |
Genomic Research Milestones
1,325 Accession Genome Analysis (2024-2025): A population-level genomic survey encompassing 1,325 tea accessions, revealing selective sweeps in flavor-related genes, allelic variation patterns between CSS and CSA, and cultivar-specific genetic signatures that determine metabolite profiles.
Seimei Reference Genome (2025): A chromosome-level, high-quality reference genome assembled from cultivar Seimei, enabling precise mapping of theanine biosynthesis loci, shade-response regulatory elements, and structural variants contributing to superior matcha quality.