In the 21st century, the decoding of the human genome has become one of the most significant scientific achievements in biology and medicine. The completion of the Human Genome Project provided researchers with a reference sequence of human DNA, making it possible to study the genetic basis of human development, physiology, and disease. However, while the project delivered a complete sequence, it also raised important new questions. One of the most striking issues was how the relatively small number of human protein-coding genes—approximately 21,000—could account for the remarkable biological complexity of the human organism (Claverie, 2001; Lander et al., 2001).

This unexpected discovery challenged earlier assumptions that organismal complexity is directly related to the number of protein-coding genes. Humans were found to have a similar number of such genes as much simpler organisms, such as the nematode Caenorhabditis elegans (Claverie, 2001). As a result, scientists began to shift their attention toward the large portions of the genome that do not encode proteins. These regions, previously referred to as “junk DNA,” were increasingly recognized as functionally important for regulating gene activity and shaping biological complexity.

To address these questions, the ENCODE (Encyclopedia of DNA Elements) Project was launched in 2003 by the National Human Genome Research Institute (NHGRI). The primary goal of the ENCODE Project was to identify and characterize all functional elements within the human genome, including both protein-coding sequences and noncoding regions (ENCODE Project Consortium, 2004). Rather than focusing only on genes, ENCODE aimed to understand how the genome operates as a dynamic and highly regulated system.

This learning unit introduces students aged 14–18 to the core ideas of genomics and functional DNA through an interactive and inquiry-based learning approach. By combining current scientific research with augmented reality (AR), students explore the hidden regulatory layers of the human genome. These include noncoding RNAs, regulatory DNA elements, chromatin modifications, and mechanisms of transcriptional regulation. The initial pilot phase of the ENCODE Project focused on approximately 1% of the human genome (around 30 megabases) and revealed widespread transcriptional activity, including many previously unknown noncoding RNAs and regulatory signals (ENCODE Project Consortium, 2007).

Through AR-based learning stations and digital models, students are able to visualize biological processes that are normally invisible to the naked eye. Interactive three-dimensional models allow learners to explore chromatin structure, observe how regulatory elements interact with genes, and understand how gene expression differs between cell types. Students are also introduced to key genomic technologies such as chromatin immunoprecipitation followed by sequencing (ChIP-seq), DNase I hypersensitive site sequencing (DNase-seq), and RNA sequencing (RNA-seq). These tools help scientists analyze genome activity and identify functional DNA elements (Landt et al., 2012; Djebali et al., 2012; Gerstein et al., 2012).

Research generated by the ENCODE Project has demonstrated that regulatory elements such as promoters and enhancers work together to fine-tune gene expression in a cell-type-specific manner. These interactions depend strongly on chromatin organization and histone modifications, highlighting the importance of genome structure in gene regulation (Kundaje et al., 2012). Such findings have reshaped scientific understanding of how genetic information is controlled within cells.

The project has also emphasized the relevance of noncoding regions in human health and disease. By integrating ENCODE data with genome-wide association studies (GWAS), researchers have been able to functionally annotate a large proportion of disease-associated genetic variants. Many of these variants are located in regulatory regions rather than in protein-coding genes, suggesting that changes in gene regulation can play a critical role in disease development (Schaub et al., 2012; Boyle et al., 2012).

By the end of this learning unit, students will not only gain foundational knowledge in molecular biology and bioinformatics but also develop digital literacy, teamwork, and scientific communication skills. The unit is designed to foster curiosity, critical thinking, and an appreciation for the complexity of the human genome. At the same time, it highlights the collaborative nature of modern scientific research and the importance of large-scale international projects such as ENCODE. While some DNA sequences are directly involved in coding for proteins, a significant portion of the genome consists of noncoding regions that perform essential regulatory and structural roles.