Many biobanks were established as biorepositories for biomedical research, and a number of biobanks were founded in the 1990s. The main aim of the biobank is to store and to maintain biomaterials for studying chronic disease, identifying risk factors of specific diseases, and applying personalized drug therapies. This report provides a review of biobanks, including Korean biobanks and an analysis of sample volumes, regulations, policies, and ethical issues of the biobank. Until now, the top 6 countries according to the number of large-scale biobanks are the United Kingdom, United States, Sweden, France, the Netherlands, and Italy, and there is one major National Biobank of Korea (NBK) and 17 regional biobanks in Korea. Many countries have regulations and guidelines for the biobanks, and the importance of good management of biobanks is increasing. Meanwhile, according to a first survey of 456 biobank managers in the United States, biobankers are concerned with the underuse of the samples in their repositories, which need to be advertised for researchers. Korea Biobank Network (KBN) project phase II (2013-2015) was also planned for the promotion to use biospecimens in the KBN. The KBN is continuously introducing for researchers to use biospecimens in the biobank. An accreditation process can also be introduced for biobanks to harmonize collections and encourage use of biospecimens in the biobanks. KBN is preparing an on-line application system for the distribution of biospecimens and a biobank accreditation program and is trying to harmonize the biobanks.
Biobanks provide important materials for many research realms, such as biomarker development for the early diagnoses of specific diseases, including cancer and genetic diseases, and for applying personalized drug therapies. There is no consensus definition for a biobank, but it is considered a biorepository that stores biospecimens for use for diagnostic or research purposes. Also, a biobank is an institution that collects, stores, processes, and uses biological materials, genetic data, and associated epidemiological data from human beings or distributes them to researchers [
The oldest biobank for the Framingham Heart Study (FHS), funded by the National Institute of Health-National Heart, Lung, and Blood Institute (NIH-NHLBI), collects blood samples and data and was established in 1948. The purpose of the Framingham program was the development of case-finding procedures. A total of 5,209 persons between the age of 30 and 62 from Framingham, Massachusetts participated in this study, and three generations of participants (for a total of almost 15,000 participants) were recruited; the researchers began clinical examinations and lifestyle-related interviews to seek risk factors related to cardiovascular disease (CVD) development. FHS has revealed the significant identification of the main CVD risk factors, such as high blood pressure, high blood cholesterol, smoking, and obesity, as well as immense information on factors, such as blood triglyceride and high-density lipoprotein cholesterol levels, age, and so on [
By the late 1990s, scientists realized that some diseases originated from a single defective gene, but most genetic diseases are caused by multiple genetic factors on multiple genes [
Owing to the increased demands on qualified biospecimens for research, the number of biobanks has increased significantly between 1980 and 1999 worldwide [
In 2005, the UK Biobank started collecting DNA samples and personal information from 500,000 volunteers aged between 45 and 69 years. The main purpose of the UK Biobank is to look for the relationship between disease, lifestyle, and genes, as well as to identify risk factors that affect individual response to specific drug treatment. The UK Biobank has collected biospecimens from 500,000 people between 2006 and 2010, and they provided their specimens to researchers starting from March 2012 [
A new report by Visiongain (
In 1995, the Ministry of Education and Science Technology established the Korea National Research Resource Centers for collecting various bioresources. However, the scale and variety of the collections were lacking, because collections were based on an individual research project. The Ministry of Environment established biobanks, but their samples were limited to specimens needed for the research of environmentally induced diseases. Therefore, there was a necessity to establish a biobank that was centered on hospitals for collecting human biospecimens [
In 2008, the Ministry of Health and Welfare started the Korea Biobank Project (KBP) and tried to support establishing biobanks within university-affiliated general hospitals as well as creating a network among these biobanks, including National Biobank of Korea (NBK), which is the largest national biorepository in Korea, located in Osong, the central region in South Korea [
A new building for NBK, the largest in Asia, was constructed in April, 2012 at Osong, Chungcheongbuk-do, Korea. The three-story building, including storage rooms that can contain 26 million vials of biospecimens, was constructed. The structure also has laboratories, an auditorium, and so on. The budget for the project was secured from the Korean central government in 2008, and the construction was started in 2010 and finished in 2012 [
There are many guidelines for the management of biobanks, such as the International Society for Biological and Environmental Repositories (ISBER) guideline and Organization for Economic Cooperation and Development (OECD) [
The standard operating protocols (SOPs), regulations, and management for human biospecimens are important. DNA stability and DNA purity in clinical samples are essential for correct performance and interpretation for diagnosis and research. Generally, quality assurance of human biospecimens consists of a DNA stability test, DNA purity test, microorganism contamination test, and cross-contamination test. All biospecimens collected from the Korea Genome and Epidemiology Study (KoGES) in the case of the NBK underwent a DNA degradation test and purity test. Ten percent of biospecimens stored in NBK were randomly chosen and tested for the quality assurance test to ensure the high quality of DNA extracted from the biospecimens and to check contamination by microorganisms. The quality of DNA is important for genetic research, such as genome-wide association studies (GWASs) and whole-genome sequencing (WGS). In general, pure preparations of DNA and RNA have an OD260/OD280 of 1.8 and 2.0 respectively. LCL quality assurance was also conducted, and 10% of the stocked LCL was randomly selected to undergo survival rate investigation and cross-contamination test in the NBK [
Informed consent is getting a patient to sign a written consent form and is the process of communication between a patient and a medical doctor. In medical diagnosis and treatment, including invasive procedures, a doctor or a researcher must give sufficient information to the patient. Informed consent can be given based on a clear understanding of the process, implications, and consequences of an action. The consent process should include sufficient explanation about how individual information would be handled [
Due to the difficulties of securing specific informed consent, broad consent is a more general form of consent in which individuals agree to have their biosamples and personal information collected and stored in the biobank and for future, unspecified research [
In order to ensure bioethics and bioethical safety in the life sciences and biotechnologies, each of the institutions should set up its own Institutional Review Board (IRB). When a genetic testing or research institution obtains written informed consent from a test subject concerning the use of specimens for research purposes, it may provide the specimens to either a person conducting research on genes or an institution licensed to open a biobank in Korea [
In Korea, the Bioethics and Safety Act was completely revised to reinforce research ethics, including biobanks, and the law took effect in 2013. The number of biobanks within university-affiliated hospitals will increase. This law encourages biobanks to strengthen ethical use of biospecimens, and the law assesses biobanks' responsibility to obtain informed consent from donors and is supervised by the IRB. The law also encourages individual researchers to deposit and use biospecimens at any biobank instead of collecting their own sample collection and distribution processes [
Many biobanks have policies for the distribution of biospecimens and data. For example, there is a guideline to distribute biospecimens in the National Health and Nutrition Examination Survey (NHANES), which is a program of studies designed to assess the health and nutritional status of adults and children in the United States. NHANES's biospecimens are representative samples of Americans and should not be used for the independent case-control study, and 1individual vial must be stored permanently. All proposals for the use of NHANES samples are evaluated by a technical panel for scientific merit and by the NHANES Ethics Review Board (ERB). To determine if this limited resource should be used in the proposed projects, a technical panel will evaluate the public health significance and scientific merit of the proposed research. Distribution of a 0.5-mL sample is available once, and after finishing the research, the remaining samples are abandoned, or remnants more than 0.3 mL should be returned to NHANES; third-party biospecimen transfer is not permitted. Researchers who received specimens must submit the result of the analysis, such as quality control and analytical methods with documents, to the National Center for Health Statistics (NCHS). Surplus data without DNA analysis results would be opened in the website and given a period of 3 months for analysis. A researcher's intellectual property rights are not permitted because of research for public health (
The Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (PLCO) biorepository in the United States has 2.9 million biological samples, including blood, and researchers can use samples and data through the process of peer review. Researchers can submit summarized proposals to the PLCO, and the chosen research is recommended for full research proposals and submissions are available twice in June/July or December/January annually. The review process takes 4 or 6 months. Researchers with approved proposals should submit IRB approval documents and contract the Material Transfer Agreement (MTA) mutually, and biospecimens are distributed to researchers when approved; researchers should report experimental results to Information Management Systems (IMS) (
At present, distribution of biospecimens is available to Korean researchers irrespective of research funds, and the distribution for data is accessible to Korean researchers without financial sponsors in Korean biobanks. All proposals for distribution of data and biospecimens should be reviewed and be approved by the IRB belonging to an institute prior to submission. Opened data and biospecimens for distribution in the NBK are shown in
Most of the distributed biospecimens are DNA samples in the NBK, and a few body fluid samples, including sera and plasma, were distributed. According to the advancement of the technology, the minimum quantity for clinical examination and for the analysis of metabolites is diminishing. However, sera and plasma are very limited resources, and political decisions and long-term planning for the use of body fluid samples will be needed [
In 2000, following the HGP, biological research moved into the so-called genomic era. Diseases have been studied by identifying genes and their specific function and understanding the role played by genetics in the beginning and prognosis of diseases, and an advanced world of medicine, known as "personalized medicine," has been initiated [
Accessibility to human specimens and data for the purposes of research is important for realms, such as genomics, proteomics, metabolomics, molecular imaging, and nanotechnology [
Despite intensive research over the last decade, most of the genetic basis of common human diseases remains unknown. The identification of meaningful genetic variants, genes, and pathways involved in special diseases offers a potential route to new therapies, improved diagnosis, and better disease prevention. For some time, it has been hoped that the advent of GWASs would provide a successful new tool for solving the genetic basis of many of these common causes of human morbidity and mortality.
Advances in sequencing technology enabled us to sequence the whole human genome [
Among the new and emerging fields of research is proteomics, the study of the full set of proteins encoded by the genome. Also, proteomics is strongly associated with the discovery phase, the first step in the process chain to create diagnostic content [
Due to the increased demands on qualified biospecimens for research, the number of biobanks has significantly increased recently. By virtue of the advancements in bioinformatics and biotechnology, storing biospecimens and data on a large scale requires that biobanks harmonize biobank processes and regulations [
Most biobanks did their best to secure more biospecimens at the inception of the foundation and comparably distribution rate for the researchers is low. According to a first survey of 456 biobank managers in the United States, nearly 70% of those questioned were concerned with underuse of the samples in their repositories [
For maximizing the value of biospecimens, consistency in the methods by which they are retrieved, processed, stored, and transported is important. Ideally, this should involve the use of agreed SOPs in general [
Horn, the former director of the Genetic Alliance, demonstrates that biobanks have to cooperate together to achieve their mission, and standardization is going to be important for researchers to get samples from different collections; we need catalogs that say where these samples exist [
Biobanks with sample sizes ≥ 200,000 (2012)
Name of Biobank | Size (age group) | Type of biobank | Year of start |
---|---|---|---|
UK Biobank | 500,000 (40‒69 y) | P-B & D-O biobank | 2006 |
Estonian Genome Project | ~1 million | P-B biobank | 2001 |
Icelandic deCode Biobank | ~250,000 | P-B biobank | 1996 |
Kadoorie Study of Chronic Diseases in China | 500,000 (35‒74 y) | P-B biobank | 2004 |
Biobank Japan | 300,000 | D-O biobank | 2002 |
Korea Biobank Project | 500,000 | P-B & D-O biobank | 2008 |
P-B, population-based; D-O, disease-oriented.
Source: GBI Research, P3G data, KBN website, Biobank Japan website.
Data for distribution ofthe National Biobank of Korea (as of June 2013)
Cohort | Epidemiologic data |
Genotype data (after QC) |
||||||
---|---|---|---|---|---|---|---|---|
Person (survey year) | No. of variables | Data | Person | Imputation | Platform | Data | ||
Community-based (Ansung/Ansan) Cohort/KoGES | 1 0,038 (baseline 2001-2002) | 1,922 | Open | 8,842 | O (HapMap) | Affy 5.0 | Open | |
4,690 | CGH array chip | Open | ||||||
100 | Illumina Hiseq | Open | ||||||
8,603 (1st follow-up 2003-2004) | 1,831 | Open | - | - | - | - | ||
7,515 (2nd follow-up 2005-2006) | 2,490 | Open | - | - | - | - | ||
Health Examinee Cohort/KoGES | 3,703 (subset of baselines 2006-2007) | 255 | Open | 3,703 | O (HapMap) | Affy 6.0 | Open | |
CAVAS | CAVAS I | 3,667 (subset of baselines 2004-2008) | 249 | Open | 3,667 | O (HapMap) | Illumina OMni1 | Open |
CAVAS II | 1,816 (subset of baselines 2005-2006) | 1 93 | Open | 1,816 | O (HapMap) | Affy 6.0 | Open | |
Twin Cohort | 1,861 (subset of baselines 2005-2008) | 206 | Open | 1,861 | Affy 6.0 | Open |
QC, quality control; KoGES, Korea Genome and Epidemiology Study; CAVAS, Cardiovascular Disease Association Study.
Biospecimens for distribution of the National Biobank of Korea (as of June 2013)
Cohort | Biospecimens |
||||||
---|---|---|---|---|---|---|---|
Serum |
Plasma |
Urine |
DNA | LCL | LCL-DNA | Release | |
Community-based (Ansung/Ansan) Cohort/KoGES | 11,756 | 9,978 | 11,208 | 12,010 | 10,463 | 10,752 | Open to the researchers |
CAVAS | 64,664 | 64,631 | 64,337 | 54,734 | - | - | Open to the researchers |
Health Examinee Cohort/KoGES | 128,396 | 128,519 | 127,539 | 119,220 | - | - | Open to the researchers |
Twin Cohort | 3,088 | 3,154 | 3,128 | 3,240 | 2,443 | 498 | Open to the researchers |
The Korea National Health and Nutrition Examination Study (KNHNES) | 44,505 | 27,933 | - | 25,644 | - | - | Open to the researchers |
Total | 254,409 | 236,215 | 206,212 | 216,848 | 12,906 | 11,205 |
LCL, lymphoblastoid cell line; KoGES, Korea Genome and Epidemiology Study; CAVAS, Cardiovascular Disease Association Study.
A part of them is being distributed to the researchers.