Identification of rare sequencing variants with a larger functional impact has the potential to highlight new pathways contributing to obesity. Using whole-exome sequencing followed by genotyping, we have identified a low-frequency coding variant rs2076349 (V527M) in the laminin subunit β3 (LAMB3) gene showing strong association with morbid obesity and thereby risk of type 2 diabetes. We exome-sequenced 200 morbidly obese subjects and 100 control subjects with pooled DNA samples. After several filtering steps, we retained 439 obesity-enriched low-frequency coding variants. Associations between genetic variants and obesity were validated sequentially in two case-control cohorts. In the final analysis of 1,911 morbidly obese and 1,274 control subjects, rs2076349 showed strong association with obesity (P = 9.67 × 10−5; odds ratio 1.84). This variant was also associated with BMI and fasting serum leptin. Moreover, LAMB3 expression in adipose tissue was positively correlated with BMI and adipose morphology (few but large fat cells). LAMB3 knockdown by small interfering RNA in human adipocytes cultured in vitro inhibited adipogenesis. In conclusion, we identified a previously not reported low-frequency coding variant that was associated with morbid obesity in the LAMB3 gene. This gene may be involved in the development of excess body fat.
Introduction
Obesity is a common chronic disease with a strong heredity component; it is a major contributor to the global burden of chronic disease such as type 2 diabetes (1). Genome-wide association studies (GWAS) have revolutionized the genetic analysis of multifactorial traits and led to the successful discovery of numerous genetic risk markers for obesity (2). However, large gaps in our understanding of the heredity impact on obesity remain. First, identified single nucleotide variants (SNVs) associated with obesity tend to have a minor impact on disease risk, thereby complicating the characterization of underlying functional genetic variants and pathophysiology. Second, together, identified genetic loci explain <3% of variation in BMI in the population (2).
In an attempt to identify additional susceptibility gene loci for complex disease, researchers have begun to apply the recently developed high-throughput sequencing technology, which has shown power to detect low-frequency and rare disease-causing variants by deep sequencing of all known exons (3,4). A few gene variants have been shown to associate with severe obesity (5,6). Whole-exome sequencing (WES) might be most efficient when applied to patients with more extreme forms of common diseases such as morbid obesity for a number of reasons. First, rare and low-frequency variants must have a high penetrance to allow statistical detection. Second, sequencing variants in the protein coding sequence might be more likely to have a stronger impact on phenotypes than variants in intergenic regions (e.g., known monogenic or major gene causes of obesity are associated with a severe phenotype and major impact on BMI) (7).
In this study, we used WES to detect low-frequency gene variants enriched in adult subjects with morbid obesity and validated them against never obese elderly adults. We believed that it would be possible to enrich for obesity genes among cases and to filter away such genes in the control subjects. In this study, we report a low-frequency obesity-associated variant in the coding region of the laminin subunit β3 (LAMB3) gene.
Research Design and Methods
Sample Selection
Obese and control subjects were recruited since 1993 through local advertisement for the purpose of studying genes regulating body weight or in association with planned visits to our medical or surgical units for morbid obesity or scoliosis, respectively. Subjects were examined in the morning after a night fast, and anthropometric measurements were performed. Serum leptin was measured by ELISA (Mercodia, Uppsala, Sweden).
At present, the total sample set consists of ∼2,400 nonobese and ∼4,300 obese subjects. From the obese subset, we selected, among the 10% with the highest BMI, 200 subjects with extreme obesity for WES. We have performed two rounds of the sequencing, each containing 100 obese samples. The 100 subjects in the first examination were reported previously (6). The results presented in this study included WES data from all 200 obese subjects. As control subjects, we used 100 subjects with idiopathic scoliosis who were sequenced using the same layout setting for WES from a genetic study (8). For the first genotyping validation, we selected the 484 morbidly obese subjects with the highest BMI in the large sample collection described above, including the 200 subjects who had been used in WES and 491 never-obese subjects with age >40 years and BMI always <30 kg/m2. In the second validation, we genotyped 1,427 morbidly obese subjects and 783 never-obese control subjects with age >40 years and BMI always <30 kg/m2.
All investigated obese subjects in this study had morbid obesity (i.e., BMI >40 kg/m2). The nonobese control subjects for WES had scoliosis. All other control subjects were healthy according to self-report. There was an overrepresentation of women in the studied cohorts because obese women are more likely to search for medical advice for their obesity, and scoliosis is more common among women. All subjects were of European ancestry and living in Sweden. The study was approved by the Regional Ethics Committees of Stockholm, and all subjects gave their informed consent to participation.
LAMB3 mRNA expression in abdominal subcutaneous white adipose tissue (WAT) was measured in 114 Swedish women without diabetes who were recruited from the general adult population in Stockholm, Sweden; this dataset is described by Dahlman et al. (9). The women displayed a large interindividual variation in BMI.
DNA Preparation and Pooling
Genomic DNA was prepared from peripheral blood mononuclear cells using the QiAmp DNA Blood Maxi kit (catalog number 51194; Qiagen, Hilden, Germany). DNA purity and quality was confirmed by A260/280 ratio >1.8 on Nanodrop (Thermo Fisher Scientific, Waltham, MA) and agarose gel electrophoresis. DNA concentration was measured by Qubit (Life Technologies, Stockholm, Sweden). Subsequently, we took 0.8 μg of each DNA sample and randomly divided them into 10 pools, each containing 10 samples of either obese cases or control subjects. The concentrations of pooled DNA samples were measured with Qubit (Life Technologies) and the samples run on agarose gel.
WES
WES was performed at the Science for Life Laboratory (Stockholm, Sweden) as previously descried (6). Briefly, each DNA library was prepared from 3 μg of the pooled genomic DNA. DNA was sheared to 300 base pairs (bp) using a Covaris S2 instrument (Covaris, Brighton, U.K.) and enriched by using the SureSelectXT Human All Exon 50 Mb kit (Agilent Technologies) and an Agilent NGS workstation according to the manufacturer’s instructions (SureSelectXT Automated Target Enrichment for Illumina Paired-End Multiplexed Sequencing, version A; Agilent Technologies). The clustering was performed on a cBot cluster generation system using a HiSeq paired-end read cluster generation kit according to the manufacturer’s instructions (Illumina). Samples were sequenced on HiSEq 2500 (Illumina) as paired-end reads to 100 bp/read. The sequencing runs were performed according to the manufacturer’s instructions. Demultiplexing and conversion were done by using CASAVA v1.8.2 (http://support.illumina.com/sequencing/sequencing_software/casava.html).
Sequencing Read Mapping, Variant Calling, and Functional Annotation
WES data processing was described previously (6). All sequencing reads were first aligned to the human reference genome assembly hg19/GRCH37 using Burrows-Wheeler Aligner (BWA version 0.6.1; http://bio-bwa.sourceforge.net/) with a read-trimming parameter quality score of 20. Sequence variants were called by using the mpileup function of samtools-0.1.18 (http://samtools.sourceforge.net/) with a minimum mapping quality of 20 and read depth between 8× and 1,000× for filtering. PCR duplicates were removed using samtools prior to variant calling. Annovar (http://www.openbioinformatics.org/annovar/ [10]) was used to integrate variant information from public databases, including gene reference (http://hgdownload.soe.ucsc.edu/goldenPath/hg19/database/refGene.txt, 2013), Single Nucleotide Polymorphism database (dbSNP; SNP138; http://hgdownload.soe.ucsc.edu/goldenPath/hg19/database/snp137.txt.gz), and the 1000 Genomes Project (http://www.1000genomes.org/).
Variant Filtering and Enrichment for Obesity
Variant enrichment for obesity was achieved through several filtering and comparison steps (Fig. 1). The enrichment was performed based on the WES data from 20 obese pools against 10 control pools. First, we filtered out nonexonic and synonymous SNVs and kept missense and nonsense SNVs or SNVs causing potentially splicing changes. Secondly, we counted appearance frequencies of SNVs in obese and control pools separately and compared differences of the frequencies between the two types of pools. We retained SNVs appearing more often in obese pools using the following criteria: 1) present in at least two obese pools and 2) not present in any of the control pools. Our focus was low-frequency coding SNVs with potential functional impact. The last filtering step was based on minor allele frequency (MAF). With the assumption that a causative variant should be low frequency in the normal population, we filtered for SNVs that were either not found in public databases or known SNVs with an MAF in European origin populations (MAFEur) ≤5% (1000 Genomes Project, 2011 May release).
Overview of working process. MQ, sequencing read mapping quality; unknown (variants), variants that are not included in SNP138.
Overview of working process. MQ, sequencing read mapping quality; unknown (variants), variants that are not included in SNP138.
Variant Validation and Association Studies
A two-stage validation strategy was used (Fig. 1). In the first stage, all obesity-enriched variants from the above filtering steps that could fit into a 384-well panel were genotyped in 484 obese subjects and 491 control subjects using the Illumina GoldenGate assay (Illumina) performed at the SNP&SEQ Technology Platform at Uppsala University (http://www.molmed.medsci.uu.se/SNP+SEQ+Technology+Platform/Genotyping/). The results were analyzed using the software GenomeStudio 2011.1 from Illumina. For the second step, the SNVs showing nominal significant association (P ≤ 0.05) with obesity in the first step were genotyped in additional 1,427 obese and 783 control subjects. Genotyping was performed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (SEQUENOM; Agena Bioscience, San Diego, CA) at the Mutation Analysis Core Facility at Karolinska Institutet (11). Multiplexed assays were designed using MassARRAY Assay Design v4.0 Software (Agena Bioscience). Protocol for allele-specific base extension was performed according to Agena Bioscience’s recommendation.
Additionally, rs2076349 was validated by using Sanger sequencing at Eurofins (http://www.eurofins.com/en.aspx) and TaqMan genotyping (C__22274317_10; Applied Biosystems, Foster City, CA) to confirm the genotypes of all homozygote for the rare allele T of the variant and a few heterozygote subjects.
All genetic association results in validation cohorts 1 and 2 have been submitted to GWAS Central.
Fat Cell Studies in a Population of Healthy Subjects
Patients were investigated in the morning after an overnight fast when abdominal subcutaneous WAT was obtained by fine-needle aspiration. Adipocytes were isolated from abdominal subcutaneous WAT and adipocytes isolated as described (12). By comparing the size of the adipocytes with the total amount of body fat, the morphology of WAT could be quantitatively determined (delta values), as described in detail (13). Positive values are indicative of hypertrophy (few but large fat cells) and negative values by hyperplasia (many small fat cells).
Gene Expression
WAT LAMB3 expression in the population of healthy subjects was assessed by Gene 1.0 and 1.1 ST Affymetrix arrays (Affymetrix) as described (9).
Adipocyte Cell Culture and Small Interfering RNA Transfection of Human Mesenchymal Stem Cells
Isolation, growth, and differentiation of human mesenchymal stem cells (hMSCs) were previously described (14). hMSCs were reverse transfected 24 h before induction of adipogenesis using ON-TARGETplus SMARTpool small interfering RNAs (siRNAs) targeting LAMB3 or nontargeting siRNA pool (Dharmacon, Lafayette, CO). For the knockdown of gene expression, 6.25 pmol of siRNA (giving a final concentration of 50 nmol/L) was incubated together with 0.35 μL of Dharmafect-3 (Thermo Fisher Scientific) and medium (final volume of 25 μL) in a 96-well plate for 30 min at room temperature. Thereafter, 10,000 of cells in a volume of 100 μL were added. When transfection was performed in a 24-well plate, 3.2 μL of Dharmafect-3 was mixed with 50 nmol/L of siRNA in a final volume of 100 μL following addition of 100,000 cells in a volume of 400 µL. Cells were incubated for 24 h in a proliferation medium without fibroblast growth factor 2, and then adipogenesis was induced.
The hMSCs were collected at days 2, 7, 9, and 12 after the induction of differentiation for isolation of RNA. Total RNA was extracted using NucleoSpin RNA II kit (Macherey-Nagel, Düren, Germany) according to the manufacturer’s instructions. Concentration and purity of RNA were measured using a Nanodrop ND-1000 Spectrophotometer (Thermo Fisher Scientific). Reverse transcription was performed using the iScript cDNA synthesis kit (Qiagen) and random hexamer primers (Invitrogen, Carlsbad, CA). Quantitative RT-PCR of coding genes was performed using commercial TaqMan probes (Applied Biosystems). Gene expression was normalized to the internal reference gene 18s. Relative expression was calculated using the 2(−ΔΔ threshold cycle) method (15).
Neutral Lipid and DNA Staining in Adipocytes
The hMSCs were transfected and differentiated in 96-well plates for 12 days as described above, washed with PBS, and fixed with 4% paraformaldehyde solution containing 0.123 mol/L NaH2PO4 × H2O, 0.1 mol/L NaOH, and 0.03 mol/L glucose for 10 min at room temperature. Fixed cells were washed with PBS. Cell nuclei were stained with Hoechst 33342 (2 μg/mL; Molecular Probes, Thermo Fisher Scientific), and neutral lipids were stained with Bodipy 493/503 (0.2 μg/mL; Molecular Probes) diluted in PBS for 20 min at room temperature.
After washing, accumulation of intracellular lipid droplets and cell number (amount of stained nuclei) were quantified with the Acumen eX3 imager (TTP Labtech, Hertfordshire, U.K.). Total Bodipy (lipid droplet) fluorescence was normalized to the amount of nuclei in each well (Hoechst staining).
Statistical Analyses
Genetic association analyses and odds ratio (OR) calculations were performed for A1 (minor alleles) based on genotypes using PLINK (http://pngu.mgh.harvard.edu/∼purcell/plink/). An χ2 test was used in statistical analysis. Hardy-Weinberg equilibrium among case and control subjects was checked separately for each round of validation prior to the association analyses. A nominal P value of 0.01 in control subjects was used as a cutoff value for exclusion of SNVs from further association analyses. Obesity status was characterized by BMI and reported as mean ± SD for obese and control subjects separately (Table 1). Quantitative phenotypes were analyzed using standard linear regression implemented via PLINK. Sex and age were used as covariates to evaluate their influence on the association of a variant with BMI. Correction for multiple tests was performed using PLINK adjust function, with genomic control–corrected p value calculated based on genotypes of all variants in the final analysis. Standard least square algorithm was used in regression analysis of mRNA expression in the population of healthy subjects. Student t test was used in fat cell functional assays.
Clinical information on new samples used for whole-exome sequencing and variant validations
. | Obese cases . | Control subjects . | ||||||
---|---|---|---|---|---|---|---|---|
Nationality . | Female/male . | BMI (kg/m2) . | Age (years) . | Nationality . | Female/male . | BMI (kg/m2) . | Age (years) . | |
Discovery by WES | Swedish | 74/26 | 55.8 ± 3.6 | 41.0 ± 11.0 | Swedish | 76/24 | 21.9 ± 4.3 | 24.5 ± 12.8 |
First validation | Swedish | 349/135 | 51.2 ± 3.7 | 41.0 ± 11.6 | Swedish | 348/143 | 24.4 ± 2.7 | 55.1 ± 5.8 |
Second validation | Swedish | 1,069/358 | 43.1 ± 2.5 | 42.3 ± 11.6 | Swedish | 697/86 | 23.9 ± 2.7 | 44.9 ± 3.9 |
Total* | 1,418/493 | 1,045/229 |
. | Obese cases . | Control subjects . | ||||||
---|---|---|---|---|---|---|---|---|
Nationality . | Female/male . | BMI (kg/m2) . | Age (years) . | Nationality . | Female/male . | BMI (kg/m2) . | Age (years) . | |
Discovery by WES | Swedish | 74/26 | 55.8 ± 3.6 | 41.0 ± 11.0 | Swedish | 76/24 | 21.9 ± 4.3 | 24.5 ± 12.8 |
First validation | Swedish | 349/135 | 51.2 ± 3.7 | 41.0 ± 11.6 | Swedish | 348/143 | 24.4 ± 2.7 | 55.1 ± 5.8 |
Second validation | Swedish | 1,069/358 | 43.1 ± 2.5 | 42.3 ± 11.6 | Swedish | 697/86 | 23.9 ± 2.7 | 44.9 ± 3.9 |
Total* | 1,418/493 | 1,045/229 |
Values for BMI and age are mean ± SD.
Control subjects used in the two runs of validations were nonobese.
*Nonduplicated counting (i.e., individuals used in the WES were included in the first validation).
Results
Clinical Characteristics of Subjects
Identification of Obesity-Associated Low-Frequency Variants by WES
WES generated ∼100 million nonduplicated sequencing reads for each of 20 obese pools and 10 control pools (Supplementary Table 1). The vast majority of paired reads were uniquely mapped onto the same chromosome with a pairing rate up to 90%. These reads were aligned to the human reference genome (hg19/GRCh 37) with mapping rates >95% (except one of the control pools) (Supplementary Table 1), and >90% of the target regions were completely covered with at least 8× depth (Supplementary Table 2).
Potentially interesting variants were identified through several filtering steps (Fig. 1). Obtained variants were summarized in Supplementary Table 3. The majority of the variants (between 111,589 and 201,653 per pool) were SNVs, of which >20,000 were exonic or at potential splicing sites. Our focus was SNVs with functional impact (i.e., missense, stop codon, or potential splicing site variants). We therefore filtered for exonic SNVs with the exclusion of synonymous variants. At this stage, we got >10,000 (10,511–13,297) such functional SNVs from each pool (Supplementary Table 3). Next, we filtered for SNVs shared by at least two obesity pools and not present in any control pools. Finally, after removing common SNVs with MAFEur >0.05 (1000 Genomes Project, http://www.1000genomes.org/), 439 SNVs were retained for the validation studies.
SNV Validation and Association Analyses
Two rounds of validations by genotyping were performed. In the first round, 382 SNVs, out of 439, that successfully passed quality criterion in primer design (final score >0.4) for the Illumina GoldenGate assay (Illumina) were genotyped in 484 morbidly obese subjects and 491 never-obese elderly control subjects. Two additional SNVs (with a score close to 0.4) were added to fill a 384-well genotyping panel. Genotyping of 19 SNVs failed; this was likely due to technical problems with the assay because each SNV had 0% in call rate. For remaining genotyped SNVs, the average sample call rate was 98.6%. Among successfully genotyped SNVs, eight were nonpolymorphic. Hardy-Weinberg equilibrium was checked prior to association analysis. After removal of eight more SNVs with skewed genotypes (P < 0.01), 349 SNVs were used in association analysis.
Twenty-five SNVs showed nominal association (P ≤ 0.05) with obesity in the first validation (Table 2). They were subsequently subjected to a second round of genotyping in an additional large case-control cohort consisting of 1,427 morbidly obese subjects and 783 never-obese elderly control subjects. In this round, three SNVs failed in genotyping, and one failed in primer design. Out of the remaining 21 SNVs (Supplementary Table 4), rs2076349 (P = 0.011 and 0.003 for the first and second validations, respectively) and rs142678624 (P = 0.026 and 0.047, respectively) consistently displayed a significant association with obesity (Supplementary Table 4) in the two rounds of validations. In the final joint data analysis with all genotyped subjects, rs2076349 showed strong association with obesity (P = 9.67 × 10−5; OR [95% CI], 1.84 [1.35–2.53]), whereas the association of rs142678624 with obesity was diminished. Moreover, the association of rs2076349 was demonstrated in both females (P = 0.002; OR [95% CI], 1.69 [1.21–0.2.36]) and males (P = 0.003; OR [95% CI], 4.19 [1.48–11.85]) (Table 3). Besides rs2076349, six other SNVs also showed nominal significant (P ≤ 0.05) associations with obesity in the final joint analysis of all genotyped subjects (Table 3).
Associations of SNVs with obesity in the first validation
CHR . | SNV . | A1 . | Freq_case . | Freq_control . | A2 . | P value . | OR . | L95 . | U95 . | Gene . |
---|---|---|---|---|---|---|---|---|---|---|
1 | rs45588635 | C | 0.04211 | 0.01987 | G | 0.005081 | 2.168 | 1.246 | 3.771 | CLCNKA |
1 | rs11588392 | A | 0.05925 | 0.02342 | G | 7.00E-05 | 2.626 | 1.605 | 4.298 | CLCNKB |
1 | rs116377172* | T | 0.02899 | 0.01527 | A | 0.03947 | 1.924 | 1.021 | 3.626 | SRGAP2 |
1 | rs2076349 | T | 0.04029 | 0.02138 | C | 0.01565 | 1.921 | 1.122 | 3.29 | LAMB3 |
2 | rs150681136 | A | 0.03926 | 0.01833 | G | 0.005668 | 2.188 | 1.24 | 3.862 | GPC1 |
2 | rs34069570* | A | 0.02583 | 0.01227 | G | 0.02857 | 2.134 | 1.066 | 4.273 | ANO7 |
3 | rs11920543 | A | 0.04772 | 0.02851 | G | 0.02679 | 1.707 | 1.058 | 2.755 | IQCB1 |
4 | rs7680970 | A | 0.03838 | 0.06033 | C | 0.02571 | 0.6217 | 0.4081 | 0.9472 | FAM13A |
8 | rs143039156 | A | 0.03878 | 0.0224 | C | 0.03605 | 1.761 | 1.031 | 3.007 | NKX2–6 |
8 | rs80304851 | C | 0.06405 | 0.04388 | T | 0.0487 | 1.491 | 1 | 2.224 | PARP10 |
9 | rs56200518 | A | 0.02169 | 0.009202 | G | 0.02533 | 2.388 | 1.088 | 5.24 | DAB2IP |
10 | rs61753067 | G | 0.0339 | 0.01939 | A | 0.04762 | 1.775 | 0.9988 | 3.153 | HSPA12A |
11 | rs16937251 | C | 0.04307 | 0.02444 | G | 0.02306 | 1.796 | 1.077 | 2.997 | NAV2 |
14 | rs61985140* | A | 0.03512 | 0.01939 | G | 0.0328 | 1.841 | 1.043 | 3.251 | DDHD1 |
15 | rs142678624 | A | 0.01346 | 0.004082 | G | 0.02627 | 3.328 | 1.081 | 10.24 | APBA2 |
15 | rs143879080 | C | 0.006198 | 0 | A | 0.01348 | NA | NA | NA | CYP11A1 |
15 | rs61734378 | T | 0.05475 | 0.02953 | C | 0.00553 | 1.903 | 1.2 | 3.02 | AKAP13 |
16 | rs61737709 | A | 0.06612 | 0.04397 | C | 0.03211 | 1.539 | 1.035 | 2.29 | TMC5 |
16 | rs9928053 | A | 0.02686 | 0.01327 | G | 0.03222 | 2.053 | 1.049 | 4.019 | ACSM5 |
16 | rs74029025 | C | 0.02692 | 0.005102 | G | 0.0001217 | 5.394 | 2.063 | 14.1 | PKD1L2 |
17 | rs150237469 | A | 0.01653 | 0.03157 | G | 0.03038 | 0.5156 | 0.2801 | 0.9489 | PIK3R6 |
17 | rs41298712 | G | 0.04969 | 0.03163 | A | 0.04359 | 1.601 | 1.01 | 2.537 | ENDOV |
20 | rs78568430 | G | 0.03209 | 0.01731 | C | 0.0354 | 1.882 | 1.035 | 3.424 | MYT1 |
22 | rs56340734* | A | 0.01349 | 0.03926 | G | 0.0004111 | 0.3346 | 0.1771 | 0.6321 | TTC38 |
X | rs5977625 | A | 0.05529 | 0.03469 | G | 0.04236 | 1.629 | 1.013 | 2.619 | FRMD7 |
CHR . | SNV . | A1 . | Freq_case . | Freq_control . | A2 . | P value . | OR . | L95 . | U95 . | Gene . |
---|---|---|---|---|---|---|---|---|---|---|
1 | rs45588635 | C | 0.04211 | 0.01987 | G | 0.005081 | 2.168 | 1.246 | 3.771 | CLCNKA |
1 | rs11588392 | A | 0.05925 | 0.02342 | G | 7.00E-05 | 2.626 | 1.605 | 4.298 | CLCNKB |
1 | rs116377172* | T | 0.02899 | 0.01527 | A | 0.03947 | 1.924 | 1.021 | 3.626 | SRGAP2 |
1 | rs2076349 | T | 0.04029 | 0.02138 | C | 0.01565 | 1.921 | 1.122 | 3.29 | LAMB3 |
2 | rs150681136 | A | 0.03926 | 0.01833 | G | 0.005668 | 2.188 | 1.24 | 3.862 | GPC1 |
2 | rs34069570* | A | 0.02583 | 0.01227 | G | 0.02857 | 2.134 | 1.066 | 4.273 | ANO7 |
3 | rs11920543 | A | 0.04772 | 0.02851 | G | 0.02679 | 1.707 | 1.058 | 2.755 | IQCB1 |
4 | rs7680970 | A | 0.03838 | 0.06033 | C | 0.02571 | 0.6217 | 0.4081 | 0.9472 | FAM13A |
8 | rs143039156 | A | 0.03878 | 0.0224 | C | 0.03605 | 1.761 | 1.031 | 3.007 | NKX2–6 |
8 | rs80304851 | C | 0.06405 | 0.04388 | T | 0.0487 | 1.491 | 1 | 2.224 | PARP10 |
9 | rs56200518 | A | 0.02169 | 0.009202 | G | 0.02533 | 2.388 | 1.088 | 5.24 | DAB2IP |
10 | rs61753067 | G | 0.0339 | 0.01939 | A | 0.04762 | 1.775 | 0.9988 | 3.153 | HSPA12A |
11 | rs16937251 | C | 0.04307 | 0.02444 | G | 0.02306 | 1.796 | 1.077 | 2.997 | NAV2 |
14 | rs61985140* | A | 0.03512 | 0.01939 | G | 0.0328 | 1.841 | 1.043 | 3.251 | DDHD1 |
15 | rs142678624 | A | 0.01346 | 0.004082 | G | 0.02627 | 3.328 | 1.081 | 10.24 | APBA2 |
15 | rs143879080 | C | 0.006198 | 0 | A | 0.01348 | NA | NA | NA | CYP11A1 |
15 | rs61734378 | T | 0.05475 | 0.02953 | C | 0.00553 | 1.903 | 1.2 | 3.02 | AKAP13 |
16 | rs61737709 | A | 0.06612 | 0.04397 | C | 0.03211 | 1.539 | 1.035 | 2.29 | TMC5 |
16 | rs9928053 | A | 0.02686 | 0.01327 | G | 0.03222 | 2.053 | 1.049 | 4.019 | ACSM5 |
16 | rs74029025 | C | 0.02692 | 0.005102 | G | 0.0001217 | 5.394 | 2.063 | 14.1 | PKD1L2 |
17 | rs150237469 | A | 0.01653 | 0.03157 | G | 0.03038 | 0.5156 | 0.2801 | 0.9489 | PIK3R6 |
17 | rs41298712 | G | 0.04969 | 0.03163 | A | 0.04359 | 1.601 | 1.01 | 2.537 | ENDOV |
20 | rs78568430 | G | 0.03209 | 0.01731 | C | 0.0354 | 1.882 | 1.035 | 3.424 | MYT1 |
22 | rs56340734* | A | 0.01349 | 0.03926 | G | 0.0004111 | 0.3346 | 0.1771 | 0.6321 | TTC38 |
X | rs5977625 | A | 0.05529 | 0.03469 | G | 0.04236 | 1.629 | 1.013 | 2.619 | FRMD7 |
A1, minor allele; A2, alternative allele; CHR, chromosome; Freq, frequencies of minor allele A1; L95, lower 95% CI; NA, not available; U95, upper 95% CI.
*SNPs not used in the second validation.
Associations of SNVs with obesity in the combined data from two rounds of validations
. | CHR . | SNV . | A1 . | Freq_case . | Freq_control . | A2 . | P value . | OR . | L95 . | U95 . | Gene . |
---|---|---|---|---|---|---|---|---|---|---|---|
Total | 1 | rs11588392 | A | 0.04397 | 0.03199 | G | 0.0161 | 1.39 | 1.06 | 1.82 | CLCNKB |
1 | rs2076349 | T | 0.03935 | 0.02169 | C | 9.67E-05 | 1.85 | 1.35 | 2.53 | LAMB3 | |
8 | rs80304851 | C | 0.05708 | 0.04437 | T | 0.02589 | 1.30 | 1.03 | 1.65 | LOC105375801 | |
11 | rs16937251 | C | 0.03775 | 0.02684 | G | 0.01788 | 1.42 | 1.06 | 1.91 | NAV2 | |
15 | rs61734378 | T | 0.04806 | 0.03275 | C | 0.002905 | 1.49 | 1.14 | 1.94 | AKAP13 | |
16 | rs61737709 | A | 0.06171 | 0.04937 | C | 0.03752 | 1.26 | 1.01 | 1.58 | TMC5 | |
16 | rs9928053 | A | 0.02417 | 0.01657 | G | 0.03938 | 1.47 | 1.02 | 2.13 | ACSM5 | |
Female | 1 | rs11588392 | A | 0.04545 | 0.03182 | G | 0.01579 | 1.45 | 1.07 | 1.96 | CLCNKB |
1 | rs2076349 | T | 0.04066 | 0.02454 | C | 0.002028 | 1.69 | 1.21 | 2.36 | LAMB3 | |
8 | rs80304851 | C | 0.05989 | 0.0411 | T | 0.003453 | 1.49 | 1.14 | 1.94 | LOC105375801 | |
11 | rs16937251 | C | 0.03942 | 0.02697 | G | 0.01785 | 1.48 | 1.07 | 2.05 | NAV2 | |
15 | rs61734378 | T | 0.0471 | 0.03276 | C | 0.01238 | 1.46 | 1.08 | 1.97 | AKAP13 | |
16 | rs61737709 | A | 0.06126 | 0.04817 | C | 0.04836 | 1.28 | 1.00 | 1.65 | TMC5 | |
16 | rs9928053 | A | 0.02337 | 0.01877 | G | 0.2713 | 1.25 | 0.84 | 1.87 | ACSM5 | |
Male | 1 | rs11588392 | A | 0.03971 | 0.03275 | G | 0.5171 | 1.22 | 0.67 | 2.24 | CLCNKB |
1 | rs2076349 | T | 0.03557 | 0.008734 | C | 0.003449 | 4.19 | 1.48 | 11.85 | LAMB3 | |
8 | rs80304851 | C | 0.04898 | 0.05921 | T | 0.4173 | 0.82 | 0.50 | 1.33 | LOC105375801 | |
11 | rs16937251 | C | 0.03292 | 0.0262 | G | 0.4923 | 1.27 | 0.65 | 2.48 | NAV2 | |
15 | rs61734378 | T | 0.05081 | 0.03275 | C | 0.1238 | 1.58 | 0.88 | 2.85 | AKAP13 | |
16 | rs61737709 | A | 0.06301 | 0.05482 | C | 0.5443 | 1.16 | 0.72 | 1.87 | TMC5 | |
16 | rs9928053 | A | 0.02648 | 0.006579 | G | 0.0125 | 4.11 | 1.24 | 13.64 | ACSM5 |
. | CHR . | SNV . | A1 . | Freq_case . | Freq_control . | A2 . | P value . | OR . | L95 . | U95 . | Gene . |
---|---|---|---|---|---|---|---|---|---|---|---|
Total | 1 | rs11588392 | A | 0.04397 | 0.03199 | G | 0.0161 | 1.39 | 1.06 | 1.82 | CLCNKB |
1 | rs2076349 | T | 0.03935 | 0.02169 | C | 9.67E-05 | 1.85 | 1.35 | 2.53 | LAMB3 | |
8 | rs80304851 | C | 0.05708 | 0.04437 | T | 0.02589 | 1.30 | 1.03 | 1.65 | LOC105375801 | |
11 | rs16937251 | C | 0.03775 | 0.02684 | G | 0.01788 | 1.42 | 1.06 | 1.91 | NAV2 | |
15 | rs61734378 | T | 0.04806 | 0.03275 | C | 0.002905 | 1.49 | 1.14 | 1.94 | AKAP13 | |
16 | rs61737709 | A | 0.06171 | 0.04937 | C | 0.03752 | 1.26 | 1.01 | 1.58 | TMC5 | |
16 | rs9928053 | A | 0.02417 | 0.01657 | G | 0.03938 | 1.47 | 1.02 | 2.13 | ACSM5 | |
Female | 1 | rs11588392 | A | 0.04545 | 0.03182 | G | 0.01579 | 1.45 | 1.07 | 1.96 | CLCNKB |
1 | rs2076349 | T | 0.04066 | 0.02454 | C | 0.002028 | 1.69 | 1.21 | 2.36 | LAMB3 | |
8 | rs80304851 | C | 0.05989 | 0.0411 | T | 0.003453 | 1.49 | 1.14 | 1.94 | LOC105375801 | |
11 | rs16937251 | C | 0.03942 | 0.02697 | G | 0.01785 | 1.48 | 1.07 | 2.05 | NAV2 | |
15 | rs61734378 | T | 0.0471 | 0.03276 | C | 0.01238 | 1.46 | 1.08 | 1.97 | AKAP13 | |
16 | rs61737709 | A | 0.06126 | 0.04817 | C | 0.04836 | 1.28 | 1.00 | 1.65 | TMC5 | |
16 | rs9928053 | A | 0.02337 | 0.01877 | G | 0.2713 | 1.25 | 0.84 | 1.87 | ACSM5 | |
Male | 1 | rs11588392 | A | 0.03971 | 0.03275 | G | 0.5171 | 1.22 | 0.67 | 2.24 | CLCNKB |
1 | rs2076349 | T | 0.03557 | 0.008734 | C | 0.003449 | 4.19 | 1.48 | 11.85 | LAMB3 | |
8 | rs80304851 | C | 0.04898 | 0.05921 | T | 0.4173 | 0.82 | 0.50 | 1.33 | LOC105375801 | |
11 | rs16937251 | C | 0.03292 | 0.0262 | G | 0.4923 | 1.27 | 0.65 | 2.48 | NAV2 | |
15 | rs61734378 | T | 0.05081 | 0.03275 | C | 0.1238 | 1.58 | 0.88 | 2.85 | AKAP13 | |
16 | rs61737709 | A | 0.06301 | 0.05482 | C | 0.5443 | 1.16 | 0.72 | 1.87 | TMC5 | |
16 | rs9928053 | A | 0.02648 | 0.006579 | G | 0.0125 | 4.11 | 1.24 | 13.64 | ACSM5 |
Boldface values in the table indicate P < 0.05. A1, minor allele; A2, alternative allele; CHR, chromosome; Freq, frequencies of minor allele A1; L95, lower 95% CI; U95, upper 95% CI.
Additionally, association analyses with quantitative traits were performed to observe possible influence of rs2076349 on obesity-related phenotypes. rs2076349 demonstrated strong associations with BMI (P = 0.0002; β = 2.8). The SNV also showed nominal associations with height (P = 0.0054; β = −1.66), body weight (P = 0.01; β = 6.00), and fasting serum leptin (P = 0.032; β = 6.93) (Table 4). Homozygous subjects for the rs2076349*T allele had the highest average BMI (mean 44.28 kg/m2) when compared with subjects who were heterozygous or homozygous for the rs2076349*C allele (Supplementary Table 5). The T allele was overrepresented in obese subjects (MAF 4%), especially in the homozygous form (10 obese vs. 1 control). To further evaluate any confounding effect of age or sex, we added these variables as covariates in regression models and tested their individual effects separately. The association of rs2076349 with BMI maintained strong with P = 0.0001 after adjustment by sex. The association between rs2076349 and BMI was with P = 0.0079 after adjustment for age (Supplementary Table 6). Analysis of association of rs2076349 with waist after adjustment for BMI was significant with P = 0.003 (Supplementary Table 7).
Quantitative trait analyses of rs2076349T with different phenotypes in total final validation data
Phenotype . | NMISS . | BETA* . | P value . |
---|---|---|---|
BMI | 3,174 | 2.803 | 0.000198 |
Leptin | 1,955 | 6.926 | 0.032 |
Waist | 2,726 | 3.756 | 0.05541 |
Weight | 3,145 | 5.986 | 0.01014 |
Phenotype . | NMISS . | BETA* . | P value . |
---|---|---|---|
BMI | 3,174 | 2.803 | 0.000198 |
Leptin | 1,955 | 6.926 | 0.032 |
Waist | 2,726 | 3.756 | 0.05541 |
Weight | 3,145 | 5.986 | 0.01014 |
NMISS, number of nonmissing genotypes.
*The additive effects of allele A1.
rs2076349 located at 209,800,230 bp on chromosome 1 (hg19) is a missense variant (V527M; valine to methionine) in the exon 12 (NM_001017402 and NP_001017402) of the LAMB3 gene. The variant is located in the laminin-type epidermal growth factor–like 5 domain of LAMB3. We used PolyPhen and PredictSNP to predict the possible impact of rs2076349 of protein function (16,17). According to Poplyphen, the SNV was predicted to be possibly damaging, with a score of 0.6. According to PredictSNP, which applies several independent algorithms to predict the pathogenic potential, rs2076349 is with 61–83% likelihood neutral in its effect on protein function.
rs2076349*T homozygous genotypes were confirmed by Sanger sequencing and TaqMan genotyping. The association of rs2076349 with obesity maintained significant with a Bonferroni adjusted P value of 0.002 (Supplementary Table 8).
Correlation of LAMB3 mRNA With BMI
LAMB3 mRNA expression was investigated in human subcutaneous WAT. The gene expression showed strong positive correlation with BMI (r = 0.38; P < 0.0001) (Fig. 2A). The expression was also positively correlated with adipose morphology (r = 0.25; P = 0.008) (Fig. 2B), meaning that hypertrophy (few large cells) was associated with high expression and the other way around for hyperplasia (many small cells). These correlations remained unchanged in multiple regression including age and array batch as independent factors. For 21 women, we had information about both LAMB3 genotype and adipose tissue LAMB3 expression levels. LAMB3 expression levels did not differ between genotypes (values not shown).
mRNA expression of LAMB3 in abdominal subcutaneous WAT of 114 Swedish women. Gene expression was compared with BMI (A) and adipose morphology (B) by linear regression analysis. For morphology, positive values indicate hypertrophy and negative hyperplasia.
mRNA expression of LAMB3 in abdominal subcutaneous WAT of 114 Swedish women. Gene expression was compared with BMI (A) and adipose morphology (B) by linear regression analysis. For morphology, positive values indicate hypertrophy and negative hyperplasia.
LAMB3 Inhibits Adipogenesis
To study if LAMB3 affected adipogenesis in vitro, its expression was monitored during differentiation of hMSCs. The expression was increased almost twofold (day 13 of differentiation compared with the beginning, day 0) (Fig. 3A). To evaluate the possible role of LAMB3 in the differentiation of adipocyte cells, the gene was knocked down in vitro using siRNA in the hMSCs 24 h before induction of differentiation (day −1). Expression levels of LAMB3 3 days after the transfection of siRNA (day 2 of differentiation) were decreased by ∼90%. The expression of LAMB3 gradually recovered during the differentiation, but was still inhibited by 60% at day 12 of differentiation (Fig. 3B). Knockdown of LAMB3 also significantly decreased expression of adipocyte-specific gene ADIPOQ at days 7 (P < 0.01) and 9 (P < 0.05) of differentiation (Fig. 3B). Differentiation grade was also negatively affected, as evidenced by 15% decreased accumulation of neutral lipids (Fig. 3C).
Reduced levels of LAMB3 inhibit adipogenesis. A: Expression levels of LAMB3 were determined using quantitative RT-PCR during differentiation of hMSCs to adipocytes in vitro. Results were analyzed using the t test and are presented as relative fold change ± SEM vs. negative control of day 0. B: Expression of LAMB3 was knocked down using siRNA in hMSCs in vitro, followed by induction of differentiation, upon which the expression of LAMB3 and ADIPOQ were monitored. Results were analyzed using the t test and are presented as relative fold change ± SEM vs. negative control (NegC) of each time point during differentiation. C: Expression of LAMB3 was knocked down using siRNA in hMSCs in vitro, and accumulation of neutral lipids, as well as number of cells, was evaluated. Results were analyzed using the t test and are presented as relative fold change ± SEM vs. nontargeting siRNA pool (siNegC). *P < 0.05, **P < 0.01, ***P < 0.001.
Reduced levels of LAMB3 inhibit adipogenesis. A: Expression levels of LAMB3 were determined using quantitative RT-PCR during differentiation of hMSCs to adipocytes in vitro. Results were analyzed using the t test and are presented as relative fold change ± SEM vs. negative control of day 0. B: Expression of LAMB3 was knocked down using siRNA in hMSCs in vitro, followed by induction of differentiation, upon which the expression of LAMB3 and ADIPOQ were monitored. Results were analyzed using the t test and are presented as relative fold change ± SEM vs. negative control (NegC) of each time point during differentiation. C: Expression of LAMB3 was knocked down using siRNA in hMSCs in vitro, and accumulation of neutral lipids, as well as number of cells, was evaluated. Results were analyzed using the t test and are presented as relative fold change ± SEM vs. nontargeting siRNA pool (siNegC). *P < 0.05, **P < 0.01, ***P < 0.001.
Discussion
This study has identified a previously unreported low-frequency coding SNV, rs2076349 (V527M) in the LAMB3 gene, associated with morbid obesity in a Swedish population using WES followed by genotyping. LAMB3 mRNA in WAT correlated with BMI and adipose morphology, and reduction of LAMB3 expression by siRNA inhibited fat cell differentiation, pointing to a specific role of LAMB3 in adipose tissue.
The minor allele T of rs2076349 was low frequency (MAFEur 2%) in the European population but enriched in obese subjects, especially among those homozygous, when compared with control subjects (10 vs. 1). Our bioinformatics analysis predicted that the LAMB3 V527M could possibly be damaging, indicating that it could be a causal marker for obesity. It is located in a domain of the gene for which function is not defined in detail. To our knowledge, rs2076349 has not been reported as a cause of other diseases.
LAMB3 forms one subunit of the Laminin 5 (also called Laminin-332) heterotrimer, which is a component of the extracellular matrix and thought to mediate the organization of cells into tissues (18). Laminin 5 is typically found in the dermoepidermal junction of the skin (19). Mutations in each of the three subunits of Laminin 5 are present in patients with junctional epidermolysis bullosa, an autosomal-recessive disease (20,21). Moreover, Laminin 5 induces osteogenic gene expression and promotes osteogenic differentiation of MSCs (18,22). Laminin 5 is also important for β-cell function (23) and has a role in various types of cancer (24–26). Whether Laminin 5 or LAMB3 has functions in organs of importance for the control of obesity such as the brain or WAT has not been described before.
Adipocyte number and size are the major determinants for fat mass, and obese subjects have a higher absolute production of new fat cells and a higher number of fat cells in total (27). About 10% of adipocytes in adult humans are renewed every year, suggesting that adipogenesis might contribute quite substantially to body fat mass (13,27). In agreement with this, GWAS support a role for adipogenesis in obesity susceptibility (2). Our observation that WAT LAMB3 mRNA levels correlate with BMI and that knockdown of LAMB3 in vitro inhibits of adipogenesis give additional support to the notion that production of fat cells is important for various aspects of development of obesity
The hypothesis underlying our study is that more severe forms of obesity might be due to low-frequency gene variants with a larger impact on the phenotype. A minor portion of morbidly obese individuals has been shown to be due to genetic variants with high penetrance (7,28). The association of rs2076349 in LAMB3 with obesity provides another supporting evidence for the notion that low-frequency variants contribute to morbid obesity. One explanation why rs2076349 has not been reported before as a cause of obesity is that this SNV has not been assayed in commonly used genome-wide genotyping platforms. Another explanation is that most GWAS have focused on BMI or common obesity, rather than morbid obesity. However, a noncoding SNP, rs12130212, located distal to the 3′ end of LAMB3, has been reported to be associated with extreme obesity in a Danish cohort (29).
One important limitation of the current study is relatively low sequencing depth and thereby low power to detect rare (MAF <1%) genetic variants causing morbid obesity in the general population. In contrast, analysis of association between such rare SNVs and obesity requires a very large sample size to achieve significance, which is beyond reach using available cohorts of morbidly obese cases. Another limitation is that although the association between SNV rs2076349 and obesity was prominent, it did not reach the level of genome-wide statistical significance. However, it is likely that the present level of association, in combination with functional data for the gene, represents a biological relevant link between LAMB3 and obesity.
In conclusion, a low-frequency SNV rs2076349 (V527M) in the LAMB3 gene is strongly associated with morbid obesity. This gene may be involved in the development of excess body fat at least in part by controlling adipogenesis and the production of new fat cells.
Article Information
Acknowledgments. The authors thank the mutation analysis facility at the Karolinska Institutet and the SNP&SEQ Technology Platform at Uppsala University for performing genotyping and technical assistance. The authors also thank the Science for Life Laboratory, the National Genomics Infrastructure, and Uppmax for providing assistance in massive parallel sequencing and computational infrastructure; Anna Grauers (Clinical Science, Intervention and Technology, Karolinska Institutet) for scoliosis sample collection and characterization; Elisabeth Dungner (Lipid Laboratory, Department of Medicine, Huddinge, Karolinska Institutet) for DNA preparation; and Christel Björk (Lipid Laboratory, Department of Medicine, Huddinge, Karolinska Institutet) for the reverse-transfection protocol of hMSCs.
Funding. The project was supported by grants from Stockholm County (ALF), the Swedish Research Council, Novo Nordisk Foundation, the Diabetes Strategic Research Program at Karolinska Institutet, Center for Innovative Medicine, and the Erling-Persson Family Foundation.
Duality of Interest. No potential conflicts of interest relevant to this article were reported.
Author Contributions. H.J. provided study design, data analysis, and wrote the first version of the manuscript. A.K. provided research data. E.N. and A.T. performed sample collection. P.G. provided the scoliosis sample collection. J.K., P.A., and I.D. provided study design. All authors contributed to revision of the first version of the manuscript and approved the final version. I.D. is the guarantor of this work and, as such, had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.