Back to Multiple platform build/check report for BioC 3.21: simplified long |
|
This page was generated on 2024-11-22 11:34 -0500 (Fri, 22 Nov 2024).
Hostname | OS | Arch (*) | R version | Installed pkgs |
---|---|---|---|---|
nebbiolo1 | Linux (Ubuntu 24.04.1 LTS) | x86_64 | R Under development (unstable) (2024-10-21 r87258) -- "Unsuffered Consequences" | 4742 |
palomino7 | Windows Server 2022 Datacenter | x64 | R Under development (unstable) (2024-10-26 r87273 ucrt) -- "Unsuffered Consequences" | 4456 |
Click on any hostname to see more info about the system (e.g. compilers) (*) as reported by 'uname -p', except on Windows and Mac OS X |
Package 1369/2270 | Hostname | OS / Arch | INSTALL | BUILD | CHECK | BUILD BIN | ||||||||
MungeSumstats 1.15.2 (landing page) Alan Murphy
| nebbiolo1 | Linux (Ubuntu 24.04.1 LTS) / x86_64 | OK | OK | OK | |||||||||
palomino7 | Windows Server 2022 Datacenter / x64 | OK | OK | OK | OK | |||||||||
To the developers/maintainers of the MungeSumstats package: - Allow up to 24 hours (and sometimes 48 hours) for your latest push to git@git.bioconductor.org:packages/MungeSumstats.git to reflect on this report. See Troubleshooting Build Report for more information. - Use the following Renviron settings to reproduce errors and warnings. - If 'R CMD check' started to fail recently on the Linux builder(s) over a missing dependency, add the missing dependency to 'Suggests:' in your DESCRIPTION file. See Renviron.bioc for more information. |
Package: MungeSumstats |
Version: 1.15.2 |
Command: E:\biocbuild\bbs-3.21-bioc\R\bin\R.exe CMD check --no-multiarch --install=check:MungeSumstats.install-out.txt --library=E:\biocbuild\bbs-3.21-bioc\R\library --no-vignettes --timings MungeSumstats_1.15.2.tar.gz |
StartedAt: 2024-11-22 03:32:12 -0500 (Fri, 22 Nov 2024) |
EndedAt: 2024-11-22 03:50:44 -0500 (Fri, 22 Nov 2024) |
EllapsedTime: 1112.3 seconds |
RetCode: 0 |
Status: OK |
CheckDir: MungeSumstats.Rcheck |
Warnings: 0 |
############################################################################## ############################################################################## ### ### Running command: ### ### E:\biocbuild\bbs-3.21-bioc\R\bin\R.exe CMD check --no-multiarch --install=check:MungeSumstats.install-out.txt --library=E:\biocbuild\bbs-3.21-bioc\R\library --no-vignettes --timings MungeSumstats_1.15.2.tar.gz ### ############################################################################## ############################################################################## * using log directory 'E:/biocbuild/bbs-3.21-bioc/meat/MungeSumstats.Rcheck' * using R Under development (unstable) (2024-10-26 r87273 ucrt) * using platform: x86_64-w64-mingw32 * R was compiled by gcc.exe (GCC) 13.2.0 GNU Fortran (GCC) 13.2.0 * running under: Windows Server 2022 x64 (build 20348) * using session charset: UTF-8 * using option '--no-vignettes' * checking for file 'MungeSumstats/DESCRIPTION' ... OK * checking extension type ... Package * this is package 'MungeSumstats' version '1.15.2' * package encoding: UTF-8 * checking package namespace information ... OK * checking package dependencies ... OK * checking if this is a source package ... OK * checking if there is a namespace ... OK * checking for hidden files and directories ... NOTE Found the following hidden files and directories: .BBSoptions These were most likely included in error. See section 'Package structure' in the 'Writing R Extensions' manual. * checking for portable file names ... OK * checking whether package 'MungeSumstats' can be installed ... OK * checking installed package size ... OK * checking package directory ... OK * checking 'build' directory ... OK * checking DESCRIPTION meta-information ... OK * checking top-level files ... OK * checking for left-over files ... OK * checking index information ... OK * checking package subdirectories ... OK * checking code files for non-ASCII characters ... OK * checking R files for syntax errors ... OK * checking whether the package can be loaded ... OK * checking whether the package can be loaded with stated dependencies ... OK * checking whether the package can be unloaded cleanly ... OK * checking whether the namespace can be loaded with stated dependencies ... OK * checking whether the namespace can be unloaded cleanly ... OK * checking whether startup messages can be suppressed ... OK * checking dependencies in R code ... OK * checking S3 generic/method consistency ... OK * checking replacement functions ... OK * checking foreign function calls ... OK * checking R code for possible problems ... OK * checking Rd files ... NOTE checkRd: (-1) check_no_chr_bp.Rd:56-57: Lost braces 56 | \item \code{sumstats_dt}{ | ^ checkRd: (-1) check_no_chr_bp.Rd:58-59: Lost braces 58 | \item \code{rsids}{ | ^ checkRd: (-1) check_no_chr_bp.Rd:60-61: Lost braces 60 | \item \code{log_files}{ | ^ checkRd: (-1) check_on_ref_genome.Rd:65-66: Lost braces 65 | \item \code{sumstats_dt}{ | ^ checkRd: (-1) check_on_ref_genome.Rd:67-68: Lost braces 67 | \item \code{rsids}{ | ^ checkRd: (-1) check_on_ref_genome.Rd:69-70: Lost braces 69 | \item \code{log_files}{ | ^ checkRd: (-1) compute_nsize.Rd:32: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_nsize.Rd:33-36: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_nsize.Rd:37-38: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_nsize.Rd:39-40: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_nsize.Rd:41-42: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_nsize.Rd:43-44: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size.Rd:21-28: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size.Rd:30-34: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size.Rd:36-40: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size.Rd:42-46: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size.Rd:48-52: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size_n.Rd:16-23: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size_n.Rd:25-29: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size_n.Rd:31-35: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size_n.Rd:37-41: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size_n.Rd:43-47: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size_neff.Rd:21-28: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size_neff.Rd:30-34: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size_neff.Rd:36-40: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size_neff.Rd:42-46: Lost braces in \itemize; meant \describe ? checkRd: (-1) compute_sample_size_neff.Rd:48-52: Lost braces in \itemize; meant \describe ? checkRd: (-1) read_sumstats.Rd:29: Lost braces in \itemize; meant \describe ? checkRd: (-1) read_sumstats.Rd:30: Lost braces in \itemize; meant \describe ? checkRd: (-1) read_sumstats.Rd:31-32: Lost braces in \itemize; meant \describe ? checkRd: (-1) read_vcf.Rd:64: Lost braces in \itemize; meant \describe ? checkRd: (-1) read_vcf.Rd:65: Lost braces in \itemize; meant \describe ? checkRd: (-1) read_vcf.Rd:66-67: Lost braces in \itemize; meant \describe ? checkRd: (-1) read_vcf_parallel.Rd:40: Lost braces in \itemize; meant \describe ? checkRd: (-1) read_vcf_parallel.Rd:41: Lost braces in \itemize; meant \describe ? checkRd: (-1) read_vcf_parallel.Rd:42-43: Lost braces in \itemize; meant \describe ? checkRd: (-1) select_vcf_fields.Rd:27: Lost braces in \itemize; meant \describe ? checkRd: (-1) select_vcf_fields.Rd:28: Lost braces in \itemize; meant \describe ? checkRd: (-1) select_vcf_fields.Rd:29-30: Lost braces in \itemize; meant \describe ? checkRd: (-1) sort_coords.Rd:19-21: Lost braces in \itemize; meant \describe ? checkRd: (-1) sort_coords.Rd:22-24: Lost braces in \itemize; meant \describe ? * checking Rd metadata ... OK * checking Rd cross-references ... OK * checking for missing documentation entries ... OK * checking for code/documentation mismatches ... OK * checking Rd \usage sections ... OK * checking Rd contents ... OK * checking for unstated dependencies in examples ... OK * checking contents of 'data' directory ... OK * checking data for non-ASCII characters ... OK * checking data for ASCII and uncompressed saves ... OK * checking R/sysdata.rda ... OK * checking files in 'vignettes' ... OK * checking examples ... OK Examples with CPU (user + system) or elapsed time > 5s user system elapsed get_genome_builds 154.61 6.46 189.84 format_sumstats 140.68 5.22 153.64 liftover 4.13 0.05 6.03 * checking for unstated dependencies in 'tests' ... OK * checking tests ... Running 'testthat.R' OK * checking for unstated dependencies in vignettes ... OK * checking package vignettes ... OK * checking running R code from vignettes ... SKIPPED * checking re-building of vignette outputs ... SKIPPED * checking PDF version of manual ... OK * DONE Status: 2 NOTEs See 'E:/biocbuild/bbs-3.21-bioc/meat/MungeSumstats.Rcheck/00check.log' for details.
MungeSumstats.Rcheck/00install.out
############################################################################## ############################################################################## ### ### Running command: ### ### E:\biocbuild\bbs-3.21-bioc\R\bin\R.exe CMD INSTALL MungeSumstats ### ############################################################################## ############################################################################## * installing to library 'E:/biocbuild/bbs-3.21-bioc/R/library' * installing *source* package 'MungeSumstats' ... ** using staged installation ** R ** data ** inst ** byte-compile and prepare package for lazy loading ** help *** installing help indices ** building package indices ** installing vignettes ** testing if installed package can be loaded from temporary location ** testing if installed package can be loaded from final location ** testing if installed package keeps a record of temporary installation path * DONE (MungeSumstats)
MungeSumstats.Rcheck/tests/testthat.Rout
R Under development (unstable) (2024-10-26 r87273 ucrt) -- "Unsuffered Consequences" Copyright (C) 2024 The R Foundation for Statistical Computing Platform: x86_64-w64-mingw32/x64 R is free software and comes with ABSOLUTELY NO WARRANTY. You are welcome to redistribute it under certain conditions. Type 'license()' or 'licence()' for distribution details. R is a collaborative project with many contributors. Type 'contributors()' for more information and 'citation()' on how to cite R or R packages in publications. Type 'demo()' for some demos, 'help()' for on-line help, or 'help.start()' for an HTML browser interface to help. Type 'q()' to quit R. > library(testthat) > library(MungeSumstats) > > test_check("MungeSumstats") Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de401071b71.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4039db4faf Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A0 A1 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A0 A1 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de401071b71.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.09 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40c6f1fc6.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4039db4faf Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40c6f1fc6.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.064 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Sorting coordinates with "data.table". ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40750e7d11.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Found 1 Indels. These will be removed from the sumstats. WARNING If you want to keep Indels, set the drop_indel param to FALSE & rerun MungeSumstats::format_sumstats() Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/indel.tsv.gz Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Sorting coordinates with "data.table". Filtering SNPs based on INFO score. 46 SNPs are below the INFO threshold of 0.9 and will be removed. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/info_filter.tsv.gz INFO_filter==0. Skipping INFO score filtering step. Filtering SNPs based on INFO score. All rows have INFO>=0.9 Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Sorting coordinates with "data.table". 3 p-values are >1 which LDSC/MAGMA may not be able to handle. These will be converted to 1. 5 p-values are <0 which LDSC/MAGMA may not be able to handle. These will be converted to 0. Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Sorting coordinates with "data.table". 8 p-values are <=5e-324 which LDSC/MAGMA may not be able to handle. These will be converted to 0. Reading header. Tabular format detected. Reading header. Tabular format detected. Reading header. Tabular format detected. Reading header. VCF format detected.This will be converted to a standardised table format. Importing tabular file: E:/biocbuild/bbs-3.21-bioc/R/library/MungeSumstats/extdata/eduAttainOkbay.txt Checking for empty columns. Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Computing Z-score from P using formula: `sign(BETA)*sqrt(stats::qchisq(P,1,lower=FALSE)` Standardising column headers. First line of summary statistics file: SNP CHR BP A1 A2 FRQ BETA SE P Z newZ Computing Z-score from BETA ans SE using formula: `BETA/SE` Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4054a79a0.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4029f34c76 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName EAF Beta SE Pval CHR_BP_A2_A1 Standardising column headers. First line of summary statistics file: MarkerName EAF Beta SE Pval CHR_BP_A2_A1 Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Summary statistics file does not have obvious CHR/BP columns. Checking to see if they are joined in another column. Column CHR_BP_A2_A1 has been separated into the columns CHR, BP, A2, A1 If this is the incorrect format for the column, update the column name to the correct format e.g.`CHR:BP:A2:A1` and format_sumstats(). Standardising column headers. First line of summary statistics file: SNP FRQ BETA SE P CHR BP A2 A1 Checking for incorrect base-pair positions Reordering so first three column headers are SNP, CHR and BP in this order. Reordering so the fourth and fifth columns are A1 and A2. Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4054a79a0.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.115 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de406b285088.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4029f34c76 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de406b285088.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.057 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4050865a04.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40623a26f2 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName EAF Beta SE Pval CHR_BP_A2_A1 Standardising column headers. First line of summary statistics file: MarkerName EAF Beta SE Pval CHR_BP_A2_A1 Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Summary statistics file does not have obvious CHR/BP columns. Checking to see if they are joined in another column. Column CHR_BP_A2_A1 has been separated into the columns CHR, BP, A2, A1 If this is the incorrect format for the column, update the column name to the correct format e.g.`CHR:BP:A2:A1` and format_sumstats(). Standardising column headers. First line of summary statistics file: SNP FRQ BETA SE P CHR BP A2 A1 Checking for incorrect base-pair positions Reordering so first three column headers are SNP, CHR and BP in this order. Reordering so the fourth and fifth columns are A1 and A2. Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4050865a04.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.113 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40748b5af2.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40623a26f2 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40748b5af2.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.064 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402eaf1a73.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404a555ab1 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS EAF Beta SE Pval alleles allele Standardising column headers. First line of summary statistics file: MarkerName CHR POS EAF Beta SE Pval alleles allele Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Warning: Multiple columns in the sumstats file seem to relate to alleles A1>A2. The column ALLELES will be kept whereas the column(s) ALLELE will be removed. If this is not the correct column to keep, please remove all incorrect columns from those listed here before running `format_sumstats()`. Column ALLELES has been separated into the columns A1, A2 Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Reordering so first three column headers are SNP, CHR and BP in this order. Reordering so the fourth and fifth columns are A1 and A2. Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402eaf1a73.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.055 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4057e06499.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404a555ab1 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4057e06499.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.057 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40285b4297.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de403c84593 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName A1 A2 EAF Beta SE Pval CHR_BP Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName A1 A2 EAF Beta SE Pval CHR_BP Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Summary statistics file does not have obvious CHR/BP columns. Checking to see if they are joined in another column. Column CHR_BP has been separated into the columns CHR, BP Standardising column headers. First line of summary statistics file: SNP A1 A2 FRQ BETA SE P CHR BP Checking for incorrect base-pair positions Reordering so first three column headers are SNP, CHR and BP in this order. Reordering so the fourth and fifth columns are A1 and A2. Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40285b4297.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.111 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40427ae9d.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de403c84593 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40427ae9d.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.062 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40757f4e8c.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40161a2e47 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName A1 A2 EAF Beta SE Pval CHR_BP CHR_BP_2 Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName A1 A2 EAF Beta SE Pval CHR_BP CHR_BP_2 Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Summary statistics file does not have obvious CHR/BP columns. Checking to see if they are joined in another column. Warning: Multiple columns in the sumstats file seem to relate to Chromosome:Base Pair position. The column CHR_BP_2 will be kept whereas the column(s) CHR_BP will be removed. If this is not the correct column to keep, please remove all incorrect columns from those listed here before running `format_sumstats()`. Column CHR_BP_2 has been separated into the columns CHR, BP Standardising column headers. First line of summary statistics file: SNP A1 A2 FRQ BETA SE P CHR BP Checking for incorrect base-pair positions Reordering so first three column headers are SNP, CHR and BP in this order. Reordering so the fourth and fifth columns are A1 and A2. Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40757f4e8c.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.12 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407b2e13a7.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40161a2e47 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407b2e13a7.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.061 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40687a643d.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4076d668c9 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40687a643d.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.063 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40649816ec.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407fb4724c Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40649816ec.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.064 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Setting sorted=FALSE (required when formatted=FALSE). Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402ad8fc.tsv.gz Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Assigning N=1000 for all SNPs. N already exists within sumstats_dt. [1] "Testing: compute_n='ldsc'" Computing effective sample size using the LDSC method: Neff = (N_CAS+N_CON) * (N_CAS/(N_CAS+N_CON)) / mean((N_CAS/(N_CAS+N_CON))[(N_CAS+N_CON)==max(N_CAS+N_CON)])) [1] "Testing: compute_n='giant'" Computing effective sample size using the GIANT method: Neff = 2 / (1/N_CAS + 1/N_CON) [1] "Testing: compute_n='metal'" Computing effective sample size using the METAL method: Neff = 4 / (1/N_CAS + 1/N_CON) [1] "Testing: compute_n='sum'" Computing sample size using the sum method: N = N_CAS + N_CON Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402e475e35.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de405a15633 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402e475e35.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.055 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de406b1b3d5d.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Saving output messages to: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/file1de406b1b3d5d_log_msg.txt Any runtime errors will be saved to: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/file1de406b1b3d5d_log_output.txt Messages will not be printed to terminal. Returning path to saved data. ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40bcb7b21.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de401f511176 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40bcb7b21.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.06 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4072b35a8e.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407162111d Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 186 rows - 93 unique variants - 140 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. 93 sumstat rows are duplicated. These duplicates will be removed. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4072b35a8e.tsv.gz Summary statistics report: - 93 rows (50% of original 186 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.06 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40446c6980.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407162111d Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40446c6980.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.08 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404dfb97.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407162111d Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 94 rows - 94 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicate SNPs from SNP ID. Checking for SNPs with duplicated base-pair positions. 1 base-pair positions are duplicated in the sumstats file. These duplicates will be removed. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. Checking for bi-allelic SNPs. Loading SNPlocs data. Loading reference genome data. Preprocessing RSIDs. Validating RSIDs of 93 SNPs using BSgenome::snpsById... BSgenome::snpsById done in 89 seconds. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404dfb97.tsv.gz Summary statistics report: - 93 rows (98.9% of original 94 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 1.559 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de403d0e2709.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4048f64896 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Filtering effect columns, ensuring none equal 0. 5 SNPs have effect values = 0 and will be removed Ensuring all SNPs have N<5 std dev above mean. 44 SNPs (50%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de403d0e2709.tsv.gz Summary statistics report: - 88 rows (94.6% of original 93 rows) - 88 unique variants - 65 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.064 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4071103035.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4036437dc2 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval FRQ Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval FRQ Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs based on FRQ. 38 SNPs are below the FRQ threshold of 0.9 and will be removed. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/frq_filter.tsv.gz Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 55 SNPs (100%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4071103035.tsv.gz Summary statistics report: - 55 rows (59.1% of original 93 rows) - 55 unique variants - 41 genome-wide significant variants (P<5e-8) - 16 chromosomes Done munging in 0.061 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 EAF BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 2: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 3: rs1008078 1 91189731 T C 0.37310 -0.016 0.003 6.005e-10 4: rs61787263 1 98618714 T C 0.76120 0.016 0.003 5.391e-08 FRQ <num> 1: 1.863269 2: 1.169733 3: 1.401423 4: 1.873332 Returning path to saved data. ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404a475e02.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4036437dc2 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval FRQ Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval FRQ Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs based on FRQ. 38 SNPs are below the FRQ threshold of 0.9 and will be removed. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/frq_filter.tsv.gz Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 55 SNPs (100%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=FALSE, the FRQ column will be renamed MAJOR_ALLELE_FRQ to differentiate the values from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404a475e02.tsv.gz Summary statistics report: - 55 rows (59.1% of original 93 rows) - 55 unique variants - 41 genome-wide significant variants (P<5e-8) - 16 chromosomes Done munging in 0.056 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 EAF BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 2: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 3: rs1008078 1 91189731 T C 0.37310 -0.016 0.003 6.005e-10 4: rs61787263 1 98618714 T C 0.76120 0.016 0.003 5.391e-08 MAJOR_ALLELE_FRQ <num> 1: 1.863269 2: 1.169733 3: 1.401423 4: 1.873332 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4041346c82.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4014b2238f Checking for empty columns. Infer Effect Column First line of summary statistics file: SNP CHR BP A1 A2 FRQ BETA SE P Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: SNP CHR BP A1 A2 FRQ BETA SE P Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4041346c82.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.059 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40fba26f2.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4069c6451b Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval INFO Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval INFO Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. Filtering SNPs based on INFO score. 38 SNPs are below the INFO threshold of 0.9 and will be removed. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/info_filter.tsv.gz Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 28 SNPs (50.9%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40fba26f2.tsv.gz Summary statistics report: - 55 rows (59.1% of original 93 rows) - 55 unique variants - 41 genome-wide significant variants (P<5e-8) - 16 chromosomes Done munging in 0.065 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 2: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 3: rs1008078 1 91189731 T C 0.37310 -0.016 0.003 6.005e-10 4: rs61787263 1 98618714 T C 0.76120 0.016 0.003 5.391e-08 INFO <num> 1: 1.863269 2: 1.169733 3: 1.401423 4: 1.873332 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4027ff603e.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40306f733 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4027ff603e.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.063 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4030343b47.tsv.gz Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4030343b47.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.07 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. [1] "E:\\biocbuild\\bbs-3.21-bioc\\tmpdir\\RtmpEnXngU/data/file1\\file1de4061a310a.tsv.gz" [1] "E:\\biocbuild\\bbs-3.21-bioc\\tmpdir\\RtmpEnXngU/data/file2\\file1de407a6c7a4.tsv.gz" [1] "E:\\biocbuild\\bbs-3.21-bioc\\tmpdir\\RtmpEnXngU/data/file3\\file1de40c151e1.tsv.gz" [1] "E:\\biocbuild\\bbs-3.21-bioc\\tmpdir\\RtmpEnXngU/data/file4\\file1de4040d265f3.tsv.gz" [1] "E:\\biocbuild\\bbs-3.21-bioc\\tmpdir\\RtmpEnXngU/data/file5\\file1de4047e3887.tsv.gz" [1] "E:\\biocbuild\\bbs-3.21-bioc\\tmpdir\\RtmpEnXngU/data/file6\\file1de4043ef47ad.tsv.gz" [1] "E:\\biocbuild\\bbs-3.21-bioc\\tmpdir\\RtmpEnXngU/data/file7\\file1de4050bc1626.tsv.gz" [1] "E:\\biocbuild\\bbs-3.21-bioc\\tmpdir\\RtmpEnXngU/data/file8\\file1de40bd761ea.tsv.gz" [1] "E:\\biocbuild\\bbs-3.21-bioc\\tmpdir\\RtmpEnXngU/data/file9\\file1de405f6c382a.tsv.gz" [1] "E:\\biocbuild\\bbs-3.21-bioc\\tmpdir\\RtmpEnXngU/data/file10\\file1de408d35a1c.tsv.gz" 10 file(s) found. Parsing info from 10 log file(s). Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4025232d77.tsv.gz sumstats has -log10 P-values; these will be converted to unadjusted p-values in the 'P' column. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval_org LP P Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval_org LP P Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4025232d77.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.063 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE PVAL_ORG <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 LP P <num> <num> 1: 7.746178 1.794e-08 2: 9.627272 2.359e-10 3: 13.424581 3.762e-14 4: 7.745452 1.797e-08 Returning data directly. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de401335270a.tsv.gz sumstats has -log10 P-values; these will be converted to unadjusted p-values in the 'P' column. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval_org LP P Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval_org LP P Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de401335270a.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.065 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE PVAL_ORG <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 LP P <num> <num> 1: -7.746178 1.794e-08 2: -9.627272 2.359e-10 3: -13.424581 3.762e-14 4: -7.745452 1.797e-08 Returning data directly. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de405a35618c.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4033173163 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 92 unique variants - 69 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. WARNING: 1 rows in sumstats file are missing data and will be removed. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 46 SNPs (50%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de405a35618c.tsv.gz Summary statistics report: - 92 rows (98.9% of original 93 rows) - 92 unique variants - 69 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.065 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs10061788 5 87934707 A G 0.2164 0.021 0.004 2.464e-09 2: rs1007883 16 51163406 T C 0.3713 -0.015 0.003 5.326e-08 3: rs1008078 1 91189731 T C 0.3731 -0.016 0.003 6.005e-10 4: rs1043209 14 23373986 A G 0.6026 0.018 0.003 1.816e-11 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4049fc77ea.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4033173163 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4049fc77ea.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.062 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs10061788 5 87934707 A G 0.2164 0.021 0.004 2.464e-09 2: rs1007883 16 51163406 T C 0.3713 -0.015 0.003 5.326e-08 3: rs1008078 1 91189731 T C 0.3731 -0.016 0.003 6.005e-10 4: rs1043209 14 23373986 A G 0.6026 0.018 0.003 1.816e-11 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de401e9397a.tsv.gz Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 21 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Loading SNPlocs data. There is no Chromosome or Base Pair Position column found within the data. It must be inferred from other column information. Loading reference genome data. Preprocessing RSIDs. Validating RSIDs of 1 SNPs using BSgenome::snpsById... BSgenome::snpsById done in 2 seconds. Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de401e9397a.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.122 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs10061788 5 87934707 A G 0.2164 0.021 0.004 2.464e-09 2: rs1007883 16 51163406 T C 0.3713 -0.015 0.003 5.326e-08 3: rs1008078 1 91189731 T C 0.3731 -0.016 0.003 6.005e-10 4: rs1043209 14 23373986 A G 0.6026 0.018 0.003 1.816e-11 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40161157.tsv.gz Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval extra Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval extra Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 21 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Loading SNPlocs data. There is no Chromosome or Base Pair Position column found within the data. It must be inferred from other column information. Loading reference genome data. Preprocessing RSIDs. Validating RSIDs of 1 SNPs using BSgenome::snpsById... BSgenome::snpsById done in 2 seconds. Checking for missing data. WARNING: 93 rows in sumstats file are missing data and will be removed. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4016cf39e9.tsv.gz Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval extra Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval extra Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 21 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Loading SNPlocs data. There is no Chromosome or Base Pair Position column found within the data. It must be inferred from other column information. Loading reference genome data. Preprocessing RSIDs. Validating RSIDs of 1 SNPs using BSgenome::snpsById... BSgenome::snpsById done in 2 seconds. Checking for missing data. WARNING: None of the inputted columns: CHRA APOS To be checked for missing data were found in the sumstats. Sumstats columns: SNP CHR BP A1 A2 FRQ BETA SE P EXTRA This check will not be run. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4016cf39e9.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.115 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P EXTRA <char> <int> <int> <char> <char> <num> <num> <num> <num> <lgcl> 1: rs10061788 5 87934707 A G 0.2164 0.021 0.004 2.464e-09 NA 2: rs1007883 16 51163406 T C 0.3713 -0.015 0.003 5.326e-08 NA 3: rs1008078 1 91189731 T C 0.3731 -0.016 0.003 6.005e-10 NA 4: rs1043209 14 23373986 A G 0.6026 0.018 0.003 1.816e-11 NA Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4025b11d87.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4054b557f1 Checking for empty columns. Infer Effect Column First line of summary statistics file: chromosome rs_id markername position_hg18 Effect_allele Other_allele EAF_HapMapCEU N_SMK Effect_SMK StdErr_SMK P_value_SMK N_NONSMK Effect_NonSMK StdErr_NonSMK P_value_NonSMK Standardising column headers. First line of summary statistics file: chromosome rs_id markername position_hg18 Effect_allele Other_allele EAF_HapMapCEU N_SMK Effect_SMK StdErr_SMK P_value_SMK N_NONSMK Effect_NonSMK StdErr_NonSMK P_value_NonSMK Summary statistics report: - 5 rows - 5 unique variants - 1 chromosomes Checking for multi-GWAS. WARNING: Multiple traits found in sumstats file only one of which can be analysed: SMK, NONSMK Standardising column headers. First line of summary statistics file: CHR SNP MARKERNAME POSITION_HG18 A2 A1 EAF_HAPMAPCEU N EFFECT STDERR P_VALUE N_NONSMK EFFECT_NONSMK STDERR_NONSMK P_VALUE_NONSMK Checking for multiple RSIDs on one row. Checking SNP RSIDs. 1 SNP IDs are not correctly formatted and will be removed. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Summary statistics file does not have obvious CHR/BP columns. Checking to see if they are joined in another column. Column MARKERNAME has been separated into the columns CHR, BP Standardising column headers. First line of summary statistics file: CHR SNP POSITION_HG18 A2 A1 EAF_HAPMAPCEU N BETA SE P N_NONSMK EFFECT_NONSMK STDERR_NONSMK P_VALUE_NONSMK BP Checking for incorrect base-pair positions Reordering so first three column headers are SNP, CHR and BP in this order. Reordering so the fourth and fifth columns are A1 and A2. Checking for missing data. Checking for duplicate columns. Ensuring that the N column is all integers. The sumstats N column is not all integers, this could effect downstream analysis. These will be converted to integers. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4025b11d87.tsv.gz Summary statistics report: - 4 rows (80% of original 5 rows) - 4 unique variants - 0 genome-wide significant variants (P<5e-8) - 1 chromosomes Done munging in 0.184 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 POSITION_HG18 EAF_HAPMAPCEU N <char> <char> <int> <char> <char> <int> <num> <int> 1: rs1000050 chr1 161003087 C T 161003087 0.9000 36257 2: rs1000073 chr1 155522020 G A 155522020 0.3136 36335 3: rs1000075 chr1 94939420 C T 94939420 0.3583 38959 4: rs1000085 chr1 66630503 G C 66630503 0.1667 38761 BETA SE P N_NONSMK EFFECT_NONSMK STDERR_NONSMK P_VALUE_NONSMK <num> <num> <num> <int> <num> <num> <num> 1: 0.0001 0.0109 0.9931 127514 0.0058 0.0059 0.3307 2: 0.0046 0.0083 0.5812 126780 0.0038 0.0045 0.3979 3: -0.0013 0.0082 0.8687 147567 -0.0043 0.0044 0.3259 4: 0.0053 0.0095 0.5746 147259 -0.0034 0.0052 0.5157 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402525436.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407b0853ed Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval N N_fixed Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval N N_fixed Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Ensuring that the N column is all integers. The sumstats N column is not all integers, this could effect downstream analysis. These will be converted to integers. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402525436.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.057 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P N <char> <int> <int> <char> <char> <num> <num> <num> <num> <int> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 5 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 1 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 1 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 7 N_FIXED <int> 1: 5 2: 1 3: 1 4: 7 Returning path to saved data. ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4054c93eb6.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de406bdd531 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval N Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval N Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. The sumstats N column is not all integers, this could effect downstream analysis.These will NOT be converted to integers. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 1 SNPs have N values 5 standard deviations above the mean and will be removed Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/n_large.tsv.gz 47 SNPs (51.1%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4054c93eb6.tsv.gz Summary statistics report: - 92 rows (98.9% of original 93 rows) - 92 unique variants - 69 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.059 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P N <char> <int> <int> <char> <char> <num> <num> <num> <num> <int> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 3 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 5 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 3 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 3 Returning path to saved data. ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de403ae04bb8.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de406bdd531 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval N Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval N Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. The sumstats N column is not all integers, this could effect downstream analysis.These will NOT be converted to integers. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 1 SNPs have N values 5 standard deviations above the mean and will be removed Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/n_large.tsv.gz 47 SNPs (51.1%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de403ae04bb8.tsv.gz Summary statistics report: - 92 rows (98.9% of original 93 rows) - 92 unique variants - 69 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.061 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P N <char> <int> <int> <char> <char> <num> <num> <num> <num> <int> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 3 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 5 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 3 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 3 Returning path to saved data. ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4062557530.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de406bdd531 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval N Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval N Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. The sumstats N column is not all integers, this could effect downstream analysis.These will NOT be converted to integers. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 1 SNPs have N values 5 standard deviations above the mean and will be removed Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/n_large.tsv.gz Removing rows where is.na(N) 0 SNPs have N values that are NA and will be removed. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/n_null.tsv.gz 47 SNPs (51.1%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4062557530.tsv.gz Summary statistics report: - 92 rows (98.9% of original 93 rows) - 92 unique variants - 69 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.066 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P N <char> <int> <int> <char> <char> <num> <num> <num> <num> <int> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 3 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 5 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 3 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 3 Returning path to saved data. ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4043847c92.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404de5572 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 23 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions 1 SNPs have been removed as their BP column is not in the range of 1 to the length of the chromosome Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/bad_bp.tsv.gz Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 2 SNPs are on chromosomes X, Y, MT and will be removed. Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/chr_excl.tsv.gz 45 SNPs (50%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4043847c92.tsv.gz Summary statistics report: - 90 rows (96.8% of original 93 rows) - 90 unique variants - 67 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.058 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407fd06ce0.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404de5572 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407fd06ce0.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.054 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de403532325b Checking for empty columns. Standardising column headers. First line of summary statistics file: SNP CHR BP A1 A2 FRQ BETA SE P Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de405eaf3 Checking for empty columns. Standardising column headers. First line of summary statistics file: SNP CHR BP A1 A2 FRQ BETA SE P Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407d7c1085.vcf.bgz Sorting coordinates with "data.table". Converting summary statistics to GenomicRanges. Converting summary statistics to VRanges. Writing in VCF format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407d7c1085.vcf.bgz Loading required namespace: GenomicFiles Using local VCF. Finding empty VCF columns based on first 10,000 rows. 1 sample detected: GWAS Constructing ScanVcfParam object. Reading VCF file: single-threaded Converting VCF to data.table. Expanding VCF first, so number of rows may increase. Checking for empty columns. Time difference of 0.1 secs VCF data.table contains: 93 rows x 11 columns. Time difference of 0.7 secs No INFO (SI) column detected. Standardising column headers. First line of summary statistics file: ID chr BP end REF ALT SNP FRQ BETA SE P Using local VCF. bgzip-compressing VCF file. Finding empty VCF columns based on first 10,000 rows. Dropping 1 duplicate column(s). 1 sample detected: EBI-a-GCST005647 Constructing ScanVcfParam object. VCF contains: 39,630,630 variant(s) x 1 sample(s) Reading VCF file: single-threaded Converting VCF to data.table. Expanding VCF first, so number of rows may increase. Dropping 1 duplicate column(s). Checking for empty columns. Unlisting 3 columns. Dropped 314 duplicate rows. Time difference of 0.1 secs VCF data.table contains: 101 rows x 11 columns. Time difference of 0.7 secs Renaming ID as SNP. sumstats has -log10 P-values; these will be converted to unadjusted p-values in the 'P' column. No INFO (SI) column detected. Standardising column headers. First line of summary statistics file: SNP chr BP end REF ALT FILTER AF ES LP SE P Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402689516b.vcf.bgz Sorting coordinates with "data.table". Converting summary statistics to GenomicRanges. Converting summary statistics to VRanges. Writing in VCF format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402689516b.vcf.bgz Using local VCF. Finding empty VCF columns based on first 10,000 rows. 1 sample detected: GWAS Constructing ScanVcfParam object. Reading VCF file: single-threaded Converting VCF to data.table. Expanding VCF first, so number of rows may increase. Checking for empty columns. Time difference of 0.1 secs VCF data.table contains: 101 rows x 13 columns. Time difference of 0.4 secs sumstats has -log10 P-values; these will be converted to unadjusted p-values in the 'P' column. No INFO (SI) column detected. Standardising column headers. First line of summary statistics file: ID chr BP end REF SNP END FILTER FRQ BETA LP SE P ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404fb13a35.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Infer Effect Column First line of summary statistics file: SNP P FRQ BETA CHR BP Standardising column headers. First line of summary statistics file: SNP P FRQ BETA CHR BP Summary statistics report: - 5 rows - 5 unique variants - 0 genome-wide significant variants (P<5e-8) - 1 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. 5 SNP IDs contain other information in the same column. These will be separated. Checking for merged allele column. Column SNP_INFO has been separated into the columns A1, A2 Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Coercing BP column to numeric. Reordering so first three column headers are SNP, CHR and BP in this order. Reordering so the fourth and fifth columns are A1 and A2. Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. SE is not present but can be imputed with BETA & P. Set impute_se=TRUE and rerun to do this. Ensuring all SNPs have N<5 std dev above mean. 3 SNPs (60%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404fb13a35.tsv.gz Summary statistics report: - 5 rows (100% of original 5 rows) - 5 unique variants - 0 genome-wide significant variants (P<5e-8) - 1 chromosomes Done munging in 0.049 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 P FRQ BETA <char> <int> <int> <char> <char> <num> <num> <num> 1: rs140052487 1 54353 C A 0.037219838 0.3000548 0.8797957 2: rs558796213 1 54564 G T 0.004382482 0.5848666 0.7068747 3: rs561234294 1 54591 A G 0.070968402 0.3334671 0.7319726 4: rs2462492 1 54676 C T 0.065769040 0.6220120 0.9316344 Returning data directly. ******::NOTE::****** - Log results will be saved to `tempdir()` by default. - This means all log data from the run will be deleted upon ending the R session. - To keep it, change `log_folder` to an actual directory (e.g. log_folder='./'). ******************** Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de406ab87992.tsv.gz Log data to be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU Infer Effect Column First line of summary statistics file: SNP P FRQ BETA CHR BP A1 A2 Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: SNP P FRQ BETA CHR BP A1 A2 Summary statistics report: - 5 rows - 5 unique variants - 0 genome-wide significant variants (P<5e-8) - 1 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Coercing BP column to numeric. Reordering so first three column headers are SNP, CHR and BP in this order. Reordering so the fourth and fifth columns are A1 and A2. Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. SE is not present but can be imputed with BETA & P. Set impute_se=TRUE and rerun to do this. Ensuring all SNPs have N<5 std dev above mean. 3 SNPs (60%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de406ab87992.tsv.gz Summary statistics report: - 5 rows (100% of original 5 rows) - 5 unique variants - 0 genome-wide significant variants (P<5e-8) - 1 chromosomes Done munging in 0.056 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 P FRQ BETA <char> <int> <int> <char> <char> <num> <num> <num> 1: rs140052487 1 54353 C A 0.037219838 0.3000548 0.8797957 2: rs558796213 1 54564 G T 0.004382482 0.5848666 0.7068747 3: rs561234294 1 54591 A G 0.070968402 0.3334671 0.7319726 4: rs2462492 1 54676 C T 0.065769040 0.6220120 0.9316344 Returning data directly. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40580e7ebe.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de404d49f4 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40486e3787.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de403d206c77 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40486e3787.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.052 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407b3e434a.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de403d206c77 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407b3e434a.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.055 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40631b2f7c.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de405ca16f08 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40631b2f7c.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.058 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4034c3369f.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de403aa36294 Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. 47 SNPs (50.5%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4034c3369f.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.064 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402eb73d5c.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40394c280e Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. 5 SNPs have SE values <= 0 and will be removed Ensuring all SNPs have N<5 std dev above mean. 44 SNPs (50%) have FRQ values > 0.5. Conventionally the FRQ column is intended to show the minor/effect allele frequency. The FRQ column was mapped from one of the following from the inputted summary statistics file: FRQ, EAF, FREQUENCY, FRQ_U, F_U, MAF, FREQ, FREQ_TESTED_ALLELE, FRQ_TESTED_ALLELE, FREQ_EFFECT_ALLELE, FRQ_EFFECT_ALLELE, EFFECT_ALLELE_FREQUENCY, EFFECT_ALLELE_FREQ, EFFECT_ALLELE_FRQ, A2FREQ, A2FRQ, ALLELE_FREQUENCY, ALLELE_FREQ, ALLELE_FRQ, AF, MINOR_AF, EFFECT_AF, A2_AF, EFF_AF, ALT_AF, ALTERNATIVE_AF, INC_AF, A_2_AF, TESTED_AF, ALLELEFREQ, ALT_FREQ, EAF_HRC, EFFECTALLELEFREQ, FREQ.B, FREQ_EUROPEAN_1000GENOMES, FREQ_HAPMAP, FREQ_TESTED_ALLELE_IN_HRS, FRQ_U_113154, FRQ_U_31358, FRQ_U_344901, FRQ_U_43456, POOLED_ALT_AF, AF_ALT, AF.ALT, AF-ALT, ALT.AF, ALT-AF, A2.AF, A2-AF, AF.EFF, AF_EFF, ALL_AF As frq_is_maf=TRUE, the FRQ column will not be renamed. If the FRQ values were intended to represent major allele frequency, set frq_is_maf=FALSE to rename the column as MAJOR_ALLELE_FRQ and differentiate it from minor/effect allele frequency. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de402eb73d5c.tsv.gz Summary statistics report: - 88 rows (94.6% of original 93 rows) - 88 unique variants - 65 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.064 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 FRQ BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning path to saved data. Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Support Returning unmapped column names without making them uppercase. Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Support Returning unmapped column names without making them uppercase. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4023332e2c.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4070e3530.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407f094588 Checking for empty columns. Non-standard mapping file detected.Making sure all entries in `Uncorrected` are in upper case. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Pval Summary statistics report: - 93 rows - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions Checking for missing data. Checking for duplicate columns. Checking for duplicated rows. INFO column not available. Skipping INFO score filtering step. Filtering SNPs, ensuring SE>0. Ensuring all SNPs have N<5 std dev above mean. Sorting coordinates with "data.table". Writing in tabular format ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4070e3530.tsv.gz Summary statistics report: - 93 rows (100% of original 93 rows) - 93 unique variants - 70 genome-wide significant variants (P<5e-8) - 20 chromosomes Done munging in 0.061 minutes. Successfully finished preparing sumstats file, preview: Reading header. SNP CHR BP A1 A2 EAF BETA SE P <char> <int> <int> <char> <char> <num> <num> <num> <num> 1: rs301800 1 8490603 T C 0.17910 0.019 0.003 1.794e-08 2: rs11210860 1 43982527 A G 0.36940 0.017 0.003 2.359e-10 3: rs34305371 1 72733610 A G 0.08769 0.035 0.005 3.762e-14 4: rs2568955 1 72762169 T C 0.23690 -0.017 0.003 1.797e-08 Returning data directly. Converting summary statistics to GenomicRanges. Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4063134b6b.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4025666e5b.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de401ef97a19.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de40338031e9.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4056046e20.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4031323bbf.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407f0231e8.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4072d47422.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4071344f7d.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de406ad71215.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de401ca47790.tsv.gz Formatted summary statistics will be saved to ==> E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de4055242fde.tsv.gz Reading header. Tabular format detected. Importing tabular file: E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU\file1de407a5942fe Checking for empty columns. Infer Effect Column First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Allele columns are ambiguous, attempting to infer direction Can't infer allele columns from sumstats Standardising column headers. First line of summary statistics file: MarkerName CHR POS A1 A2 EAF Beta SE Summary statistics report: - 93 rows - 93 unique variants - 20 chromosomes Checking for multi-GWAS. Checking for multiple RSIDs on one row. Checking SNP RSIDs. Checking for merged allele column. Checking A1 is uppercase Checking A2 is uppercase Checking for incorrect base-pair positions [ FAIL 0 | WARN 2 | SKIP 1 | PASS 121 ] ══ Skipped tests (1) ═══════════════════════════════════════════════════════════ • empty test (1): 'test-write_sumstats.R:1:1' [ FAIL 0 | WARN 2 | SKIP 1 | PASS 121 ] Warning message: In if (!all(names(sys.call()) %in% c("", "drop")) && !isS4(x)) warning("named arguments other than 'drop' are discouraged") : closing unused connection 4 (E:\biocbuild\bbs-3.21-bioc\tmpdir\RtmpEnXngU/file1de406b1b3d5d_log_msg.txt) > > proc.time() user system elapsed 375.54 8.87 390.10
MungeSumstats.Rcheck/MungeSumstats-Ex.timings
name | user | system | elapsed | |
compute_nsize | 4.22 | 0.17 | 4.39 | |
download_vcf | 0 | 0 | 0 | |
find_sumstats | 0 | 0 | 0 | |
format_sumstats | 140.68 | 5.22 | 153.64 | |
formatted_example | 2.84 | 0.26 | 3.11 | |
get_genome_builds | 154.61 | 6.46 | 189.84 | |
import_sumstats | 0 | 0 | 0 | |
index_tabular | 3.64 | 0.44 | 4.12 | |
index_vcf | 3.25 | 0.03 | 3.29 | |
infer_effect_column | 4.59 | 0.29 | 4.89 | |
liftover | 4.13 | 0.05 | 6.03 | |
list_sumstats | 0.01 | 0.00 | 0.01 | |
load_snp_loc_data | 0 | 0 | 0 | |
parse_logs | 0 | 0 | 0 | |
read_header | 0.00 | 0.02 | 0.03 | |
read_sumstats | 0.02 | 0.00 | 0.02 | |
read_vcf | 2.02 | 0.04 | 2.08 | |
standardise_header | 3.75 | 0.02 | 3.76 | |
vcf2df | 0.79 | 0.03 | 0.83 | |
write_sumstats | 0.02 | 0.00 | 0.02 | |