snipar.scripts.ibd module

Infers identity-by-descent (IBD) segments shared between full-siblings.

Minimally: the script requires observed sibling genotypes in either .bed or .bgen format, along with information on the relations present in the dataset, which can be provided using a pedigree file or the results of KING kinship inference along with age and sex information (from which a pedigree can be constructed).

Args:

‘-h’, ‘–help’, default===SUPPRESS==: show this help message and exit
‘–bgen’str: Address of the phased genotypes in .bgen format. If there is a @ in the address, @ is replaced by the chromosome numbers in the range of chr_range for each chromosome (chr_range is an optional parameters for this script).
‘–bed’str: Address of the unphased genotypes in .bed format. If there is a @ in the address, @ is replaced by the chromosome numbers in the range of chr_range for each chromosome (chr_range is an optional parameters for this script).
‘–chr_range’: number of the chromosomes to be imputed. Should be a series of ranges with x-y format or integers.
‘–king’str: Address of the king file
‘–agesex’str: Address of file with age and sex information
‘–pedigree’str: Address of pedigree file
‘–map’str: None
‘–out’str, default=ibd: The IBD segments will output to this path, one file for each chromosome. If the path contains ‘#’, the ‘#’ will be replaced with the chromosome number. Otherwise, the segments will be output to the given path with file names chr_1.ibd.segments.gz, chr_2.segments.gz, etc.
‘–p_error’float: Probability of genotyping error. By default, this is estimated from genotyped parent-offspring pairs.
‘–min_length’float, default=0.01: Smooth segments with length less than min_length (cM)
‘–threads’int: Number of threads to use for IBD inference. Uses all available by default.
‘–min_maf’float, default=0.01: Minimum minor allele frequency
‘–max_missing’float, default=5: Ignore SNPs with greater percent missing calls than max_missing (default 5)
‘–max_error’float, default=0.01: Maximum per-SNP genotyping error probability
‘–ibdmatrix’: Output a matrix of SNP IBD states (in addition to segments file)
‘–ld_out’: Output LD scores of SNPs (used internally for weighting).
‘–chrom’int: The chromosome of the input .bgen file. Helpful if inputting a single .bgen file without chromosome information.
‘–batches’int, default=1: Number of batches to split the data (by sibpair) into for IBD inference. Useful for large datasets.

Results:

IBD segments: For each chromosome, a gzipped text file containing the IBD segments for the siblings is output.

snipar.scripts.ibd.main(args)[source]

“Calling this function with args is equivalent to running this script from commandline with the same arguments. Args:

args: list
list of all the desired options and arguments. The possible values are all the values you can pass this script from commandline.