Skip to content

Using population-level sequencing data in SLiM for neutral evolution simulations #606

@B-Demirbas

Description

@B-Demirbas

Hi! Following a question I posted on slim-discuss (which was answered), Ben Haller suggested I raise an issue on the GitHub on the specific problem I'm facing.

My initial goal for a project is to model neutral evolution in a population of ~500 individuals over N generations using standard Wright-Fisher simulations. The project is supposed to complement experimental work, and hence I want to use real sequencing data in the form of .vcf files containing information on SNPs.

The problem is that I only have population sequence data (for a population of ~500 individuals). It does contain information on average allele frequencies, but there is no individual-level data. As far as I can tell from the SLiM manual there is no straightforward way to load in and use population-level .vcf files directly in SLiM (it seems it always assumes it is individual-level data), is this correct?

As a workaround I now use the allelic depth data to estimate the frequency of individuals with a given SNP in a population. I use that to generate artificial individuals in a new .vcf (in Python) which I then load into SLiM. This seems to work, but it would be more convenient if there was a way to do this directly in SLiM. I attached an (abridged) example of a .vcf file that I would like to use.

example VCF population single chromosome.txt

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions