-
Notifications
You must be signed in to change notification settings - Fork 1
Vizualizer
The package is designed for vizualizing the data. This is documentation for vizualizer package. Functions for general use:
Functions for advanced use:
Detailed description of functions are given below.
Plots ENc curve from given fasta file
-
handle: Handle to the file, or the filename as a string -
genetic_table_num: Genetic table number for codon table -
min_len_threshold: Minimum length of nucleotide sequence to be considered as gene -
organism_name: Name of organism (optional) -
save_image: Options for saving the image (optional) -
folder_path: Folder path where image should be saved (optional) -
gene_analysis: Option if gene analysis (True) or genome analysis (False) (optional)
In the plot, the super title is name of the organism or only name if organism_name is not given. The title of the
curve is number of genes or genome. On the right side, a color bar is provided for better understanding.
See example.
Plots A3/AT3 values against G3/GC3 values from given fasta file
-
handle: Handle to the file, or the filename as a string -
min_len_threshold: Minimum length of nucleotide sequence to be considered as gene -
organism_name: Name of organism (optional) -
save_image: Options for saving the image (optional) -
folder_path: Folder path where image should be saved (optional) -
gene_analysis: Option if gene analysis (True) or genome analysis (False) (optional)
In the plot, the super title is name of the organism or only name if organism_name is not given. The title of the
curve is number of genes or genome. Each quadrant has the count for the occurrence of purines and pyrimidines. Again the
counts can be seen in the lower left side. See example.
Plots neutrality plot from given fasta file
-
handle: Handle to the file, or the filename as a string -
min_len_threshold: Minimum length of nucleotide sequence to be considered as gene -
organism_name: Name of organism (optional) -
save_image: Options for saving the image (optional) -
folder_path: Folder path where image should be saved (optional) -
gene_analysis: Option if gene analysis (True) or genome analysis (False) (optional)
In the plot, the super title is name of the organism or only name if organism_name is not given. The title of the
curve is number of genes or genome and the R^2 value. On the right side, a color bar is provided for better
understanding. See example.
Calculates Theoretical ENC value based on Wright (1989) (click here).
-
x: GC3 value
Returns the ENc value.
Plots ENc value against GC3 values
-
enc_val_lst: Values of ENc (in range 0 to 1) -
gc_val_lst: Values of GC3 (in range 0 to 1) -
organism_name: Name of organism (optional) -
save_image: Options for saving the image (optional) -
folder_path: Folder path where image should be saved (optional) -
gene_analysis: Option if gene analysis (True) or genome analysis (False) (optional)
Plots A3/AT3 values against G3/GC3 values
-
gc_val_lst: Values of GC3 (in range 0 to 1) -
at_val_lst: Values of AT3 (in range 0 to 1) -
g3_val_lst: Values of G3 (in range 0 to 1) -
a3_val_lst: Values of A3 (in range 0 to 1) -
organism_name: Name of organism (optional) -
save_image: Options for saving the image (optional) -
folder_path: Folder path where image should be saved (optional) -
gene_analysis: Option if gene analysis (True) or genome analysis (False) (optional)
Plots the neutrality plot
-
gc12_lst: The list containing values of G or C at 1 or 2 positions -
gc3_lst: The list containing values of G or C ar 3 positions -
organism_name: Name of organism (optional) -
save_image: Options for saving the image (optional) -
folder_path: Folder path where image should be saved (optional) -
gene_analysis: Option if gene analysis (True) or genome analysis (False) (optional)




The color-bar depends on the span of GC3 or GC12. As can be seen below, the color-bar for agnetis is based on the span of GC3 whereas for argenteus it is based on span of GC12.



© 2023 Souradipto Choudhuri
© 2023 Souradipto Choudhuri