This function formats the LTR prediction data.frame
generated by LTRharvest
, LTRdigest
, or LTRpred
to a data.frame
in GFF3
file format.
pred2gff(LTR.data, output = "output.gff", program = "LTRpred")
LTR.data | the LTR prediction |
---|---|
output | filename of the output GFF file. |
program | program used to generate the prediction table, e.g. |
The GFF3 file format is defined by: chromosome; start; end; name; columns.
http://www.ensembl.org/info/website/upload/gff.html
Hajk-Georg Drost
gff.file <- system.file("TAIR10_chr_all_LTRdigestPrediction.gff", package = "LTRpred") tabout.file <- system.file("TAIR10_chr_all-ltrdigest_tabout.csv" ,package = "LTRpred") LTRfile <- read.prediction(gff.file,tabout.file, program = "LTRdigest") #> #> Input: /private/var/folders/yn/mgwl8_b56hz4v2c2vlfxj07w00076j/T/Rtmp9EQNCE/temp_libpath123013836f1/LTRpred/TAIR10_chr_all_LTRdigestPrediction.gff -> Row Number: 3323 #> Remove 'NA' -> New Row Number: 3323 #> (1/8) Filtering for repeat regions has been finished. #> (2/8) Filtering for LTR retrotransposons has been finished. #> (3/8) Filtering for inverted repeats has been finished. #> (4/8) Filtering for LTRs has been finished. #> (5/8) Filtering for target site duplication has been finished. #> (6/8) Filtering for primer binding site has been finished. #> (7/8) Filtering for protein match has been finished. #> (8/8) Filtering for RR tract has been finished. # generate GFF file pred2gff(LTRfile$ltr.retrotransposon, output = "test.gff")