BPP is a method that identifies branch points based only on the intron sequence.
BPP predicts the branch point sequence (BPS) by integrating the degenerate BPS motif with characteristics of the polypyrimidine tract (PPT). Specifically, BPP uses a mixture model to infer the BPS motif and a set of weighted octanucleotides to estimate the contribution of the 65 kDa subunit of U2AF (U2AF65).
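To make the idea of scoring a motif concrete, here is a minimal sketch of scanning an intron with a position weight matrix (PWM). It is a plain PWM scan, not BPP's mixture model, and it ignores the PPT term entirely. The matrix `TOY_PWM`, the uniform `BACKGROUND`, and the function names are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical 3-column PWM with toy values (not BPP's trained motif).
# Each column maps a nucleotide to its probability at that motif position.
TOY_PWM = [
    {"A": 0.1, "C": 0.2, "G": 0.1, "T": 0.6},
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.6, "G": 0.1, "T": 0.2},
]
BACKGROUND = 0.25  # assumed uniform background frequency


def score_kmer(kmer, pwm):
    """Sum of per-position log2 odds of the k-mer under the PWM vs. background."""
    return sum(math.log2(col[base] / BACKGROUND) for base, col in zip(kmer, pwm))


def best_site(intron, pwm):
    """Return (offset, kmer, score) for the highest-scoring window in the intron."""
    k = len(pwm)
    candidates = []
    for i in range(len(intron) - k + 1):
        kmer = intron[i:i + k]
        candidates.append((i, kmer, score_kmer(kmer, pwm)))
    return max(candidates, key=lambda item: item[2])


offset, kmer, score = best_site("GGTACGG", TOY_PWM)
```

In this toy input the window `TAC` at offset 2 is the only one whose bases are all favoured by the matrix, so it is returned as the best site.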
A paper describing BPP is under review.
- Python 3.5
Usage: BP_PPT.py -b -p -i -r -h
- -b, --pwm file    STR  The file including PWM of BPS
- -p, --ppt file    STR  The file including the PPT score
- -i, --FASTA file  STR  The file including the fasta sequence
- -r, --report nu   INT  The reported sites; default=1; 0: print all positions
- -h, --help
- BP_PPT.py -b demo/pwmBP_human.txt -p demo/scPPT_human.txt -i demo/example.fa
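When BPP is called from another Python script, the same invocation can be assembled as an argument list. This is only a sketch: the helper `build_command` is hypothetical, and it assumes `BP_PPT.py` sits in the current working directory and is run with the current interpreter. Only the `-b`, `-p`, `-i`, and `-r` flags from the usage line are used.

```python
import sys


def build_command(pwm, ppt, fasta, report=1, script="BP_PPT.py"):
    """Assemble the argument list for one BP_PPT.py run (nothing is executed here)."""
    return [sys.executable, script, "-b", pwm, "-p", ppt, "-i", fasta, "-r", str(report)]


cmd = build_command("demo/pwmBP_human.txt", "demo/scPPT_human.txt", "demo/example.fa")
# To actually run it: import subprocess; subprocess.run(cmd, check=True)
```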
- id: ID of the intron
- bps: the branch point sequence
- bp_pos: the position of the branch point, counted upstream from the 3'SS
- sc_bps: the score of the BPS
- sc_ppt: the score of the PPT
- sc: the score of the BPS and PPT
- zsc_bps: the z-score of the BPS
- zsc_ppt: the z-score of the PPT
- zsc: the z-score of the BPS and PPT
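As a reminder of what a z-score expresses, the sketch below standardizes a list of raw scores as (x - mean) / standard deviation. Which reference distribution BPP uses for its z-scores is not stated here, so standardizing over the scores passed in is an assumption, and `z_scores` is a hypothetical helper rather than part of BPP.

```python
import statistics


def z_scores(scores):
    """Standardize scores as (x - mean) / population standard deviation."""
    mean = statistics.mean(scores)
    sd = statistics.pstdev(scores)
    if sd == 0:
        # All scores are identical, so no score deviates from the mean.
        return [0.0 for _ in scores]
    return [(x - mean) / sd for x in scores]


example = z_scores([1.0, 2.0, 3.0])
```

A positive z-score means the site scores above the mean of the reference set, and a negative one means it scores below.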