Skip to content
Projects
Groups
Snippets
Help
This project
Loading...
Sign in / Register
Toggle navigation
P
PRESCOTT
Overview
Overview
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Mustafa Tekpinar
PRESCOTT
Commits
cdece0cd
Commit
cdece0cd
authored
Mar 20, 2025
by
Mustafa Tekpinar
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Updates for GnomAD 4.1.
parent
27078d3c
Hide whitespace changes
Inline
Side-by-side
Showing
5 changed files
with
28 additions
and
16 deletions
+28
-16
gnomAD_v4.1.0_MLH1_HUMAN_ENSG00000076242.csv
data/gnomAD_v4.1.0_MLH1_HUMAN_ENSG00000076242.csv
+0
-0
input-preparation.rst
docs/input-preparation.rst
+0
-2
introduction.rst
docs/introduction.rst
+24
-13
example-prescott-script.sh
examples/example-prescott-script.sh
+4
-1
prescott-documentation.pdf
prescott-documentation.pdf
+0
-0
No files found.
data/gnomAD_v4.1.0_MLH1_HUMAN_ENSG00000076242.csv
0 → 100644
View file @
cdece0cd
This source diff could not be displayed because it is too large. You can
view the blob
instead.
docs/input-preparation.rst
View file @
cdece0cd
...
...
@@ -72,8 +72,6 @@ a3m and fasta files are too long. We have to shorten them. We can do that with
.. code:: bash
awk 'BEGIN{FS=" "}{if(NF>1) {printf(">%s\n", $1)}else{print $0}}' AKE_nogaps.fasta > AKE_nogaps_short_names.fasta
# Recheck this command if you can remove extra >
Congratulations! Now, you have all the input files required for PRESCOTT:
...
...
docs/introduction.rst
View file @
cdece0cd
...
...
@@ -60,7 +60,7 @@ folder of this repository.
Below, you will find examples of the most basic usage. Consult to the
documentation for further details.
Running
the
escott program
Running escott program
~~~~~~~~~~~~~~~~~~~~~~~~~~
Let’s assume that our input MSA is inputAli.fasta and input.pdb is our
...
...
@@ -85,8 +85,8 @@ line of the file should contain a mutation (e.g. D136R) or combination
of mutations separated by colons and ordered according to their
positions in the sequence (e.g. D136R,V271A).
Running
the
prescott program with gnomAD frequency data
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
~~~~
Running prescott program with gnomAD frequency data
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
A quick help can be accessed by typing
...
...
@@ -98,15 +98,10 @@ Run the program by issuing the following command in a bash terminal:
.. code:: bash
prescott -e ../data/MLH1_normPred_evolCombi.txt -g ../data/gnomAD_v4.0.0_MLH1_HUMAN_ENSG00000076242.csv -s ../data/MLH1.fasta
GnomAD v4.0.0 is the most comprehensive, publicly available human population dataset as far as we know. However,
if you would like to use GnomAD v2.1.1, you should specify the version with '--gnomadversion' parameter as below:
.. code:: bash
prescott -e ../data/MLH1_normPred_evolCombi.txt -g ../data/gnomAD_v4.1.0_MLH1_HUMAN_ENSG00000076242.csv -s ../data/MLH1.fasta
prescott -e ../data/MLH1_normPred_evolCombi.txt -g ../data/gnomAD_v2.1.1_MLH1_HUMAN_ENSG00000076242.csv -s ../data/MLH1.fasta --gnomadversion 2
GnomAD v4.1.0 is the most comprehensive, publicly available human population dataset as far as we are aware of.
The most important output is prescott-scores.csv file, which produces entire single point mutational landscape for the protein.
In addition, there is a file called prescott-scores-details.csv. The file contains all information about the points modulated by population
...
...
@@ -138,7 +133,23 @@ variants are affected by population information.
Please note that the example input files of MLH1 protein for prescott acalculations are in the data directory of this repository.
Running the prescott program with custom frequency data
Running prescott program with older versions of GnomAD data
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
If you would like to use GnomAD v4.0.0 data in prescott, you should specify GnomAD version with '--gnomadversion' parameter as follows:
.. code:: bash
prescott -e ../data/MLH1_normPred_evolCombi.txt -g ../data/gnomAD_v4.0.0_MLH1_HUMAN_ENSG00000076242.csv -s ../data/MLH1.fasta --gnomadversion 40
However, if you would like to use GnomAD v2.1.1, you should specify the version with '--gnomadversion' parameter as below:
.. code:: bash
prescott -e ../data/MLH1_normPred_evolCombi.txt -g ../data/gnomAD_v2.1.1_MLH1_HUMAN_ENSG00000076242.csv -s ../data/MLH1.fasta --gnomadversion 2
Running prescott program with custom frequency data
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
What if you have your own frequencies for a set of missense mutations that you or another researcher measured?
prescott module can do calculations with custom frequency files as well. The first step is to prepare a plain text file with ".txt" extension.
...
...
@@ -188,6 +199,6 @@ and structural model accurately predicts missense effect. medRxiv. doi:10.1101/2
https://www.medrxiv.org/content/10.1101/2024.02.03.24302219v1
.. |License:
CC BY-NC-SA 4.0| image:: https://img.shields.io/badge/License-CC%20BY--NC--SA%204.0-lightgrey.svg
:target: http
://creativecommons.org/licenses/by-nc-sa/4.0/
.. |License:
GPLv3| image:: https://img.shields.io/badge/license-GPLv3-blue
:target: http
s://www.gnu.org/licenses/gpl-3.0.html
examples/example-prescott-script.sh
View file @
cdece0cd
...
...
@@ -4,7 +4,10 @@
#prescott -e ../data/MLH1_normPred_evolCombi.txt -g ../data/gnomAD_v2.1.1_MLH1_HUMAN_ENSG00000076242.csv -s ../data/MLH1.fasta --gnomadversion 2
#If you are using GnomAD v4.0.0 data.
prescott
-e
../data/MLH1_normPred_evolCombi.txt
-g
../data/gnomAD_v4.0.0_MLH1_HUMAN_ENSG00000076242.csv
-s
../data/MLH1.fasta
#prescott -e ../data/MLH1_normPred_evolCombi.txt -g ../data/gnomAD_v4.0.0_MLH1_HUMAN_ENSG00000076242.csv -s ../data/MLH1.fasta --gnomadversion 40
#If you are using GnomAD v4.1.0 data.
prescott
-e
../data/MLH1_normPred_evolCombi.txt
-g
../data/gnomAD_v4.1.0_MLH1_HUMAN_ENSG00000076242.csv
-s
../data/MLH1.fasta
#If you have a custom frequency file
#prescott -e ../data/MLH1_normPred_evolCombi.txt -g ../data/custom-frequency-file.txt -s ../data/MLH1.fasta
prescott-documentation.pdf
View file @
cdece0cd
No preview for this file type
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment