From 206c495e6e211a40847fea5013d80579cece91ee Mon Sep 17 00:00:00 2001 From: Carol-seven <542046605@qq.com> Date: Mon, 8 Jul 2024 22:19:26 -0400 Subject: [PATCH] add notes regarding the analysis reference --- docs/articles/scaffold.html | 2 +- docs/articles/usage_template.html | 246 +++++++++++------- .../figure-html/unnamed-chunk-33-1.png | Bin 51804 -> 50909 bytes .../figure-html/unnamed-chunk-41-1.png | Bin 28774 -> 28928 bytes .../figure-html/unnamed-chunk-44-1.png | Bin 42136 -> 41872 bytes docs/pkgdown.yml | 2 +- docs/search.json | 2 +- vignettes/usage_template.Rmd | 116 ++++++--- 8 files changed, 234 insertions(+), 134 deletions(-) diff --git a/docs/articles/scaffold.html b/docs/articles/scaffold.html index fc193e5..f4f5749 100644 --- a/docs/articles/scaffold.html +++ b/docs/articles/scaffold.html @@ -80,7 +80,7 @@
Facility, UConnvignettes/scaffold.Rmd
scaffold.Rmd
vignettes/usage_template.Rmd
usage_template.Rmd
For Spectronaut: “PG.Genes”, +“PG.ProteinAccessions”, “PG.ProteinDescriptions”, and +“PG.ProteinNames”.
For Scaffold: “ProteinDescriptions”, +“AccessionNumber”, and “AlternateID”.
filterProtein(dataTran, proteinInformation = "preprocess_protein_information.csv",
text = c("Ras-related protein Rab-3D", "Alcohol dehydrogenase 1"),
- by = "description",
+ by = "PG.ProteinDescriptions",
removeList = FALSE)
where proteinInformation
is the file name for protein
information, automatically generated by preprocessing()
. In
-this case, the proteins with descriptions “Ras-related protein Rab-3D”
-or “Alcohol dehydrogenase 1” will be kept. Note that the search value
-text
is used for exact equality search.
"PG.ProteinDescriptions"
+match with “Ras-related protein Rab-3D” or “Alcohol dehydrogenase 1”
+will be kept. Note that the search value text
is used for
+exact equality search.
R.Condition | R.Replicate | +RAB3D_HUMAN | +ADH1_YEAST |
---|---|---|---|
100fmol | 1 | +16.68685 | +20.25893 |
100fmol | 2 | +16.57184 | +20.21888 |
100fmol | 3 | +16.49578 | +20.37810 |
100fmol | 4 | +16.62059 | +20.27705 |
100fmol | 5 | +16.59566 | +20.23536 |
50fmol | 1 | +16.46218 | +19.38908 |
50fmol | 2 | +16.66106 | +19.54470 |
50fmol | 3 | +16.64862 | +19.50293 |
50fmol | 4 | +16.63910 | +19.37445 |
50fmol | 5 | +16.61782 | +19.40820 |
The function analyze()
calculates the results that can
-be used in subsequent visualizations. If more than two conditions exist
-in the data, precisely two conditions for comparison must be specified
-via the argument conditions
.
Note: The following listed analysis compare data
+under two conditions. The order of
+conditions
will affect downstream analysis, as the
+second condition serves as the reference of
+comparison.
If only two conditions exist in the data and
+conditions
is not specified, conditions
will
+automatically be generated by sorting the unique values alphabetically
+and in ascending order.
If more than two conditions exist in the data, precisely two
+conditions for comparison must be specified via the argument
+conditions
.
-cond <- c("50fmol", "100fmol")
cond <- c("100fmol", "50fmol")
The Student’s t-test is used to compare the means between two conditions for each protein, reporting both the difference in means -between the conditions (calculated as Condition 1 - Condition 2) and the -P-value of the test.
+between the conditions and the P-value of the test. +
anlys_t <- analyze(dataImput, conditions = cond, testType = "t-test")
#> Data are essentially constant.
#> Data are essentially constant.
Oops! The warning message shows “Data are essentially constant,” -which means that the data contain proteins with the same value in all -samples. In this case, the P-value of t-test returns NA.
anlys_mod.t <- analyze(dataImput, conditions = cond, testType = "mod.t-test")
In the moderated t-test, a warning message might occur stating, “Zero -sample variances detected, have been offset away from zero.” This -warning corresponds to examples of proteins that exhibited identical -quant values, either pre- or post-imputation, and therefore no variance -is present across conditions for those proteins. This does not impede -downstream analysis; it merely serves to alert users to its -occurrence.
- - - -The result of testType = "MA"
is to generate the data
for plotting an MA plot, which represents the protein-wise averages
within each condition.
anlys_MA <- analyze(dataImput, conditions = cond, testType = "MA")
Most proteins are expected to exhibit little variation, leading to the majority of points concentrating around the line M = 0 (indicating no difference between group means).
+conditions
in the analyze()
will determine how
+the MA plot is visualized. The second row of anlys_MA
acts
+as the comparison reference: the first and second rows refer to
+variables \(log_2 X\) and \(log_2 Y\), respectively.
+
visualize(anlys_MA, graphType = "MA", M.thres = 1, transformLabel = "Log2")
#> Warning: Removed 16 rows containing missing values or values outside the scale range
@@ -3548,10 +3614,10 @@ MA
where M.thres = 1
means the M thresholds are set to −1
and 1. The scatters are split into three parts: significant up (M >
1), no significant (-1 \(\leq\) M \(\leq\) 1), and significant down (M <
--1). And transformLabel = "Log2"
is used to label the
-title. Additionally, the warning message “Removed 16 rows containing
-missing values” indicates that there are 16 proteins with no
-significance.
+-1). And transformLabel = "Log2"
is used to prefix the
+title, x-axis, and y-axis labels. Additionally, the warning message
+“Removed 16 rows containing missing values” indicates that there are 16
+proteins with no significance.
1CClP8GjUN4Tr|{v
z)?eSiAh1JCK_R;O{HiiUP(TpL$Mr;V5xVW4bm;+|0?Qr&xW@_opT_q4Y2caI3FkOj
zNII$r;7MlY=8e$!FE0|!#=-p7l328y%*XHz7YP+Fv3&-O>U_7mN#$I`-9EO8tn8jC
zuc&B%xR;6(Ng5j)iDLuwQ<*T=d)y)ldk7Oj@*CKo7^Gxno4zM)uQbi>;B#EbA!H3R
zCgLimJoPxO0yWdyh_25cKYqxT1Fbv7-jtNo7{V!1B+`KHwa^dDWE`_%mfi7({SFQR
z0f7xq@Jwg2>gu+S9A2vp4nNhSFtN7IbeK&J)q_4A6Cc0x$JwTjAU@^FzuZpI%jux;
zs?H+DqTozqPAqPN5aYLQ-CA*00M| YbpMYQ^0){n1vlu8gga#D>*fN1Lzrbp{=gs~~Qh&JIE_
zWF^#4MF}6?z59fHNr-8Sn^@mF*piYqpuF`p6IQUJf*U>7?V|AzAY`aw^g|)?((x)k
z*(KRQNOz0aR5C*Q1%QoXmYZyCr({uQ7ysvMHjV;#{CM2#iph3feY
z?dp%>wFr;=a2d=;G^o*m$JhKN2@a23hw@pwXmBk!>VDKtPSxY@;-Da1S}(O?vS;`1
z<78JiH_o+mL?WHRaagyAgv@sg01xgfQ7>SoN+NmUb)Alm4kR1}zaS@P{Xd9>4_$y6
z(G59DnB2QaemH4#%lKV_Sk(mUy=nUp9GNk7UwkPMut-mI;Z+xfCA|x
z%r)q4Z4r2e1DOY}#k(aG_>)Te@78Oo4~&6k6$vo?TGYN08l=!{Bxo}W@YfvPoI%+h
z2AmiD-fXi!)tk{prDr=Yj%|1XWSS^jM?Eui^)#{p={rw~x=1Br8z~R8`XF>QAKH__
zP~hBBR8&lk{InRa+7-Lw3Is$Y-%B>@f4bdozV>b1Jf5=pyfFFQU}o&M^5unn&lgLL
zas@14DZV6N#aA>0exz9bSNZSNZXTv|;0FNEJxMl=gLm6)I>aeA((X+cWW#&eaA^K2
zNgq+8Nu_jZv{%A3+RMxL&$-^w(bP?*N`KFNv`Qer_tMvy)9Q<-P5w4{(;bh1z|LC<
z&jY0b+a66@{vcml{)UCYjd_I^R>g78nx=Ydk7%H>J3s%b-ShvNdX#_7x45!YN2ec^
z{el8M@6DvkW}Mr?H=8%huO+ERLc+&nWrv^NNg^7IAZQ~5z^x9@OZKz}d1Rq|y9h|?
z!McsX+(^;@g!=0I)TQUo%~v|0syeUjZ)a|3p&_|6@)}o)X+tZakx&B?8JEj2UF~$O
zG@~V9Xo|^0<9XxEYd-ekKWwX*X!c-Ut7~~Dl
CaRuhT0w;L$x^O5ly0SOYS-GJIN!3q3R!xU>
z$JE^c@2RM4&@9*hwkd6(0_qh>_SK5}_Jw=&Rsw>rsN*)+ZYw^uyDtQ{Jt1$r<<~(o
zsDu?NR@~{mU{Q6p%`xnC`(UJ{tx|9p{YK%fEbBzwgvKJ}1ez$m-LX1e_DW&>pxLVQ
zmWOEzC4+kPlB>s0tv6q5u+J~G!(~9f#yYv>o5OSXf@U^2JjV`{kXNI)f4@0bM=~VG
zcslDBY03<5Fx5-zC?77?sTnT&m{P7>brzA>&8-d_3e#V5XX53vH(B-T=Li0CL%8u)
z#`E3@D_vbOkB`6h_~K#%cyq7raj!m=&BMLf#v