fix issue 88 plotAbundance rank #113

Insaynoah · 2024-04-10T13:16:42Z

The original idea was for the default value to be equal to null instead of defaulting by 'Kingdom'. However, the plotExpression which is used for if the rank is not specified does not allow rank to be null because of issues with the scater library (merging error). Instead I created a function .find_lowest_taxonomy_level which looks for the lowest taxonomy level of the tse that does not contain NA's then reverts to that rank if the user didn't specify one.

TuomasBorman · 2024-04-10T14:00:35Z

Thanks!

Usually, there are some NAs in microbial datasets even in the highest ranks, so this might not be the most robust option.

Moreover, this still agglomerates the data which was the initial issue. Usually TreeSE is constructed so that the lowest rank (ASV/OTU/strain) is in "row level". The higher ranks are described in rowData (Species, Genus...). --> You cannot find the real lowest rank from rowData since it does not include that kind of information.

Check for example GlobalPatterns data. Agglomerate it to lowest available level in rowData (Species) --> the number of rows differ because there are more OTU level bacteria than Species level.

So the most transparent option would be to skip agglomeration by default.

In function .get_abundance_data, we could modify it so that it first calls agglomerateByRank only if rank is not NULL. (currently the agglomeration is done in the function without using these general mia functions for some reason)

miaViz/R/plotAbundance.R

Line 276 in 0fa499a

mutate(rank = factor(rowData(x)[,rank], unique(rowData(x)[,rank]))) %>%

So modifications to that function would be

a. Add line that calls agglomerateByRank
b. Remove line 276

Other thing is that then user cannot use plotExpression anymore from the plotAbundance function (plotExpression is called when rank is NULL).

Also, I noticed that plotExpression does not work currently because features cannot be NULL.

My suggestion:

rank = NULL by default
Use agglomerateByRank function in .get_abundance_data function (Skip the function when rank is NULL)
Remove plotExpression from plotAbundance function (User can use it from scater.)

TuomasBorman · 2024-04-11T06:55:44Z

@antagomir

antagomir · 2024-04-11T08:13:13Z

Reason for using scater::plotExpression has been that it is easier to use existing methods. Other option would be to reuse code from scater in miaViz. However, the scater pkg is under GPL-3 so we cannot use the code unless we change miaViz license (which we are by default unwilling to do). Thus we need to either stick to scater::plotExpression, or rewrite the necessary parts. Both are feasible options in principle. But calling scater would be less work.

If scater does not allow NULL ranks as discussed above, how about augmenting rowData internally and creating a new rowData field that equals to the assay rows. Then the agglomeration would not change the data but scater::plotExpression could still work?

The function .find_lowest_taxonomy_level might be otherwise useful somewhere.

TuomasBorman · 2024-04-11T12:33:28Z

Yes, but plotAbundance and plotExpression creates totally different kinds of plots.

library(miaViz)

data("GlobalPatterns")
tse <- GlobalPatterns

# This is how plotAbundance is "usually" used, --> it is agglomerated to some
# level
# It creates this common plot which is often used for microbiome summary
plotAbundance(tse, "Phylum")

# We have another layout for plotAbundance
plotAbundance(tse, "Phylum", layout = "point")

# If we want to visualize certain ASV level bacteria that belong to certain group for instance
tse_sub <- tse[1:10, ]

# We have to use plotExpression
plotAbundance(tse_sub, features = rownames(tse_sub), rank = NULL)

Currently, we cannot create plotAbundance-type plots for ASV level. So instead of using plotExpression, we could create plotAbundance-type plots also when rank==NULL

antagomir · 2024-05-21T19:18:02Z

I agree about separating plotAbundance and plotExpression.

I suggest that @TuomasBorman you check this with @Insaynoah when your respective schedules allow.

TuomasBorman · 2024-06-03T07:16:10Z

@Insaynoah Can you modify plotAbundance so that it does not utilize plotExpression when rank = NULL?

Insaynoah · 2024-06-03T15:27:29Z

One thing i don't really get is that if rank is set to null, since the plot's colors come from the rank, it will just create gray bars like this:

Thus, i don't really see how to make a usuable graph like this one.

TuomasBorman · 2024-06-03T15:43:46Z

Hmmm, true...

Well, if rank is NULL, then:

add rownames to rowData with name "rownames"
rank <- "rownames"

Does that work?

The problem might be that there are lots of rows to plot, but this is still useful since user might have already subsetted the data

Insaynoah · 2024-06-03T17:54:40Z

That is actually a very good solution. Now when rank is set to null, the plot will color by each individual rowname, creating something like this

However if you agglomerate the data beforehand such as by phylum it will create something like this:

Also when rank is set to null, I added a condition where order_sample_by should also be null because this parameter is dependant on the rank.

Let me know if I should make any changes.

Insaynoah · 2024-06-04T06:37:03Z

It seems as though, there's an issue with plotabundance in the Rmd file.
It uses the plotAbundance function with rank equal to NULL, expecting a plotExpression plot being returned:

But now that plotAbundance is updated to not use plotExpression when rank is null, the following line throws an error.

features <- match.arg(features, colnames(colData(x)))

Should the example be updated to not expect a plotExpression plot ?

TuomasBorman · 2024-06-04T07:16:50Z

I will comment on this later today in more detail, but you can use plotExpression() directly. That is handy function to visualize assay. (There will be support for boxplots in couple of days)

Also check OMA examples on this

antagomir · 2024-06-04T09:57:49Z

Also OMA chapter 8 on this would require updating once this is done:
https://microbiome.github.io/OMA/docs/devel/pages/21_microbiome_community.html

R/plotAbundance.R

antagomir · 2024-06-19T08:06:06Z

Hi @Insaynoah can we aim to close this soon?

Insaynoah · 2024-06-19T08:12:12Z

I think this one should be done no ? Am i missing something ?

antagomir · 2024-06-19T11:03:49Z

Ok to me (with no detailed testing).

antagomir · 2024-06-19T11:04:47Z

Ah, also confirm that you have updated vignettes/ folder if that contains examples.

If @TuomasBorman approves and merges this, please check that OMA examples are subsequently updated as well if this is used somewhere.

TuomasBorman

Can you still update documentation (description of rank and examples). Especially, you should modify example of rank = NULL. Abundance plot is not possible to do with too many features. You should first agglomerate the data and then plot.
Checks are failing
Can you update .get_abundance_data to use agglomerateByRank and meltSE

R/plotAbundance.R

antagomir · 2024-07-03T15:11:46Z

@Insaynoah any chance to fix this one..?

TuomasBorman · 2024-07-09T15:06:08Z

This PR:

Fixes issue with rank = NULL in plotAbundance. Now it is possible to plot abundances without agglomeration.
plotAbundance and prevalence plotting functions were using own implementations even though mia already includes implementations for agglomeration, melting etc. I modified the code so that mia is used whenever possible.
I deprecated plotFeaturePrevalence and created new function plotRowPrevalence (we have agreed to use row/col in function names)
The code lacked comments and explanations. I commented and simplified the code for easier maintenance.

The check fails in Mac and Win are caused by old version of mia. For some reason, they are not updated even though GHA should fetch the devel version of mia.

The check fail in linux is caused by permission issues since only devel branch can push to gh-pages branch. That issue is fixed in devel branch automatically. There seems not to be any other issues, so this PR can be merged.

fix issue 88 plotAbundance rank

3375c9c

Merge branch 'devel' into devel

e1b3df5

Merge branch 'devel' into devel

ff42ce7

rank can now be set to null

ba96357

Insaynoah added 3 commits June 4, 2024 13:42

fix rmarkdown issue

6afcc05

rmd file fix

caf7146

rmd file fix

023ef09

TuomasBorman reviewed Jun 4, 2024

View reviewed changes

R/plotAbundance.R Outdated Show resolved Hide resolved

TuomasBorman reviewed Jun 4, 2024

View reviewed changes

R/plotAbundance.R Outdated Show resolved Hide resolved

TuomasBorman reviewed Jun 4, 2024

View reviewed changes

R/plotAbundance.R Outdated Show resolved Hide resolved

Insaynoah and others added 2 commits June 5, 2024 09:19

Fixed issues

80ccbc2

Merge branch 'devel' into devel

7fb25ba

Insaynoah added 2 commits June 19, 2024 14:13

updated vignettes

cfc1c97

Merge branch 'devel' of https://github.com/Insaynoah/miaViz into devel

e597743

up

fca2242

TuomasBorman requested changes Jun 25, 2024

View reviewed changes

R/plotAbundance.R Show resolved Hide resolved

R/plotAbundance.R Outdated Show resolved Hide resolved

Insaynoah added 3 commits June 26, 2024 08:21

requested changes + documentation update

cd4e0bf

Merge branch 'devel' of https://github.com/Insaynoah/miaViz into devel

494f7ff

error fix

e97ba54

TuomasBorman reviewed Jun 26, 2024

View reviewed changes

R/plotAbundance.R Outdated Show resolved Hide resolved

Insaynoah added 2 commits June 27, 2024 10:18

check fix

5ac3525

sets rank to first taxonomic rank if more than 500 features

54c2c6d

TuomasBorman reviewed Jun 27, 2024

View reviewed changes

R/plotAbundance.R Outdated Show resolved Hide resolved

Insaynoah and others added 2 commits June 27, 2024 11:51

error when rank is null and more than 100 features

8a014e8

Merge branch 'devel' into devel

d561182

up

2275ea0

TuomasBorman approved these changes Jul 9, 2024

View reviewed changes

TuomasBorman added 5 commits July 9, 2024 11:34

up

287e58b

up

e1daf48

up

2425ead

up

4d7810e

up

869aae0

TuomasBorman linked an issue Jul 9, 2024 that may be closed by this pull request

Default ranks #88

Closed

TuomasBorman merged commit dde3253 into microbiome:devel Jul 9, 2024
0 of 3 checks passed

TuomasBorman mentioned this pull request Jul 9, 2024

Update params in user facing fxns #134

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix issue 88 plotAbundance rank #113

fix issue 88 plotAbundance rank #113

Insaynoah commented Apr 10, 2024

TuomasBorman commented Apr 10, 2024

TuomasBorman commented Apr 11, 2024

antagomir commented Apr 11, 2024

TuomasBorman commented Apr 11, 2024

antagomir commented May 21, 2024

TuomasBorman commented Jun 3, 2024

Insaynoah commented Jun 3, 2024

TuomasBorman commented Jun 3, 2024

Insaynoah commented Jun 3, 2024

Insaynoah commented Jun 4, 2024

TuomasBorman commented Jun 4, 2024

antagomir commented Jun 4, 2024

antagomir commented Jun 19, 2024

Insaynoah commented Jun 19, 2024

antagomir commented Jun 19, 2024

antagomir commented Jun 19, 2024

TuomasBorman left a comment

antagomir commented Jul 3, 2024

TuomasBorman commented Jul 9, 2024

fix issue 88 plotAbundance rank #113

fix issue 88 plotAbundance rank #113

Conversation

Insaynoah commented Apr 10, 2024

TuomasBorman commented Apr 10, 2024

TuomasBorman commented Apr 11, 2024

antagomir commented Apr 11, 2024

TuomasBorman commented Apr 11, 2024

antagomir commented May 21, 2024

TuomasBorman commented Jun 3, 2024

Insaynoah commented Jun 3, 2024

TuomasBorman commented Jun 3, 2024

Insaynoah commented Jun 3, 2024

Insaynoah commented Jun 4, 2024

TuomasBorman commented Jun 4, 2024

antagomir commented Jun 4, 2024

antagomir commented Jun 19, 2024

Insaynoah commented Jun 19, 2024

antagomir commented Jun 19, 2024

antagomir commented Jun 19, 2024

TuomasBorman left a comment

Choose a reason for hiding this comment

antagomir commented Jul 3, 2024

TuomasBorman commented Jul 9, 2024