Skip to content

A collection of plugins to analyse lineages of tracked objects in Mastodon

License

Notifications You must be signed in to change notification settings

mastodon-sc/mastodon-deep-lineage

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Build Status License Coverage Lines of Code DOI YouTube

Mastodon Deep Lineage - a collection of plugins to analyse lineages of tracked objects in Mastodon

Table of Contents

Documentation of Mastodon

Mastodon Deep Lineage is an extension of Mastodon. For the full documentation of Mastodon, please visit mastodon.readthedocs.io.

Installation Instructions

  • Add the listed Mastodon update sites in Fiji:
    • Help > Update > Manage update sites
      1. Mastodon
      2. Mastodon-DeepLineage Mastodon Update sites

Numerical Features added to Mastodon

Spot Features

Feature name Projections Description Formula/Visualisation
Spot Ellipsoid Short semi axes, Middle semi axis, Long semi axis The ellipsoid semi axes in ascending order of length. The semi axes are computed applying the square root to the eigenvalues of the so-called covariance matrix of the spots
Volume The volume of the ellipsoid.

Example visualization: spotVolumeAnimation.gif
Spot Ellipsoid Aspect Ratios Aspect ratio short to middle The ratio between the short axis and middle axis.
Aspect ratio short to long The ratio between the short axis and long axis.

Example visualization: spotEllispoidAspectRatioAnimation.gif
Aspect ratio middle to long The ratio between the middle axis and long axis.
Spot Branch ID idem The ID of the branch spot each spot belongs to.
Spot Relative Movement* x, y and z component The x, y and z components of the movement distance of a spot relative to its n nearest neighbors. The number of neighbors to be considered can be specified by the users. Default is 5.

Example visualization: spotRelativeMovementXAnimation.gif
distance The movement distance relative to n nearest neighbors.

Example visualization: spotRelativeMovementSpeedAnimation.gif

Branch Features

Feature name Projections Description Formula/Visualisation
Branch N Leaves idem The total number of leaves of a branch spot in the whole track subtree of this branch spot.

Example visualization: branchNLeavesAnimation.gif
Branch N Successors and Predecessors idem Total number of successors and predecessors of a branch spot in the whole track subtree of this branch spot.

Example visualization:
Successors (encodes reproductivity) branchNSuccessorsAnimation.gif
Predecessors (encodes generation) branchNPredecessorsAnimation.gif
Branch Start and End Timepoints idem The first timepoint and the last timepoint of a branch spot.

Example visualization:
First timepoint branchFirstTimepoint.gif
Last timepoint branchLastTimepoint.gif
Branch Sinuosity idem The sinuosity of a spot during its life span (cf. Sinuosity), i.e. how much the track represented by the branch is curved. Values close to 1: almost straight movement. Values significantly higher than 1: winding or meandering movement. Positive infinity (∞), if the spot is at the end at the same position as at the beginning, but has moved in between.

Example visualization: branchSinuosityAnimation.gif
Branch Average Movement idem The average movement per frame of a spot during its life span. Example visualization: branchAverageMovementAnimation.gif

e.g.:

Branch Movement Direction idem The movement direction of a branch spot represented as a normalized directional vector pointing from the start (spot) position to the end (spot) position of the BranchSpot.

Example visualizations:
x-component branchMovementDirectionXAnimation.gif
y-component branchMovementDirectionYAnimation.gif
z-component branchMovementDirectionZAnimation.gif
Branch Cell Division Frequency idem Number of cell divisions in the subtree rooted at each Branch-spot divided by total duration of branches in this sub-tree.

Example visualization: branchCellDivisionFrequencyAnimation.gif
Branch Relative Movement* x, y and z component The x, y and z components of the average speed of a spot during its life span relative to its n nearest neighbors. The number of neighbors to be considered can be specified by the users. Default is 5.
average speed The average speed of a spot during its life span relative to its n nearest neighbors.

Example visualization: branchRelativeMovementAnimation.gif

'*' The relative movement features cannot be called from the FeatureComputer directly. Instead, they can be accessed via the plugin menu: Plugins > Compute Features > Movement of spots relative to nearest neighbors

Hierarchical Clustering of Lineage Trees

  • Menu Location: Plugins > Lineage Analysis > Hierarchical Clustering of Lineage Trees
  • This command is capable of grouping similar lineage trees together.
  • The linage clustering operates on Mastodon's branch graph.
  • Lineage trees are considered similar, if they share a similar structure and thus represent a similar cell division pattern. The structure of a lineage tree is represented by the tree topology. This tree topology consists of the actual branching pattern and the cell lifetimes, i.e. the time points between two subsequent cell divisions.
  • Functionality in a nutshell: nutshell.gif

Zhang Tree Edit Distance

  • The similarity of a pair of lineage trees is computed based on the Zhang edit distance for unordered trees (Zhang, K. Algorithmica 15, 205–222, 1996). This method captures the cost of the transformation of one tree into the other.
  • The Zhang unordered edit distance allows the following edit operations. The edit operations are defined in a way that they satisfy the constraints elaborated in section 3.1 ("Constrained Edit Distance Mappings") of the paper: Zhang, K. Algorithmica 15, 205–222, 1996
  Note: The prefix T may represent a node or a complete subtree. Nodes without this prefix are just nodes.
 
  1. Change label
 
        A         A'
       / \  -->  / \
      TB TC     TB TC
 
 
  2a: Delete subtree (opposite of 2b)
 
        A         A
       / \   -->  |
      TB TC       TB
 
  2b: Insert subtree (opposite of 2a)
 
        A          A
        |    -->  / \
        TB       TB TC
 
 
  3a: Delete one child of a node and delete the node itself (opposite of 3b)
 
        A             A
       / \   -->     / \
      B  TC         TD TC
     / \
    TD TE        (delete TE and B, TD becomes child of A)
 
  3b: Insert a node and insert one child at that node (opposite of 3a)
        A            A
       / \     -->  / \
      TB TC         D  TC
                   / \
                  TB TE       (insert D and TE, TB becomes child of D)
 
 
  4a: Delete node and delete its sibling subtree (opposite of 4b)
        A               A
       / \             / \
      B  TC   -->     TD TE
     / \
    TD TE            (Node B and its sibling subtree TC are deleted and the children
                      of B, namely TD and TE, become the children of A)
 
  4b: Insert node and insert a sibling subtree (opposite of 4a)
        A               A
       / \             / \
      TB TC   -->     D  TE
                     / \
                    TB TC       (Node D and its sibling TE are inserted,
                                 TB and TC become the children of D)

As an example, the following case explicitly does not fulfill the constraints mentioned in the paper:

 Delete a node without deleting one of its children
          A            A
         / \   -->   / | \
        B  TC      TD TE TC
       / \
      TD TE        (delete B, TD and TE become children of A and TC remains)
  • A basic example of the tree edit distance:
Tree1
	                        node1(node_weight=13)
	               ┌──────────┴─────────────┐
	               │                        │
	             node2(node_weight=203)   node3(node_weight=203)
Tree2
	                        node1(node_weight=12)
	               ┌──────────┴─────────────┐
	               │                        │
	             node2(node_weight=227)   node3(node_weight=227)
	                             ┌──────────┴─────────────┐
	                           node4(node_weight=10)    node5(node_weight=10)
  • Edit distance of 69, because:
    • one node has a difference of 1 (13-12)
    • two nodes have a difference of 24 each (227-203 * 2)
    • two extra nodes are added with a weight of 10 each (10 * 2)
      • zhang_example.gif

Workflow

  1. The similarity measure uses the attribute cell lifetime, which is computed as a difference of time points between two subsequent divisions. There are multiple ways to compute the similarity between two lineage trees based on this attribute:
    1. The sum of the edit distances as shown in the basic example above. Individual differences in the cell lifetimes may be normalized by their sum (i.e. local normalization)
    2. The sum of the edit distances as shown in the basic example above normalized by the maximum possible edit distances of the two trees (normalized zhang edit distance)
    3. The sum of the edit distances normalized by the number of the involved nodes (per branch zhang edit distance)
  2. The similarities are computed between all possible combinations of lineage trees leading to a two-dimensional similarity matrix. The values in this matrix are considered to reflect similarities of lineage trees. Low tree edit distances represent a high similarity between a discrete pair of lineage trees. This matrix is then used to perform an Agglomerative Hierarchical Clustering into a specifiable number of groups.
  3. For the clustering three different linkage methods can be chosen.

Parameters

  • Crop criterion:

    • The criterion for cropping the lineage trees
    • Number of spots (default)
    • Time point
  • Crop start

    • At which number of spots / time point (depending on the chose crop criterion) the analysis should start
  • Crop end

    • At which number of spots / time point (depending on the chose crop criterion) the analysis should end
  • Number of clusters

    • How many groups the lineage trees should be assigned to by the clustering
    • Must not be greater than the number of valid lineage trees
  • Minimum number of divisions

    • The minimum number of divisions a lineage tree should have so that it is included in the analysis
  • Similarity measures:

    1. (default) normalized_zhang_distance.gif1,2
    2. per_branch_zhang_distance.gif1
    3. Zhang Tree Edit Distance1,2
    • 1Local cost function: local_cost.gif
    • 2Local cost function with normalization: local_cost_normalized.gif
  • Linkage strategy for hierarchical clustering, cf. linkage methods

    1. Average (default)
    2. Single
    3. Complete
  • List of further projects

    • If you have multiple similar projects, you can add them here to get an average clustering taking all projects into account.
    • Mastodon projects can be added / removed using
      • "Add files..."
      • "Add folder content..."
      • "Remove selected"
      • "Clear list"
      • Drag and drop of files and folders
    • The name of the current open project is shown above the list. The current project is always included in the hierarchical clustering. It cannot be added to the list.
    • It is important that the names of the roots of lineages in all projects included in the hierarchical clustering are the same. Otherwise, the hierarchical clustering will not work.
    • The effect of adding further projects is that the similarity matrix is computed for each project separately and then averaged, resulting in a more robust hierarchical clustering.
  • Add generated tags to further projects

    • If checked, the tags generated by the hierarchical clustering are also added to the further projects.
    • Important note: this will write tags to these projects. Consider making a backup of the further projects before running the hierarchical clustering, if you choose this option.
  • Show dendrogram of hierarchical clustering of lineage trees

    • If checked, the dendrogram is shown after the hierarchical clustering
  • Check validity of parameters

    • Press this button to check, if with the current parameters a hierarchical clustering is possible
    • If the parameters are invalid, a message will appear with the reason(s)
    • Possible reasons for invalid parameters:
      • The number of clusters is greater than the number of valid lineage trees
      • The crop start is greater than the crop end
      • The crop end is greater than the maximum number of spots / time points
      • Further projects that are included in the hierarchical clustering could not be found / opened

Example

  • Demo data: Example data set
    • The demo data does not contain any image data.
    • The spatial positions of the spots are randomly generated.
    • When opening the dataset, you should confirm that you open the project with dummy images. Dummy images
  • The track scheme of the demo data contains 8 lineage tree in total. You may see that the "symmetric", the "asymmetric" and the "single division" trees look similar to each other, but dissimilar to the remaining trees. Trackscheme
  • The hierarchical clustering dialog. Settings
  • Cf. section Parameters for the meaning of the parameters.
  • Not visible to the user, a similarity matrix is computed based on the chosen similarity measure. For the demo data, the matrix looks like this. Highly similar trees have low distances in this matrix.
    • Similarity matrix
  • The resulting dendrogram.
    • User can toggle on/off root labels, tags, clustering threshold and median of the tree edit distances.
    • If the option Show tag labels is checked, the tag set shown in the dendrogram can be chosen.
    • Export options for the dendrogram to SVG and PNG accessible via the context menu.
    • Dendrogram
    • The result of the hierarchical clustering can be exported to a CSV file via the context menu. The exported file contains the root names of the lineage trees, the tag set value, the assigned group and the similarity score. The similarity score indicates how similar the lineage trees in this group are. The lower the score, the more similar the trees are.
    • Export clustering
  • The resulting tag set may be used for coloring the track scheme. Trackscheme with tags
  • The resulting tag set may be used for coloring the track scheme branch view. Trackscheme Branch View with tags
  • The resulting tag set may be used for coloring the spots in the BigDataViewer. BigDataViewer

Dimensionality reduction

For visualizing high-dimensional data, e.g. in two dimensions, potentially getting more insights into your data, you can reduce the dimensionality of the measurements, using this algorithm:

Usage

  • Menu Location: Plugins > Compute Feature > Dimensionality reduction

Select the graph type whose features should be dimensionality reduced, either the Model Graph with Features for Spots and Links or the Branch Graph with Features on BranchSpots and BranchLinks. Next, select the feature + feature projections that should be dimensionality reduced. Prefer to select features, which describe the phenotype (e.g. size, shape, velocity, number of neighbors, etc.). Only select positional features (e.g. centroid, coordinates, timeframe, etc.), if the position of cells within the image are descriptive for the phenotype. If you are unsure, you can select all features and then remove the positional features later.

Description

The available algorithms reduce the dimensionality of the selected features and adds the results as a new feature to the table. In order to do so, the selected algorithm uses the data matrix from the spot or branch spot table, where each row represents a spot or branch spot and each column represents a feature. The link and branch link features can be included in the algorithm.

If they are selected, the algorithm will use the link feature value of its incoming edge or the average of all values of all incoming edges, if there is more than one incoming edge.

The dialog will look like this: umap_dialog.png

By default, all measurements are selected in the box.

Parameters

Common Parameters

  • Standardize: Whether to standardize the data before reducing the dimensionality. Standardization is recommended when the data has different scales / units. Further reading: Standardization.
  • Number of dimensions: The number of reduced dimensions to use. The default is 2, but 3 is also common. Further reading: Number of Dimensions.

UMAP Parameters

  • Number of neighbors: The size of the local neighborhood (in terms of number of neighboring sample points) used for manifold approximation. Larger values result in more global views of the manifold, while smaller values result in more local data being preserved. In general, it should be in the range 2 to 100. Further reading: Number of Neighbors.
  • Minimum distance: The minimum distance that points are allowed to be apart from each other in the low dimensional representation. This parameter controls how tightly UMAP is allowed to pack points together. Further reading: Minimum Distance.

When you are done with the selection, click on Compute. The resulting values will be added as additional columns to the selected table.

umap_table.png

You can visualize the results using the Grapher View of Mastodon and selecting the newly added columns.

umap_grapher.gif

Visualization with the Mastodon Blender View is also possible.

umap_blender.gif

Example

The example above has been generated using the tgmm-mini dataset, which is included in the Mastodon repository.

Import

Import Spots from Segmented Label Image

  • Menu Location: File > Import > Import spots from label image
  • You can use the plugin to import spots from a label image representing an instance segmentation into Mastodon. This may be useful if you have an instance segmentation of cells or other objects, and you want to track them using Mastodon.
  • The label image is expected to contain the spot ids as pixel values.
  • The label image is expected to have the same dimensions as the image data in Mastodon.
  • Labels are processed frame by frame.
  • Multiple blobs with the same id in the same frame are considered to belong to the same spot by this importer. It is advised to use unique ids for spots in the same frame.
  • The resulting spots are ellipsoids with the semi axes computed from the variance covariance matrix of this pixel positions of each label.
  • Labels with only one pixel are ignored. This is because the variance covariance matrix is not defined for a single point. If you want to import single pixel spots, you can use the Import Spots from CSV plugin.
  • The resulting spots may be linked using the linker plugin in Mastodon (Plugins > Tracking > Linking...) or Elephant.

Parameters

  • Ellipsoid scaling factor: The scaling factor to apply to the ellipsoids. The default is 1.0. The scaling factor is applied to the semi axes of the ellipsoids. The ellipsoid scaling factor can be used to increase (>1) or decrease ( <1) the size of the resulting ellipsoid. 1 is equivalent of ellipsoids drawn at 2.2σ.
  • Link spots having the same id in consecutive frames: If checked, spots with the same label id in consecutive frames are linked. Division or merge events are not considered.
  • Image source that has been used for the external segmentation: The channel containing the image data that has been used to create the label image. This channel is used to check, if the dimensions of the label image match the dimensions of the image data in Mastodon.

Label image as active image in ImageJ

  • The label image can be opened in ImageJ and the plugin can be called from the menu: File > Import > Import spots from label image > Import spots from ImageJ image
  • Please make sure that the label image is the active image in ImageJ.
  • Please make sure that the label image has the same dimensions as the image data in Mastodon.
    • You can use the Image > Properties command ImageJ to check (and) set the dimensions of the label image.
Example
  • You can also watch a video tutorial on how to import spots from a label image in Mastodon YouTube
  • Example dataset: Fluo-C3DL-MDA231 from Cell Tracking Challenge
  • Extract the file to a folder named Fluo-C3DL-MDA231
  • Import the image sequence with the actual image into ImageJ contained in folder Fluo-C3DL-MDA231/01/
    • File > Import > Image Sequence...
  • Set the dimensions of the image sequence to 512x512x1x30x12 (XYCTZ) using Image > Properties
    • plugin_import_example_1.png
  • Open Mastodon from Fiji and create a new project with the image sequence
    • Plugins > Mastodon > new Mastodon project > Use an image opened in ImageJ > Create
    • plugin_import_example_2.png
  • Import the image sequence encoding the label images into ImageJ contained in folder: Fluo-C3DL-MDA231/01_ERR_SEG/
    • Set the dimensions of the label image to 512x512x1x30x12 (XYCTZ) using Image > Properties
    • plugin_import_example_3.png
  • Open Import window in Mastodon: File > Import > Import spots from label image > Import spots from ImageJ image
    • You can keep the ellipsoid scaling factor at 1.0. Select factor higher than 1.0 to increase the size of the ellipsoids and lower than 1.0 to decrease the size of the resulting ellipsoids.
    • Check the box to link spots having the same label id in consecutive frames.
    • Select the channel in Big Data Viewer containing the image that has been used to create the label image. The channel is used to check, if the dimensions of the label image in ImageJ match the dimensions of the image data in Mastodon.
    • plugin_import_example_8.png
      • Click OK and the spots are imported into Mastodon.
    • plugin_import_example_4.png

Label image as BDV channel

  • The plugin can be called from the menu: File > Import > Import spots from label image > Import spots from BDV channel
Example
  • You can also watch a video tutorial on how to import spots from a label image in Mastodon YouTube
  • Example dataset: Fluo-C3DL-MDA231 from Cell Tracking Challenge
  • Extract the file to a folder named Fluo-C3DL-MDA231
  • Import the image sequence with the actual image into ImageJ contained in folder Fluo-C3DL-MDA231/01/
    • File > Import > Image Sequence...
    • Set the dimensions of the image sequence to 512x512x1x30x12 (XYCTZ) using Image > Properties
    • plugin_import_example_1.png
  • Import the image sequence encoding the label images into ImageJ contained in folder: Fluo-C3DL-MDA231/01_ERR_SEG/
    • Set the dimensions of the label image to 512x512x1x30x12 (XYCTZ) using Image > Properties
    • plugin_import_example_3.png
      • Merge the 2 images into a single image using the Image > Color > Merge Channels... command
    • plugin_import_example_5.png
  • Open Mastodon from Fiji and create a new project with merged image
    • Plugins > Mastodon > new Mastodon project > Use an image opened in ImageJ > Create
    • plugin_import_example_6.png
  • Open Import window: File > Import > Import spots from label image > Import spots from BDV channel
    • Keep the ellipsoid scaling factor at 1.0
    • You can decide to link spots having the same label id in consecutive frames. This is useful if you have a time series of label images and you want to link spots between frames. Linking dividing spots cannot be done by this. The Mastodon Linker plugin should be used for this.
    • Select the BDV channel containing the label image that has been used to create the segmented label image. This is used to check, if the dimensions of the label image and the image data in BDV match, which is required.
    • plugin_import_example_9.png
      • Click OK and the spots are imported into Mastodon.
    • plugin_import_example_7.png

Export

Label Image Export

  • Menu Location: File > Export > Export label image using ellipsoids
  • The Label image exporter is capable of saving a label image to a file using the existing ellipsoids in Mastodon.
  • For the labels, the spot ids, branch spot ids or the track ids that correspond to the spots / ellipsoids may be used. Since these Ids are counted zero based in Mastodon, an offset of 1 is added to all Ids so that no label clashes with the background of zero.
  • The recommended export format is '*.tif'-files. However, it should work also for other formats supported by ImageJ.
  • The export uses an image with signed integer value space, thus the maximum allowed id is 2.147.483.646.
  • The dialog: Plugin Export Dialog

Parameters

  • Label Id: The id that is used for the labels. The default is the Spot track Id.
    • The ids correspond to the highlighted columns in the feature table: Feature Table
  • Frame rate reduction: Only export every n-th frame. 1 means no reduction. Value must be >= 1.
    • The frame number corresponds to the Spot frame column in the feature table.
  • Resolution level: Spatial resolution level of export. 0 means highest resolution. Value > 0 mean lower resolution.
  • Save to: Path to the file to save the label image to. Should end with '.tif'.

Example

  • Demo data: Example data set
  • The timelapse with the ellipsoids in BigDataViewer: BigDataViewer
  • The exported tif imported into Napari 3D view: Napari

GraphML Export

  • Menu Location: File > Export > Export to GraphML (branches)
  • Exports the branch graph to a GraphML file.
    • The graph is directed. The branch spots are the vertices and the branch links are the edges.
    • The vertices receive a label attribute with the branch spot name. The vertices receive a duration attribute with the branch duration.
    • The edges are not labeled and have no attributes.
  • GraphML can be visualized with Cytoscape, yEd or Gephi.
  • GraphML can be processed in Java using the JGraphT library.
  • GraphML can be processed in Python using the NetworkX library.

Options

  • Export all branches to GraphML (one file)
    • Exports the whole branch graph to a single file.
    • Select a file to save to. Should end with '.graphml'.
  • Export selected branches to GraphML (one file)
    • Exports the selected branches to a single file.
      • The selected branches are the ones that are highlighted in the branch view.
      • A branch is considered selected if at least one of its spots is selected. However, the exported duration attribute always reflects the whole branch duration.
    • Select a file to save to. Should end with '.graphml'.
  • Export tracks to GraphML (one file per track)
    • Exports each track to a separate file.
    • Select a directory to save to.

Example

  • Demo data: Example data set
  • The resulting file loaded into yEd: yEd
  • The resulting file loaded into Cytoscape: Cytoscape

Technical Information

Maintainer

Contributors

License

Contribute Code or Provide Feedback

  • You are welcome to submit Pull Requests to this repository. This repository runs code analyses on every Pull Request using SonarCloud.
  • Please read the general advice re contributing to Mastodon and its plugins.

Contribute Documentation

  • If you would like to contribute to this documentation, feel free to open a pull request. The documentation is written in Markdown format.

Acknowledgements

About

A collection of plugins to analyse lineages of tracked objects in Mastodon

Resources

License

Stars

Watchers

Forks

Contributors 4

  •  
  •  
  •  
  •  

Languages