Research
Print page Print page
Switch language
The Capital Region of Denmark - a part of Copenhagen University Hospital
Published

SinaPlot: An Enhanced Chart for Simple and Truthful Representation of Single Observations Over Multiple Classes

Research output: Contribution to journalJournal articleResearchpeer-review

  1. The splicing factor RBM25 controls MYC activity in acute myeloid leukemia

    Research output: Contribution to journalJournal articleResearchpeer-review

  2. BloodSpot: a database of healthy and malignant haematopoiesis updated with purified and single cell mRNA sequencing profiles

    Research output: Contribution to journalJournal articleResearchpeer-review

  3. Leukemogenic nucleophosmin mutation disrupts the transcription factor hub regulating granulo-monocytic fates

    Research output: Contribution to journalJournal articleResearchpeer-review

  4. Differences in Cell Cycle Status Underlie Transcriptional Heterogeneity in the HSC Compartment

    Research output: Contribution to journalJournal articleResearchpeer-review

View graph of relations

Recent developments in data-driven science have led researchers to integrate data from several sources, over diverse experimental procedures, or databases. This alone poses a major challenge in truthfully visualizing data, especially when the number of data points varies between classes. To aid the representation of datasets with differing sample size, we have developed a new type of plot overcoming limitations of current standard visualization charts. SinaPlot is inspired by the strip chart and the violin plot and operates by letting the normalized density of points restrict the jitter along the x-axis. The plot displays the same contour as a violin plot but resembles a simple strip chart for a small number of data points. By normalizing jitter over all classes, the plot provides a fair representation for comparison between classes with a varying number of samples. In this way, the plot conveys information of both the number of data points, the density distribution, outliers and data spread in a very simple, comprehensible, and condensed format. The package for producing the plots is available for R through the CRAN network using base graphics package and as geom for ggplot through ggforce. We also provide access to a web-server accepting excel sheets to produce the plots (http://servers.binf.ku.dk:8890/sinaplot/).

Original languageEnglish
JournalJournal of Computational and Graphical Statistics
Pages (from-to)1-4
Number of pages4
ISSN1061-8600
DOIs
Publication statusPublished - 17 May 2018

    Research areas

  • Big data, Bioinformatics, Visualization

ID: 55136895