{"id":15606,"date":"2020-04-04T10:51:52","date_gmt":"2020-04-04T09:51:52","guid":{"rendered":"https:\/\/www.datanovia.com\/en\/?p=15606"},"modified":"2020-04-04T10:51:52","modified_gmt":"2020-04-04T09:51:52","slug":"dplyr-how-to-compute-summary-statistics-across-multiple-columns","status":"publish","type":"post","link":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/","title":{"rendered":"dplyr: How to Compute Summary Statistics Across Multiple Columns"},"content":{"rendered":"<div class=\"rdoc\">\n<p>This article describes how to compute summary statistics, such as mean, sd, quantiles, across multiple numeric columns.<\/p>\n<p><strong>Key R functions and packages<\/strong><\/p>\n<p>The <code>dplyr<\/code> package [v&gt;= 1.0.0] is required. We\u2019ll use the function <code>across()<\/code> to make computation across multiple columns.<\/p>\n<p><strong>Usage<\/strong>:<\/p>\n<pre class=\"r\"><code>across(.cols = everything(), .fns = NULL, ..., .names = NULL)<\/code><\/pre>\n<ul>\n<li><code>.cols<\/code>: Columns you want to operate on. You can pick columns by position, name, function of name, type, or any combination thereof using Boolean operators.<\/li>\n<li><code>.fns<\/code>: Function or list of functions to apply to each column.<\/li>\n<li><code>...<\/code>: Additional arguments for the function calls in .fns.<\/li>\n<li><code>.names<\/code>: A glue specification that describes how to name the output columns. This can use <code>{col}<\/code> to stand for the selected column name, and <code>{fn}<\/code> to stand for the name of the function being applied. The default (NULL) is equivalent to <code>\"{col}\"<\/code> for the single function case and <code>\"{col}_{fn}\"<\/code> for the case where a list is used for <code>.fns<\/code>.<\/li>\n<\/ul>\n<pre class=\"r\"><code># Load required R packages\r\nlibrary(dplyr)\r\n\r\n# Data preparation\r\ndf &lt;- as_tibble(iris)\r\nhead(df)<\/code><\/pre>\n<pre><code>## # A tibble: 6 x 5\r\n##   Sepal.Length Sepal.Width Petal.Length Petal.Width Species\r\n##          &lt;dbl&gt;       &lt;dbl&gt;        &lt;dbl&gt;       &lt;dbl&gt; &lt;fct&gt;  \r\n## 1          5.1         3.5          1.4         0.2 setosa \r\n## 2          4.9         3            1.4         0.2 setosa \r\n## 3          4.7         3.2          1.3         0.2 setosa \r\n## 4          4.6         3.1          1.5         0.2 setosa \r\n## 5          5           3.6          1.4         0.2 setosa \r\n## 6          5.4         3.9          1.7         0.4 setosa<\/code><\/pre>\n<pre class=\"r\"><code># Compute the mean of multiple columns\r\ndf %&gt;%\r\n  group_by(Species) %&gt;%\r\n  summarise(across(Sepal.Length:Petal.Length, mean, na.rm= TRUE))<\/code><\/pre>\n<pre><code>## # A tibble: 3 x 4\r\n##   Species    Sepal.Length Sepal.Width Petal.Length\r\n## * &lt;fct&gt;             &lt;dbl&gt;       &lt;dbl&gt;        &lt;dbl&gt;\r\n## 1 setosa             5.01        3.43         1.46\r\n## 2 versicolor         5.94        2.77         4.26\r\n## 3 virginica          6.59        2.97         5.55<\/code><\/pre>\n<pre class=\"r\"><code># Compute the mean and the sd of all numeric columns\r\ndf %&gt;%\r\n  group_by(Species) %&gt;%\r\n  summarise(across(\r\n    .cols = is.numeric, \r\n    .fns = list(Mean = mean, SD = sd), na.rm = TRUE, \r\n    .names = \"{col}_{fn}\"\r\n    ))<\/code><\/pre>\n<pre><code>## # A tibble: 3 x 9\r\n##   Species Sepal.Length_Me\u2026 Sepal.Length_SD Sepal.Width_Mean Sepal.Width_SD Petal.Length_Me\u2026 Petal.Length_SD\r\n## * &lt;fct&gt;              &lt;dbl&gt;           &lt;dbl&gt;            &lt;dbl&gt;          &lt;dbl&gt;            &lt;dbl&gt;           &lt;dbl&gt;\r\n## 1 setosa              5.01           0.352             3.43          0.379             1.46           0.174\r\n## 2 versic\u2026             5.94           0.516             2.77          0.314             4.26           0.470\r\n## 3 virgin\u2026             6.59           0.636             2.97          0.322             5.55           0.552\r\n## # \u2026 with 2 more variables: Petal.Width_Mean &lt;dbl&gt;, Petal.Width_SD &lt;dbl&gt;<\/code><\/pre>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>This article describes how to compute summary statistics, such as mean, sd, quantiles, across multiple numeric columns. Key R functions and packages The dplyr package [v&gt;= 1.0.0] is required. We\u2019ll [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":8000,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"rating_form_position":"","rating_results_position":"","mr_structured_data_type":"","footnotes":""},"categories":[341],"tags":[359,357,358],"class_list":["post-15606","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-faq","tag-data-manipulation","tag-dplyr","tag-tidyverse"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.2 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>dplyr: How to Compute Summary Statistics Across Multiple Columns - Datanovia<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"dplyr: How to Compute Summary Statistics Across Multiple Columns - Datanovia\" \/>\n<meta property=\"og:description\" content=\"This article describes how to compute summary statistics, such as mean, sd, quantiles, across multiple numeric columns. Key R functions and packages The dplyr package [v&gt;= 1.0.0] is required. We\u2019ll [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/\" \/>\n<meta property=\"og:site_name\" content=\"Datanovia\" \/>\n<meta property=\"article:published_time\" content=\"2020-04-04T09:51:52+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/10\/IMG_4709.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1024\" \/>\n\t<meta property=\"og:image:height\" content=\"512\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Alboukadel\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Alboukadel\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/\"},\"author\":{\"name\":\"Alboukadel\",\"@id\":\"https:\/\/www.datanovia.com\/en\/#\/schema\/person\/7767cf2bd5c91a1610c6eb53a0ff069e\"},\"headline\":\"dplyr: How to Compute Summary Statistics Across Multiple Columns\",\"datePublished\":\"2020-04-04T09:51:52+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/\"},\"wordCount\":144,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.datanovia.com\/en\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/10\/IMG_4709.jpg\",\"keywords\":[\"Data Manipulation\",\"dplyr\",\"tidyverse\"],\"articleSection\":[\"FAQ\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/\",\"url\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/\",\"name\":\"dplyr: How to Compute Summary Statistics Across Multiple Columns - Datanovia\",\"isPartOf\":{\"@id\":\"https:\/\/www.datanovia.com\/en\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/10\/IMG_4709.jpg\",\"datePublished\":\"2020-04-04T09:51:52+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#primaryimage\",\"url\":\"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/10\/IMG_4709.jpg\",\"contentUrl\":\"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/10\/IMG_4709.jpg\",\"width\":1024,\"height\":512},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.datanovia.com\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"dplyr: How to Compute Summary Statistics Across Multiple Columns\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.datanovia.com\/en\/#website\",\"url\":\"https:\/\/www.datanovia.com\/en\/\",\"name\":\"Datanovia\",\"description\":\"Data Mining and Statistics for Decision Support\",\"publisher\":{\"@id\":\"https:\/\/www.datanovia.com\/en\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.datanovia.com\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.datanovia.com\/en\/#organization\",\"name\":\"Datanovia\",\"url\":\"https:\/\/www.datanovia.com\/en\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.datanovia.com\/en\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/09\/datanovia-logo.png\",\"contentUrl\":\"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/09\/datanovia-logo.png\",\"width\":98,\"height\":99,\"caption\":\"Datanovia\"},\"image\":{\"@id\":\"https:\/\/www.datanovia.com\/en\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.datanovia.com\/en\/#\/schema\/person\/7767cf2bd5c91a1610c6eb53a0ff069e\",\"name\":\"Alboukadel\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.datanovia.com\/en\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/ed3108646c5c7c3d188324ab972f96ad7d9975b41b94014d7f68257791be395a?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/ed3108646c5c7c3d188324ab972f96ad7d9975b41b94014d7f68257791be395a?s=96&d=mm&r=g\",\"caption\":\"Alboukadel\"},\"url\":\"https:\/\/www.datanovia.com\/en\/blog\/author\/kassambara\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"dplyr: How to Compute Summary Statistics Across Multiple Columns - Datanovia","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/","og_locale":"en_US","og_type":"article","og_title":"dplyr: How to Compute Summary Statistics Across Multiple Columns - Datanovia","og_description":"This article describes how to compute summary statistics, such as mean, sd, quantiles, across multiple numeric columns. Key R functions and packages The dplyr package [v&gt;= 1.0.0] is required. We\u2019ll [&hellip;]","og_url":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/","og_site_name":"Datanovia","article_published_time":"2020-04-04T09:51:52+00:00","og_image":[{"width":1024,"height":512,"url":"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/10\/IMG_4709.jpg","type":"image\/jpeg"}],"author":"Alboukadel","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Alboukadel","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#article","isPartOf":{"@id":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/"},"author":{"name":"Alboukadel","@id":"https:\/\/www.datanovia.com\/en\/#\/schema\/person\/7767cf2bd5c91a1610c6eb53a0ff069e"},"headline":"dplyr: How to Compute Summary Statistics Across Multiple Columns","datePublished":"2020-04-04T09:51:52+00:00","mainEntityOfPage":{"@id":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/"},"wordCount":144,"commentCount":0,"publisher":{"@id":"https:\/\/www.datanovia.com\/en\/#organization"},"image":{"@id":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#primaryimage"},"thumbnailUrl":"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/10\/IMG_4709.jpg","keywords":["Data Manipulation","dplyr","tidyverse"],"articleSection":["FAQ"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/","url":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/","name":"dplyr: How to Compute Summary Statistics Across Multiple Columns - Datanovia","isPartOf":{"@id":"https:\/\/www.datanovia.com\/en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#primaryimage"},"image":{"@id":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#primaryimage"},"thumbnailUrl":"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/10\/IMG_4709.jpg","datePublished":"2020-04-04T09:51:52+00:00","breadcrumb":{"@id":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#primaryimage","url":"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/10\/IMG_4709.jpg","contentUrl":"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/10\/IMG_4709.jpg","width":1024,"height":512},{"@type":"BreadcrumbList","@id":"https:\/\/www.datanovia.com\/en\/blog\/dplyr-how-to-compute-summary-statistics-across-multiple-columns\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.datanovia.com\/en\/"},{"@type":"ListItem","position":2,"name":"dplyr: How to Compute Summary Statistics Across Multiple Columns"}]},{"@type":"WebSite","@id":"https:\/\/www.datanovia.com\/en\/#website","url":"https:\/\/www.datanovia.com\/en\/","name":"Datanovia","description":"Data Mining and Statistics for Decision Support","publisher":{"@id":"https:\/\/www.datanovia.com\/en\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.datanovia.com\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.datanovia.com\/en\/#organization","name":"Datanovia","url":"https:\/\/www.datanovia.com\/en\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.datanovia.com\/en\/#\/schema\/logo\/image\/","url":"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/09\/datanovia-logo.png","contentUrl":"https:\/\/www.datanovia.com\/en\/wp-content\/uploads\/2018\/09\/datanovia-logo.png","width":98,"height":99,"caption":"Datanovia"},"image":{"@id":"https:\/\/www.datanovia.com\/en\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.datanovia.com\/en\/#\/schema\/person\/7767cf2bd5c91a1610c6eb53a0ff069e","name":"Alboukadel","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.datanovia.com\/en\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/ed3108646c5c7c3d188324ab972f96ad7d9975b41b94014d7f68257791be395a?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/ed3108646c5c7c3d188324ab972f96ad7d9975b41b94014d7f68257791be395a?s=96&d=mm&r=g","caption":"Alboukadel"},"url":"https:\/\/www.datanovia.com\/en\/blog\/author\/kassambara\/"}]}},"multi-rating":{"mr_rating_results":[]},"_links":{"self":[{"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/posts\/15606","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/comments?post=15606"}],"version-history":[{"count":1,"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/posts\/15606\/revisions"}],"predecessor-version":[{"id":15607,"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/posts\/15606\/revisions\/15607"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/media\/8000"}],"wp:attachment":[{"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/media?parent=15606"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/categories?post=15606"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.datanovia.com\/en\/wp-json\/wp\/v2\/tags?post=15606"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}