#datamustread: Visualize This: The FlowingData Guide to Design, Visualization, and Statistics (2nd Edition) by Nathan Yau

A bookshelf neatly arranged with several books on data visualization and analytics: Displayed in the center is the 2nd edition of "Visualize This: The FlowingData Guide to Design, Visualization, and Statistics" by Nathan Yau. Surrounding this book are various other titles, including those by the Alexander Loth: "Decisively Digital", "Teach Yourself VISUALLY Power BI", "Visual Analytics with Tableau", "Datenvisualisierung mit Tableau", "Datenvisualisierung mit Power BI", and "KI für Content Creation." Other visible titles include "Rewired" and "Self-Service BI & Analytics." The arrangement highlights a strong focus on data visualization, analytics, and AI.
The 2nd edition of Visualize This by Nathan Yau, surrounded by several influential data and AI books, including my own works like Decisively Digital and Teach Yourself Visually Power BI.

While my latest book, KI für Content Creation, has just been reviewed by the renowned c’t magazine, I’m happy to continue reviewing books myself. Today, I’m reviewing the just-released second edition of a cornerstone of the data visualization community, Visualize This: The FlowingData Guide to Design, Visualization, and Statistics by Nathan Yau.

Visualize This: A Deep Dive into Data Visualization

Nathan Yau’s Visualize This has long been a staple for data enthusiasts, and the updated second edition brings fresh techniques, technologies, and examples that reflect the rapidly evolving landscape of data visualization.

Core Highlights of This Book

Data-First Approach: Yau emphasizes that effective visualizations start with a deep understanding of the data. This foundational principle ensures that the resulting graphics are not just visually appealing but also accurately convey the underlying information.

Diverse Toolkit: The book introduces a wide range of tools, including the latest R packages, Python libraries, JavaScript libraries, and illustration software. Yau’s pragmatic approach helps readers choose the right tool for the job without feeling overwhelmed by options.

Real-World Applications: With practical, hands-on examples using real-world datasets, readers learn to create meaningful visualizations. This experiential learning approach is particularly valuable for grasping the subtleties of data representation.

Comprehensive Tutorials: The step-by-step guides are a standout feature, covering statistical graphics, geographical maps, and information design. These tutorials provide clear, actionable instructions that make complex visualizations accessible.

Web and Print Design: Yau details how to create visuals suitable for various mediums, ensuring versatility in application whether for digital platforms or printed materials.

Personal Insights on Visualize This

Having taught data strategy and visualization for seven years, I find Visualize This to be an exceptional resource for a broad audience. Yau skillfully integrates scientific data visualization techniques with graphic design principles, providing practical advice along the way. The book’s toolkit is extensive, featuring R, Illustrator, XML, Python (with BeautifulSoup), JSON, and more, each with working code examples to demonstrate real-world applications.

The image shows an open page from the second edition of "Visualize This: The FlowingData Guide to Design, Visualization, and Statistics" by Nathan Yau. The page, from Chapter 5 titled "Visualizing Categories," features a colorful visualization titled "Cycle of Many," which depicts a 24-hour snapshot of daily activities based on data from the American Time Use Survey. This visual highlights how categories change over time and demonstrates the book's practical approach to data visualization.
Visualize This featuring a colorful 24-hour activity visualization based on data from the American Time Use Survey.

Even though I read the first edition years ago, I couldn’t put the second edition down all weekend. This book is a must-read for anyone who handles data or prepares data-based reports. Its beautiful presentation and careful consideration of every aspect—from typeface to page layout—make it a pleasure to read.

The book is user-friendly, offering a massive set of references and free tools for obtaining interesting datasets across various fields, from sports to politics to health. This breadth of resources is crucial for anyone looking to create impactful visualizations across different domains.

While the focus on Adobe Illustrator might be daunting due to its cost and learning curve, Yau’s examples show how Illustrator can enhance graphics created in other tools like SAS and R. I personally prefer the open-source Inkscape, but Yau’s insights helped me overcome my initial reluctance to use Illustrator, leading to more polished and professional visuals.

Yau uses R, Python, and Adobe Illustrator to demonstrate what can be achieved with imagination and creativity. Although some readers might desire more complex walkthroughs from raw data to final graphics, such material would require substantial foundational knowledge in R and Python. Including this would make the book significantly thicker and veer off from its focus on creating visually appealing graphics.

Conclusion: Visualize This is Essential Reading for Data Professionals

Visualize This (Amazon) is an indispensable guide for anyone serious about data visualization. Its methodical, data-first approach, combined with practical tutorials and a comprehensive toolkit, makes it a must-read for information designers, analysts, journalists, statisticians, and data scientists.

For those looking to refine their data visualization skills and create compelling, accurate graphics, this book offers invaluable insights and techniques.

Connect with me on LinkedIn and Twitter for more reviews and insights on the latest in data & AI, and #datamustread:

„#datamustread: Visualize This: The FlowingData Guide to Design, Visualization, and Statistics (2nd Edition) by Nathan Yau“ weiterlesen

Power BI Tricks: 20 Essential DAX Tricks for Your Power BI Reports – A Comprehensive Guide to Power BI DAX

Even more Power BI DAX tricks in these books: "Datenvisualisierung mit Power BI" and "Teach Yourself Visually Power BI"
Even more Power BI DAX tricks in these books: „Datenvisualisierung mit Power BI“ and „Teach Yourself Visually Power BI“

Power BI DAX (Data Analysis Expressions) is at the core of Microsoft’s Power BI and offers incredible capabilities for data manipulation and insights. In this post, we’ll explore 20 ultimate DAX tricks to elevate your Power BI reports. Whether you’re a beginner or an expert, these tips will help you unlock the full potential of Power BI and Microsoft Fabric.

20 Ultimate DAX Tricks – Simply Explained

  1. Use CALCULATE for Context Modification 🛠️
    CALCULATE is a powerful function that changes the context in which data is analyzed.
    Example:CALCULATE(SUM('Sales'[Sales Amount]), 'Sales'[Region] = "West")
    This calculates the sum of sales in the West region.
  2. Use RELATED for Accessing Data from Related Tables 🔄
    RELATED function allows you to access data from a table related to the current table.
    Example: RELATED('Product'[Product Name])
    This fetches the product name related to the current row.
  3. Use EARLIER for Row Context 🕰️
    EARLIER is a useful function when you want to access data from an earlier row context.
    Example: CALCULATE(SUM('Sales'[Sales Amount]), FILTER('Sales', 'Sales'[Sales ID] = EARLIER('Sales'[Sales ID])))
  4. Use RANKX for Ranking 🏅
    RANKX function allows you to rank values in a column.
    Example: RANKX(ALL('Sales'), 'Sales'[Sales Amount], , DESC)
    This ranks sales amounts in descending order.
  5. Use DIVIDE for Safe Division 🧮
    DIVIDE function performs division and handles division by zero.
    Example: DIVIDE([Total Sales], [Total Units])
    This divides total sales by total units and returns BLANK() for division by zero.
  6. Use SWITCH for Multiple Conditions 🔄
    SWITCH function is a better alternative to nested IFs.
    Example: SWITCH([Rating], 1, "Poor", 2, "Average", 3, "Good", "Unknown")
    This assigns a label based on the rating.
  7. Use ALL for Removing Filters 🚫
    ALL function removes filters from a column or table.
    Example: CALCULATE(SUM('Sales'[Sales Amount]), ALL('Sales'))
    This calculates the total sales, ignoring any filters.
  8. Use CONCATENATEX for String Aggregation 🧵
    CONCATENATEX function concatenates a column of strings.
    Example: CONCATENATEX('Sales', 'Sales'[Product], ", ")
    This concatenates product names with a comma separator.
  9. Use USERELATIONSHIP for Inactive Relationships 🔄
    USERELATIONSHIP function allows you to use inactive relationships.
    Example: CALCULATE(SUM('Sales'[Sales Amount]), USERELATIONSHIP('Sales'[Date], 'Calendar'[Date]))
    This calculates sales using an inactive relationship.
  10. Use SAMEPERIODLASTYEAR for Year-Over-Year Comparisons 📆
    SAMEPERIODLASTYEAR function calculates the same period in the previous year.
    Example: CALCULATE(SUM('Sales'[Sales Amount]), SAMEPERIODLASTYEAR('Calendar'[Date]))
    This calculates sales for the same period last year.
  11. Use BLANK for Missing Data 🕳️
    BLANK function returns a blank.
    Example: IF('Sales'[Sales Amount] = 0, BLANK(), 'Sales'[Sales Amount])
    This returns a blank if the sales amount is zero.
  12. Use FORMAT for Custom Formatting 🎨
    FORMAT function formats a value based on a custom format string.
    Example: FORMAT('Sales'[Sales Date], "MMM-YYYY")
    This formats the sales date as „MMM-YYYY“.
  13. Use HASONEVALUE for Single Value Validation 🎯
    HASONEVALUE function checks if a column has only one distinct value.
    Example: IF(HASONEVALUE('Sales'[Region]), VALUES('Sales'[Region]), "Multiple Regions")
    This checks if there is only one region.
  14. Use ISFILTERED for Filter Detection 🕵️‍♀️
    ISFILTERED function checks if a column is filtered.
    Example: IF(ISFILTERED('Sales'[Region]), "Filtered", "Not Filtered")
    This checks if the region column is filtered.
  15. Use MAXX for Maximum Values in a Table 📈
    MAXX function returns the maximum value in a table.
    Example: MAXX('Sales', 'Sales'[Sales Amount])
    This returns the maximum sales amount.
  16. Use MINX for Minimum Values in a Table 📉
    MINX function returns the minimum value in a table.
    Example: MINX('Sales', 'Sales'[Sales Amount])
    This returns the minimum sales amount.
  17. Use COUNTROWS for Counting Rows in a Table 🧮
    COUNTROWS function counts the number of rows in a table.
    Example: COUNTROWS('Sales')
    This counts the number of rows in the Sales table.
  18. Use DISTINCTCOUNT for Counting Unique Values 🎲
    DISTINCTCOUNT function counts the number of distinct values in a column.
    Example: DISTINCTCOUNT('Sales'[Product]) This counts the number of distinct products.
  19. Use CONTAINS for Lookup Scenarios 🔍
    CONTAINS function checks if a table contains a row with certain values.
    Example: CONTAINS('Sales', 'Sales'[Product], "Product A")
    This checks if „Product A“ exists in the Sales table.
  20. Use GENERATESERIES for Creating a Series of Numbers 📊
    GENERATESERIES function generates a series of numbers.
    Example: GENERATESERIES(1, 10, 1)
    This generates a series of numbers from 1 to 10 with a step of 1.

Even more Power BI DAX Tricks

📚 If you want to dive even deeper into the world of Power BI, check out my Power BI books 🔗 Teach Yourself Visually Power BI (Amazon) and 🔗 Datenvisualisierung mit Power BI (Amazon)! These books are packed with even more tips, tricks, and tutorials to help you master Power BI. Don’t miss out on these invaluable resources!

Want to stay updated with the latest Power BI insights? Follow me on Twitter and LinkedIn. Share your thoughts, ask questions, and engage with a community of Power BI enthusiasts like yourself.

Feel free to leave a comment, ask questions, or share my Power BI DAX tweets:

„Power BI Tricks: 20 Essential DAX Tricks for Your Power BI Reports – A Comprehensive Guide to Power BI DAX“ weiterlesen

Newsletter: Data & AI Digest #1

Generated with DALL-E
Generated with DALL-E

Hello and welcome to the first issue of Data & AI Digest! We’ve curated an exceptional list of articles that delve into a wide array of topics—from the triumphs of data visualization in public health to the ethical dilemmas surrounding AI-generated images. Are you curious about running Python directly in Excel? We’ve got you covered!

  • [dataviz] The Triumph Over Tobacco: A Public Health Milestone: Explore how a blend of regulation, taxation, and education led to a significant decline in cigarette sales and lung cancer deaths in the U.S. Read more
  • [dataviz] Master the Art of Data Visualization with These Must-Read Books: Whether you’re a novice or a pro, discover five essential books that guide you to data visualization mastery. Read more
  • [analytics] Python Meets Excel: A New Era in Data Analysis: Announcing Python support in Microsoft Excel, enabling data analysis directly within the Excel grid—no separate Python installation needed. Read more
  • [powerbi] Unlock Power BI’s Full Potential with DAX: Discover 20 essential DAX tricks to enhance your Power BI reports, suitable for both beginners and experts. Read more
  • [ethics] Controversy Over ‚SmashOrPassAI‘ Site: A new site that allows users to rate AI-generated women sparks backlash, raising ethical concerns. Read more
  • [public opinion] Rising Concerns Over AI’s Role in Daily Life: A new Pew Research survey reveals growing apprehension among Americans about the role of AI in daily life, with views varying by age and use cases. Read more
  • [privacy] GDPR Complaint Against OpenAI Over ChatGPT: OpenAI faces allegations of GDPR violations regarding its ChatGPT model, as filed by a privacy researcher. Read more
  • [coding] Meet Code Llama: A New Large Language Model for Coding: Meta is introducing Code Llama, a state-of-the-art large language model designed to assist with coding tasks. Read more

We hope you find this week’s digest both informative and inspiring. Enjoy the newsletter? Help us make it bigger and better by sharing it with colleagues and friends. For even more curated content, discussions, and networking opportunities, don’t forget to join our LinkedIn Data & AI User Group.

Get the Data & AI Digest newsletter delivered to your email weekly.

Stay curious and keep innovating!

Introducing „Data & AI Digest“: Your Essential AI Newsletter for Weekly Insights on Data Science and Artificial Intelligence

Data & AI Digest - the AI newsletter (image generated with DALL-E)
Data & AI Digest – the AI newsletter (image generated with DALL-E)

Why You Can’t Afford to Miss the „Data & AI Digest“

In today’s fast-paced digital landscape, keeping up with the ever-changing worlds of Data Science and Artificial Intelligence (AI) can be challenging. From breakthroughs in machine learning algorithms to ethical debates surrounding AI, the scope of what you need to know is vast and continuously expanding.

Enter the Data & AI Digest, a weekly newsletter curated to keep you updated on the most pertinent news, articles, and discussions in the realms of data and AI. Whether you are a seasoned professional, a student, or simply an enthusiast, this newsletter is designed with you in mind.

„Introducing „Data & AI Digest“: Your Essential AI Newsletter for Weekly Insights on Data Science and Artificial Intelligence“ weiterlesen

#datamustread Data Viz Essentials: The Must-Read Books to Master Data Visualization

#DataVizEssentials 2023: The Must-Read Books to Master Data Visualization
#DataVizEssentials 2023: The Must-Read Books to Master Data Visualization

Building on the previous #datamustread recommendations, I’m excited to present the data viz edition of #datamustread. In this post, we’re focusing on the indispensable skill of data visualization. Whether you’re a beginner or a seasoned pro, these five books will guide you to mastery:

  1. 📖 The Big Book of Dashboards
  2. 📖 Storytelling with Data
  3. 📖 The Truthful Art
  4. 📖 Show Me the Numbers
  5. 📖 Teach Yourself VISUALLY Power BI

The Big Book of Dashboards: Visualizing Your Data Using Real-World Business Scenarios

A comprehensive guide filled with real-world solutions for building effective business dashboards across various industries and platforms. It’s a go-to resource for matching great dashboards with real-world scenarios.

Storytelling with Data: A Data Visualization Guide for Business Professionals

Cole Nussbaumer Knaflic shares practical guidance on creating compelling data stories. Learn how to make your data visually appealing, engaging, and resonant with your audience.

„#datamustread Data Viz Essentials: The Must-Read Books to Master Data Visualization“ weiterlesen