Simple Data Analysis for Nonprofits

Ryan Brooks
Sep 15, 2023
10 min read

Updated: Aug 6, 2024

Analyze Your Nonprofit Program Data to Understand your Impact

Your social service nonprofit will collect lots of data about your participants (aka clients, consumers, guests) and services. Some of that will be demographic data about your participants, and some of it will focus on your outputs and outcomes to track and demonstrate your performance.

However, having a bunch of data in a bunch of spreadsheets, or stuck in your case management software, isn’t very satisfying. You’ll need to analyze your data to get a payoff from spending your time collecting it.

In this post, we explore a few different ways to examine your data, and we'll connect the dots with a lot of topics we've already discussed in our Nonprofit Data Bootcamp.

Measures of Central Tendency & Levels of Measurement: A Quick Review

Let’s do a quick review of levels of measurement and the measures of central tendency that can be used with each level of measurement.

Measures of Central Tendency

The measure of central tendency attempts describe your entire dataset with a single value. There are three measures of central tendency.

Mean – the mathematical average that you learned in elementary school.
Median – the middle value when all values in a set are rank ordered
Mode – the most common value in a set of values

Levels of Measurement

There are three levels of measurement that you should know: nominal, ordinal, and interval/ratio.

Nominal

Nominal Data is categorical data. The categories themselves don’t have an underlying numerical value and they cannot be rank ordered. No category is inherently “better” or “more” than the others. They are simply different.

Some examples nominal data:

Yes/No
Red/Blue/Black/Yellow
True/False
Military Veteran/Not A Military Veteran
Gender
Race & Ethnicity

Nominal data measure(s) of central tendency: Mode

Ordinal

Ordinal data is categorical data that has a logical order to it. It’s still categorical, so there is no inherent underlying numerical value. You can’t measure the exact “distance” between categories even though you can put them in order. We often assign a numeric value to them to make analysis easier.

Some Examples of Ordinal Data

Opinion: (1) Strongly Disagree, (2) Disagree, (3) Agree, (4) Strongly Agree
Educational Attainment: (1) Some High School, (2) High School Graduate, (3) Some College, (4) College graduate
Income Level: (1) $0-10,000, (2) $10,001-$20,000, (3) $20,001-$30,000, (4) $30,001 or greater

Ordinal data measure(s) of central tendency: median and mode

We often see the mean used with Likert Scale data. That's usually fine if you’re doing simple comparisons (e.g. customer satisfaction last year vs this year) and not doing rigorous data analysis (e.g. multivariate regression).

Interval/Ratio

Interval/Ratio* data is continuous data with an underlying numerical value. Interval/Ratio data can be rank ordered, and you can do mathematical operations with it because differences between values are meaningful and measurable.

Examples of Interval/Ratio data:

Height
Age
Annual income
Rent as a percentage of income
Years of Education completed

Interval/Ratio measures of central tendency: Mean and Median

The mode often not useful with interval/ratio data because there can be too many possible values. For example, if we asked 1,000 people for their income last year, we'd get several hundred different responses. In this situation, the mode would not be representative of the entire data set.

NOTE: Interval and ratio data are technically different levels of measurement. Their difference is not meaningful to most of us, most of the time, so we’ll combine them for simplicity.

Pro Tip: If you want to turn around a dull party, then try correcting someone when they mistakenly label interval data as ratio data in a casual data nerd conversation. Take Me Home, Country Roads suddenly stops. Harlem Shake plays. Wild dancing ensues. Works every time.

Descriptive Statistics

A descriptive statistics table presents an overview of your data. For example, you might display demographic characteristics of people using your pantry service along with data about meals served or the average number visits per year.

A descriptive statistics table helps you understand who you are serving and what/how much you are doing.

In Table 1(below), we present some descriptive statistics for our food pantry. Table 1 includes data on gender and race of the head, household composition, and finally a little performance data including: count of households served, total beneficiaries, and meals provided.

Table 1: Descriptive Statistics for Food Pantry Participants July 1, 2022 - June 30, 2023

Head of Household Gender	Count	Percent
Woman	83	67%
Man	38	31%
Another Gender	4	3%
Head of Household Race/Ethnicity	Count	Percent
Asian, Pacific Islander	6	5%
Black	25	20%
Hispanic or Latino	25	20
Multi-Racial	19	15%
White	44	35%
Another Race or Ethnicity	6	5%
Household Composition	Count	Percent
Single Parent/Guardian	69	55%
Multi-Parent/Guardian	19	15%
No Children or Dependents	37	30%
Households and Services	Count
Unduplicated Households	125
Unduplicated Beneficiaries (all household members)	350
Total Meals Served	25,200

Table 1 Description: example of descriptive statistics table for a food pantry. Table 1 includes participant demographics (gender, race, household composition), as well as some performance metrics (unduplicated households served, unduplicated beneficiaries, and total meals served).

The descriptive statistics table helps us quickly grasp the size, scope, and makeup of our program. For example, we can see that 67% of the heads-of-household that come to the organization are women. We can also see that slightly more than half (55%) of the households served are single-parent households. Finally, Table 1 shows that we served 25,200 meals to 125 unduplicated (i.e. unique) households.

Table 1 might be too large to grasp easily. Another option would be to break it apart. It would make sense to separate the demographic fields in the table from the fields focused on services delivery - basically everything under "Households and Services".

Questions to ask about your descriptive statistics table(s):

Are the people we serve what I expected...
- based on our mission, neighborhood or service area, program goals/targets?
How does this compare to the same period last year?
- Are we serving more men, multi-parent households, people who are employed full-time, veterans, etc?
What do I see that is unexpected?
- Did we serve more or fewer meals last year? Is that change surprising?
Is there anything I want to dig into more deeply? (see next section on cross tabulations)

Cross tabulations

Cross Tabulations (aka crosstabs) are one of the simplest and most powerful ways you can analyze your data. They are particularly useful when you want to compare groups.

For example, you might want to compare the average length of enrollment in a program for single parents to participants who don’t have children. Or, you might want to compare program dropout rates between people who speak English as their first language vs those whose first language is not English.

NOTE: You can use pivot tables in Excel and Google sheets to produce cross tabs. There are countless videos online that can teach you to make them in under 10 minutes.

Looking at Table 2 (below) we compare the results of single parent and childless adults in our employment program.

Table 2: Cross Tabulation of Household Composition and Employment Program Outcomes for Program Participants Between January 1, 2022 and December 31, 2022.

Measure	Single Parent Count	Single Parent Percent	Childless Adult Count	Childless Adult Percent	Total Count	Total Percent
Obtained Job	80	64%	50	67%	130	65%
Job has health benefits	57	46%	45	60%	102	51%
Job pays at least $15/hr	68	54%	47	63%	115	58%
Job has PTO	56	36%	44	59%	100	50%

Total	125		75		200

Table 2 Description: Example of a cross tabulation that compares single parents, childless adults, and all participants along 4 job program outcomes (obtained job, job has health benefits, job pays at least $15/hr, and Job has PTO). Table includes counts and percentages for each outcome measure.

We include both counts and percentages in our table because that gives us the clearest view of whether our work is equally successful for single parents and childless adults.

Table 2 also presents data for all participants. In this example, we only have 2 groups (single parents and childless adults). However, if you had 3 groups or more, comparing every group to every other group gets confusing and tedious. It’s helpful to be able to say “Participants who are X (e.g. single parents, immigrants, college graduates, etc) are/are not as successful as participants overall.

TIP: When presenting crosstabs, I like to put the differentiating variable in the columns. For example, if I wanted to compare program results based on gender, I would make gender categories my column headers. I would use the rows below to compare the results. In Table 2, that differentiating variable household composition (Single Parent vs Childless Adult vs All Participants).

Among X, what is Y? The power of cross tabs

Cross tabs let us ask and answer the general question “Among X, what is Y?”

Among the single parent households, what is the average number of times do they visit our food pantry?
Among multi-racial households, what is the average number of meals they receive each time they visit?

With that information, we can make simple comparisons that help us understand our programs and the needs of our participants.

How does the average number of visits to our food pantry compare for single parents, multi-parent, and childless adult households?
How long do employed participants receive services compared to unemployed participants?

We obviously can't answer every important question using cross tabs, but they can help us understand our programs' equity, effectiveness, and areas for improvement.

Scales

You probably won't see scales in your stats 101 course, but they're powerful and easy to understand. Scales allow you to combine a set of related measures into a single continuous measure.

Let’s say you have four Likert Scale questions about a person’s financial stability.

I can pay my rent on time
I save money for the future
I track my income and expenses
I can afford nutritious food

Our questions have the following response options:

Never
Rarely
Sometimes
Often
Always

NOTE: It's worth pointing out that the numeric values we assigned to each response are logically ordered just like the categories. But, the categories themselves are not precise and the numbers we assign are not perfect.

There's no reason that "never" should have the value of 1 instead of 0 or -1 or something else. The values on our scale should be evenly spaced (e.g. 1, 2, 3, 4, 5 not 1, 3, 17, 97, 98, 250), but we're still dealing with categories that are "fuzzy". It's fine, just be aware of it.

Scales give us three big benefits.

First, you can treat data in scales as interval/ratio data, so you can do calculate means (and standard deviations if you are so inclined).

Second, many concepts that we care about, such as financial stability, are complex and multi-faceted. No single question or data point can measure the concept fully, so we might need to use multiple questions to do that. With scales, we can combine data from multiple questions into a single “scale score”.

Third, scales can make cross tabulations much easier and more accurate.

In this example, we could create cross tabs with race and each of the 4 financial stability questions. Frankly, we probably should do that to see if anything stands out, but we also in tricky territory. We now have 4 tables to compare and synthesize to determine if our program is less effective for, say, Native Americans. That's a tough exercise.

But, we could combine the assigned numeric values for each of these responses into a single scale score. With a single scale score, our comparisons are much easier.

Table 3 and Table 4 (below) helps us see how we might construct a scale score.

Table 3: Individual Participant Responses to Financial Stability Questions

	Pay Rent on Time	Save Money	Track Income & Expenses	Afford Nutritious Food
Jim	(4) Often	(2) Rarely	(1) Never	(2) Rarely
Tina	(3) Sometimes	(1) Never	(1) Never	(1) Never
KT	(4) Often	(2) Rarely	(2) Rarely	(3) Sometimes
Maria	(5) Always	(1) Never	(3) Sometimes	(2) Rarely
Pat	(5) Always	(4) Often	(4) Often	(5) Always

Table 3 Description: Example of table that presents individual values to four Likert Scale questions focused on financial stability. The complete questions are listed above. For simplicity, the columns are labeled as: (1) Pay Rent on Time, (2) Save Money, (3) Track Income & Expenses, and (4) Afford Nutritious Food.

Table 3 is OK. We can get a sense that Pat is doing pretty well overall and that Tina is struggling. But, Table 3 is deceptive because it's small. There are only 5 participants and 4 questions. What if there were 50 participants and 10 questions - so 500 cells to review? It would be overwhelming.

We can make these comparison simpler by combining the values we see in each column into a single Financial Stability Scale score. For example, for Jim, we would add 4+2+1+2 = 9. Jim has a score of 9 on the Financial Stability Scale.

Table 4 shows the results for all 5 participants in the Scale Score column.

Table 4: Financial Stability Scale Score and Average Item Score

	Scale Score	Average Item Score	Comparison to Overall Averages
Jim	9	2.25	Below Average
Tina	6	1.5	Below Average
KT	11	2.75	Average
Maria	11	2.75	Average
Pat	18	4.5	Above Average
Overall Averages	11	2.75

Table 4 Description: Example of table that includes the Scale Score, Average Item Score, and Comparison to the Overall Averages for five participants on a Financial Stability Scale. Two participants (Jim and Tina) have below average scores, two participants (KT and Maria) have average scores, and one participant (Pat) has an above average score.

Looking at Jim's score of 9, do we think that's good, bad, or somewhere in the middle? It’s hard to say without more context.

Scales can strip away the context that is inherent to these questions (i.e. 1 = never, 5 = always), so we have to find a way to create that context again. We have two easy ways to do that.

First, we can just calculate the overall average financial stability scale score. That’s easy enough. We included that on the bottom of Table 4 as "Overall Averages".

Now we can compare each individual to the overall average score.

This is an improvement, but it still feels a little lacking. Maybe if I had access to a few years of data for comparisons, then it would mean something more. As is, it still feels like we’re missing something.

Second, we could calculate each individual’s average item score. To do that, we simply divide their scale score by the number of questions (n=4). Pat has a score of 18, so Pat's average item score is 8/4 = 4.5.

This gives us useful contextual information because can anchor our analysis on those original response values (1=never, 2 = rarely, and so on).

Tina’s average score is 1.5. Her average score lies somewhere between Never (=1) and Rarely (=2). Tina is probably not financially stable. On the other hand, Pat’s average of 4.5 is somewhere between Often (=4) and Always (=5). Pat's looking pretty good.

TIP for making Scales: The questions you include to measure a complex concept clearly belong together and are associated with that topic.

For example, if you are measuring financial stability, you would not include veteran status or dietary restrictions. Sure, you could find a way to say those things impact financial stability; but they are pretty far afield of the main concept you are measuring.

We can see how scales take potentially complex data and analysis and simplify it. They can help use draw conclusions about our work and focus on areas for celebration and improvement.

Data Tools and Data Software

Spreadsheets are powerful tools for data analysis, and there are countless videos and written tutorials that explain how to do cross tabulations (aka pivot tables) and basic statistical analysis. When your data gets more complex because your nonprofit has multiple programs, data stored in spreadsheets gets hard to manage; but they are affordable and powerful tools to rely upon as long as your data is simple.

If you’re using (or shopping for) specialized data collection or case management software it should have built-in tools that easily produce basic statistics that you regularly need, and/or you should be able to export your data to other data analysis tools (like spreadsheets).

Learn More About Nonprofit Data Management

This post is part of our nonprofit data bootcamp series. Check out the complete list of nonprofit data bootcamp topics with links to other published posts.

Reporting your impact is hard when you’re juggling spreadsheets. countbubble makes it easy so you can focus on your mission.

countbubble is case management simplified. We can help your nonprofit master data collection and reporting. Email us contact@countbubble.com or sign up for email updates on blog posts, product news, or scheduling a demo.

Ryan Brooks

Founder, CountBubble, LLC

Please connect with us on social media: Facebook and LinkedIn