I am passionate about fitness. Nutrition is a crucial part of fitness and hence, I love learning about the way nutrition works, and about people's habits and trends when it comes to food. I have gone through quite a bit of fitness content available online and felt like it was time I did some analysis of my own.
Dataset Used
The dataset used for this was originally obtained from MyFitnessPal. It contains 587,187 days of food diary records logged by 9.9K users over a 208 day period from September 2014 through April 2015. The time-series data includes a unique anonymised user id, date, list of foods and their corresponding macro-nutrient breakdown. Hats off to Ingmar Weber and Palakorn Achananuparp for making this data available.
Exploratory Data Analysis
Food Demand Analysis
First, I wanted to find the top foods consumed overall. I developed a function which picked them out month-wise and the results were as follows:
It was obvious from this chart how much people love their favourite cuppa. Coffee was the unparallelled choice throughout. The simple reason behind this could be that coffee is not just a drink — it is one of those simple pleasures that people enjoy as a part of their daily routine. It is also something that brings people together.
Following this were bananas — the most consumed fruits on a daily basis. Strawberries were also widely consumed but were superseded by bananas. However, this was reversed in the month of April, likely due to the onset of summer.
Spinach was the most consumed vegetable and salted butter was the most consumed fat source. This might be surprising as oil is known to be used a lot more than butter. The reason behind this result is likely the variety when it comes to oil (olive oil, vegetable oil, sunflower oil) — there is no such variety with butter, causing statistical convergence on a single product.
Chicken breast was the most popular protein source — no surprise as it is lean, high in protein and widely available.
This analysis can aid grocery stores in effectively stocking up on groceries. Anticipating the correct amount of need can help them avoid low profits due to products going out of stock and food wastage caused by products expiring.
User Analysis
The number of active users was analysed on each day between September 2014 and April 2015.
A sharp drop can be seen right after day 100 — determined to be Christmas. This makes sense as most people spend this time on vacation or with families and tend to get off their diet. The calories and macros are not available at most family dinners to input into the app either.
Following this, there is a dramatic rise around the new year. A lot of people would have put on holiday weight and would be motivated to shed it. The new year also motivates people to make healthy changes.
There was another serious drop towards the end of March. The most plausible explanation is that many users who started on January 1 were on a 3-month diet (one of the most popular durations), and some users must have stopped due to not noticing benefits or lack of motivation.
Day of Week Analysis
There was a significant difference between Monday and all other days. Monday makes for a stronger start and easier tracking. In contrast, Wednesday was much lower — mid-week motivations at an all-time low, so most people would decide to start next Monday instead.
Usage Duration
A large number of users used the app for just 72 hours, with the majority (422) just using it on the first day — likely users wanting to try the app before committing.
Following this drastic drop, usage gradually declines. However, there is a significant rise approaching the 180 mark (178 days highest with 1190 users) — likely the result of the many 180-day diet challenges available online.
The mean was 59 days and the median was 42 days. The mean being higher indicates positive skew by some extremely consistent users.
User Consistency
Over a quarter of users missed less than 10 days. However, this was skewed by users who used the app for less than 10 days total.
After filtering out users with fewer than 10 days of usage, the number fell by over 1000 users but remained comparatively high.
Approximately 1,100 people consistently tracked food for 3 months or more and missed under 20 days. These serious and committed users make up about 1/9th of the total population.
Meals Per Day
Most people ate 4 meals a day.
Goal Achievement
Just 15% of users significantly exceeded their calorie goal by over 100 calories. The remaining 85% successfully stuck to their goal overall.
Sugar Intake Analysis
Half the userbase was at risk of developing type II diabetes. 20% were at high risk, consuming over 200 grams of sugar per day.
In my opinion, sugar is one of the worst things you can put in your body. It has no nutritional benefit, is addictive and inflammatory. Using natural sweeteners such as Stevia would greatly help in combating type 2 diabetes and making the world a healthier place.
Key Insights Derived
Insights to Aid Supermarkets
- Coffee was the most popular item consumed
- Banana was the most consumed fruit
- The most popular protein source was chicken breast
Insights About People and Nutrition
- Sharp drop during Christmas and a dramatic rise on January 1 (New Year)
- On average, 82% of users met their goal and 85% were in a calorie deficit
- The percentage of serious users (logged over 3 months, missed less than 20 days) was 11.11%
- 20.6% were suspected to be at risk of getting type 2 diabetes
- Monday was the most popular start day; Wednesday was the least