<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:cc="http://cyber.law.harvard.edu/rss/creativeCommonsRssModule.html">
    <channel>
        <title><![CDATA[Stories by Enwere Bright on Medium]]></title>
        <description><![CDATA[Stories by Enwere Bright on Medium]]></description>
        <link>https://medium.com/@enwerebright?source=rss-0ac631af6f32------2</link>
        <image>
            <url>https://cdn-images-1.medium.com/fit/c/150/150/0*C6_X5CX5pemJlJ-a</url>
            <title>Stories by Enwere Bright on Medium</title>
            <link>https://medium.com/@enwerebright?source=rss-0ac631af6f32------2</link>
        </image>
        <generator>Medium</generator>
        <lastBuildDate>Fri, 05 Jun 2026 11:29:24 GMT</lastBuildDate>
        <atom:link href="https://medium.com/@enwerebright/feed" rel="self" type="application/rss+xml"/>
        <webMaster><![CDATA[yourfriends@medium.com]]></webMaster>
        <atom:link href="http://medium.superfeedr.com" rel="hub"/>
        <item>
            <title><![CDATA[Cleaning and Exploring the Ask a Manager Survey Dataset]]></title>
            <link>https://medium.com/@enwerebright/cleaning-and-exploring-the-ask-a-manager-survey-dataset-9120003cd2f9?source=rss-0ac631af6f32------2</link>
            <guid isPermaLink="false">https://medium.com/p/9120003cd2f9</guid>
            <category><![CDATA[pandas]]></category>
            <category><![CDATA[numpy]]></category>
            <category><![CDATA[seaborn]]></category>
            <category><![CDATA[exploratory-data-analysis]]></category>
            <category><![CDATA[python]]></category>
            <dc:creator><![CDATA[Enwere Bright]]></dc:creator>
            <pubDate>Sun, 09 Feb 2025 23:30:16 GMT</pubDate>
            <atom:updated>2025-02-10T10:32:23.085Z</atom:updated>
            <content:encoded><![CDATA[<h4>Project Overview: I cleaned and performed exploratory analysis on super dirty survey dataset titled “Ask a Manager Salary Survey 2021 (Responses)’. Additionally, I utilized Power BI for advanced visualization, uncovering insights on how salary varies across age groups, work experience, and industry.”</h4><h4><strong>Dataset Description:</strong></h4><p>The dataset, titled “Ask a Manager Salary Survey 2021 (Responses),” is a real-world collection of survey data featuring 17 variables focused on salaries and workplace demographics. Sourced from AskAManager.org, the data primarily reflects responses from the United States, with additional inputs from international participants.</p><ul><li>‘Gender’ tells us the gender of workers.</li><li>‘Industry’ tells us the industry of the workers.</li><li>‘Age Groups’ tells us the age group of the workers.</li><li>‘Work Year Experience’ tells us the year of experience of the workers.</li><li>‘Location (Country, State, City)’ tells us the location of workers.</li><li>‘Salary’ tells us the Annual salary of workers.</li><li>‘Education’ tells us the highest Education qualifications of workers.</li><li>‘Monetary compensation’ tells us the bonuses or overtime in an average year.</li><li>‘Race’ tells us the race of the workers.</li><li>‘Job Title’ tells us the job title of the workers.</li></ul><p>See the Datasets here: <a href="https://github.com/brightboy373/Cleaning-Exploring-the-Ask-a-Manager-Survey-Dataset/blob/main/Ask%20A%20Manager%20Salary%20Survey%202021.csv">Ask a Manager Survey 2021(Responses)</a></p><h4>Tools and Frameworks:</h4><ul><li>Python, Jupyter notebook (Cleaning and exploratory analysis)</li><li>Libraries: Pandas, Numpy, matplotlib and, Seaborn.</li><li>Power BI (Data Visualization)</li></ul><p><strong>Key Question(s) for Analysis</strong></p><ul><li>How does age group influences salary?</li><li>Which industry pay the most?</li><li>Which Educational Qualification earns the most salary?</li><li>What is the most popular industry?</li><li>How does gender vary by educational qualification?</li><li>How do work year experience influences salary?</li></ul><h4><strong>Key Steps and Challenges:</strong></h4><ul><li>Imported Libraries needed like seaborn, pandas, numpy, and matplotlib.</li><li>Handling Encoding Errors: The first hurdle was an encoding issue. I resolved it by using the latin-1 encoding to ensure proper reading of the data.</li><li>Renaming Irregular Columns: The column names were inconsistent, so I standardized them using Python’s rename() function with a dictionary argument. Additionally, I applied the lower() function to make them lowercase and strip() to remove trailing spaces.</li><li>Exploring Categories: Using value_counts(), I examined the distribution of categories in variables like &quot;Country&quot; and &quot;Job title.&quot; This step helped me identify inconsistencies, such as duplicate entries (e.g., &quot;United States&quot;, &quot;US&quot;. &quot;USA&quot;, &quot;UK&quot;, &quot;United Kingdom&quot;, &quot;United kinkdom&quot;), and regroup them for clarity to the standard name.</li><li>Defined and handled the inconsistent data type assigned to my columns by assessing the data programmatically using the info() function.</li><li>In the race column, I regrouped the column to multiracial, Biracial, and moniracial for easy readability. Categories with one race was named monoracial, Categories with two races was named Biracial, and categories with more than two races was renamed multiracial.</li><li>Changed the salary column from string to float using replace function to remove the commas. Then I converted all the salary to USD. Then I filled in null values with the mean before converting the data type from string to integer. I handled outlier in our salary column by calculating the interquartile range.</li><li>The monetary compensation, I converted it to USD using the current exchange rate, using lambda function. Then I also filled in null values with the mean before converting to integer.</li><li>Recategorized some rows in my work_year_exeperience column for easy understanding eg 41yearsormore to 41years+ using the replace function.</li><li>I dropped unnecessary columns in my datafrom such as other_job_context which was not needed for my analysis.</li></ul><p>See my python codes here: <a href="https://github.com/brightboy373/Cleaning-Exploring-the-Ask-a-Manager-Survey-Dataset/blob/main/Survey%20data%20Analysis%20Project.ipynb">Data Cleaning and Exploratory Analysis</a></p><h4>Data Visualization With Power BI</h4><p>After transforming my data with python, I created dashboard with Power BI to uncover insight into average salary by education, age group, work experience, and Top 5 industries by salary.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*Eysqp8rk3T2hoKNm" /></figure><h4>Insights</h4><ul><li>The survey datasets highlight a clear relationship between salary, age group, and work experience. The 45–54 age group earns the highest average salary, reflecting their peak earning potential, while the under 18 age group has the lowest, likely due to limited work experience and entry-level roles. Similarly, work experience significantly impacts earnings, with those having 21–30 years of experience earning the most, followed by 31–40 years of experience. Interestingly, salary is likely to be influenced more by work experience than age. The slight decline in salary for the latter group may indicate that earnings begin to plateau or decrease as individuals approach retirement.</li><li>Individuals with Professional degrees have the highest average salaries, closely followed by those with PhDs, emphasizing the financial benefits of advanced education. In contrast, individuals with only a high school education earn the least on average, reflecting the influence of educational attainment on earning potential. This trend highlights the vital role of higher education in unlocking better-paying opportunities, highlighting its value in career advancement and income growth.</li><li>There is an imbalance in gender distribution in education levels, particularly in advanced degrees like PhDs and professional degrees, where men are more represented. However, women lead in the Master’s degree category, suggesting their pursuit of higher education. The low representation of non-binary individuals and undisclosed gender groups across all education levels highlighting the need for inclusivity and opportunities for these groups to bridge educational gaps.</li><li>The science and technology sector offers the highest salary opportunities, making it the fastest growing industry in this era.</li><li>Education and Salary: Individuals with higher educational qualifications, such as Professional degrees and PhDs, earn more on average. Encouraging and supporting individuals to pursue advanced education could improve earning potential. This is especially relevant for industries requiring specialized skills.</li><li>Work Experience and Salary: Salaries tend to peak between 21–30 years of work experience, suggesting the importance of consistent career progression and skill development. Employers should consider implementing mentorship and growth opportunities for employees to help them maximize their earning potential within this range.</li><li>Age and Salary: Salaries increase steadily across age groups, peaking around 45–54 years. Employers might consider leveraging the expertise of this group while also investing in younger employees to ensure a skilled workforce over time.</li><li>Industry Trends: The science and technology sector offers the highest salary opportunities, followed by education and NGOs. Individuals should be guided toward industries with higher earning potentials based on their skills and interests. Policymakers and educators might also emphasize STEM (Science, Technology, Engineering, and Math) education to prepare individuals for these lucrative fields.</li><li>Gender Representation: Efforts should be made to address gender gaps in high-paying industries and leadership roles to ensure equity and diversity. Programs to empower underrepresented groups to access better education and career opportunities would contribute to societal balance.</li></ul><h3>Recommendation</h3><ul><li>Education and Salary: Individuals with higher educational qualifications, such as Professional degrees and PhDs, earn more on average. Encouraging and supporting individuals to pursue advanced education could improve earning potential. This is especially relevant for industries requiring specialized skills.</li><li>Work Experience and Salary: Salaries tend to peak between 21–30 years of work experience, suggesting the importance of consistent career progression and skill development. Employers should consider implementing mentorship and growth opportunities for employees to help them maximize their earning potential within this range.</li><li>Age and Salary: Salaries increase steadily across age groups, peaking around 45–54 years. Employers might consider leveraging the expertise of this group while also investing in younger employees to ensure a skilled workforce over time.</li><li>Industry Trends: The science and technology sector offers the highest salary opportunities, followed by education and NGOs. Individuals should be guided toward industries with higher earning potentials based on their skills and interests. Policymakers and educators might also emphasize STEM (Science, Technology, Engineering, and Math) education to prepare individuals for these lucrative fields.</li><li>Gender Representation: Efforts should be made to address gender gaps in high-paying industries and leadership roles to ensure equity and diversity. Programs to empower underrepresented groups to access better education and career opportunities would contribute to societal balance.</li></ul><p><em>Originally published at </em><a href="https://github.com/brightboy373/Cleaning-and-Exploring-the-Ask-a-Manager-Survey-Dataset"><em>https://github.com.</em></a></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=9120003cd2f9" width="1" height="1" alt="">]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Exploratory Data Analysis on Retail Sales Data]]></title>
            <link>https://medium.com/@enwerebright/exploratory-data-analysis-on-retail-sales-data-6bd7da18b540?source=rss-0ac631af6f32------2</link>
            <guid isPermaLink="false">https://medium.com/p/6bd7da18b540</guid>
            <category><![CDATA[exploratory-data-analysis]]></category>
            <dc:creator><![CDATA[Enwere Bright]]></dc:creator>
            <pubDate>Sun, 09 Feb 2025 22:41:13 GMT</pubDate>
            <atom:updated>2025-02-10T01:31:22.915Z</atom:updated>
            <content:encoded><![CDATA[<p>As a data Analyst Intern at Oasis Infobytes, I explored retail sales data to uncover patterns, trends, and insights that can help businesses make data-driven decisions. Through exploratory data analysis (EDA), we analyze customer demographics, purchasing behavior, and product performance to provide actionable recommendations.</p><h3>Dataset Description:</h3><p>This dataset is a snapshot of a fictional retail landscape sourced from kaggle, capturing essential attributes that drive retail operations and customer interactions. It includes key details such as Transaction ID, Date, Customer ID, Gender, Age, Product Category, Quantity, Price per Unit, and Total Amount. These attributes enable a multifaceted exploration of sales trends, demographic influences, and purchasing behaviors.</p><p>Datasets: <a href="https://github.com/brightboy373/Exploratory-Data-Analysis-on-Retail-Sales-for-Oasis-Infobyte/blob/main/retail_sales_raw_dataset.csv">Link</a></p><h3>Goal of Analysis:</h3><p>The goal of the Analysis is to uncover insights into customer’s demographics, purchase behaviour, and lastly product performance.</p><h3>Methods</h3><p>Here are some processes and method I took to achieve my goal.</p><ol><li>Data Understanding: To be able to carry out this analysis, I needed to understand my data. This will enable me to be sure I have all the columns needed for my analysis and also check for inconsistencies in the dataset. I reviewed the dataset description and identified key attributes (e.g., Transaction ID, Date, Customer ID, Gender, Age, Product Category, Quantity, Price per Unit, Total Amount).</li><li>Data loading and Transformation: Since the datasets is relatively small. I opted to use Excel for the analysis. I loaded and formatted the datasets as a table using Ctrl + T.</li></ol><ul><li>Checked for null values, inconsistencies, and anomalies using Excel’s filter functionality.</li><li>Extracted the month and year from the Date column using the TEXT function.</li><li>Created an Age Group column to categorize customers into age brackets using the IFS function</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*rsJ0VzAlKGfaAuZjCxYOuw.png" /><figcaption>Extracted the month from Date Column using the function TEXT</figcaption></figure><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*dZomlp8SwVhzTYsJkGTbKQ.png" /><figcaption>Extracted the month from Date Column using the function TEXT</figcaption></figure><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*Nf1HiwErj70WuawSnxIWcw.png" /><figcaption>Used the IFS function to group the age of customers.</figcaption></figure><p>3. Used Descriptive Statistics to answer quick questions for stakeholders:</p><ul><li>Average revenue for May 2023 using the AVERAGEIFS function:</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*tONAXl1z5x8zbfNdycNmeQ.png" /></figure><ul><li>Total revenue generated from the Beauty product category using the SUMIF function.</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*HS_rYiz_ZqYp960WSjz3-Q.png" /></figure><ul><li>The number of female customers in November 2023 using the COUNTIFS function:</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*CTKp9pmK_XNGPA5UUxdCyg.png" /></figure><h3>Data Aggregation:</h3><p>Created pivot tables on two different Sheets to aggregate data for customer insights and product performance:</p><ul><li>Customer Segmentation Analysis: Aggregated data by gender, age group, and revenue.</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*PXMr_9UzKww6tDn-IhMf-w.png" /></figure><ul><li>Product Performance: Aggregated data by product category, gender, and revenue.</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*5cBybOVMaijEZpI27rpANQ.png" /></figure><h3>Data Visualization</h3><p>Created two dashboards in Excel to visualize insights:</p><ul><li>Customer Analysis Dashboard: Focused on customer demographics and purchasing behavior.</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*zt9vLNry9dwVHZER3RKXuQ.png" /></figure><ul><li>Product Performance Dashboard: Focused on product category performance and sales trends.</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*akfFk_fxTCUhU07EJwuliQ.png" /></figure><ul><li>Used charts such as donut charts, bar charts, line charts, KPIs to present insights effectively.</li></ul><h3>Insights:</h3><h3>Customer Analysis: Who’s Driving Our Sales?</h3><p>When it comes to retail sales, understanding your customers is the key to success. Our analysis of a fictional retail dataset revealed some fascinating trends about who’s shopping, what they’re buying, and why.</p><h4>Gender Distribution: Women Lead the Way</h4><ul><li><strong>51% of our customers are women</strong>, and they contribute <strong>51% of total revenue</strong>. This slight majority isn’t just about numbers — it’s about purchasing power. Women aren’t just shopping more frequently; they’re spending more per transaction, making them our most valuable customer segment.</li></ul><h4>Age Matters: The 51+ Group Dominates</h4><ul><li>The most active and loyal customers? Those aged <strong>51 and above</strong>. This group, largely female, is likely in or nearing retirement, with more time and disposable income to spend. They’re followed by customers aged <strong>41–50</strong>, who are financially stable and highly engaged.</li><li>Younger adults (<strong>21–30</strong>) are the third most active group. Early in their careers, they’re building their lifestyles and are more willing to spend. On the flip side, teenagers (<strong>15–20</strong>) are the least active, as they’re often financially dependent on parents.</li></ul><h4>Revenue Breakdown: Who’s Spending the Most?</h4><ul><li>The <strong>51+ age group</strong> generates the highest revenue, followed by <strong>21–30-year-olds</strong>. Older customers have more disposable income, while younger adults are in a spending phase. Customers aged <strong>31–40</strong> and <strong>41–50</strong> are more cautious, balancing spending with responsibilities like mortgages and family expenses. The <strong>15–20 age group</strong> contributes the least, as they’re still financially dependent.</li></ul><h4>Behavioral Insights: What Drives Purchases?</h4><ul><li><strong>51+ Customers</strong>: They value convenience and quality. Think loyalty programs or home delivery services to keep them engaged.</li><li><strong>21–30 Customers</strong>: They’re trend-driven and respond well to lifestyle-oriented campaigns.</li><li><strong>15–20 Customers</strong>: While they’re the least active now, targeted strategies like student discounts could unlock their potential.</li></ul><h3>Product Analysis: What’s Selling and Why?</h3><h4>Total Revenue and Sales Volume</h4><ul><li>Total revenue: <strong>$456,000</strong> (51% from women).</li><li>Total units sold: <strong>2,514</strong> (52% from women).<br>Women aren’t just our most valuable customers — they’re also driving the majority of sales volume.</li></ul><h4>Product Preferences by Gender</h4><ul><li><strong>Electronics</strong>: Men buy slightly more, but women aren’t far behind. Electronics are a household staple, and women often make these decisions.</li><li><strong>Beauty Products</strong>: Women dominate this category, while men purchase beauty items less frequently — likely for themselves or as gifts.</li><li><strong>Clothing</strong>: Men outspend women here, suggesting a strong interest in fashion and functionality.</li></ul><h4>Pricing and Demand</h4><ul><li><strong>Beauty Products</strong>: At <strong>$184.06</strong> on average, they’re the most expensive but sell the least (<strong>771 units</strong>).</li><li><strong>Clothing</strong>: The most affordable at <strong>$174.29</strong>, it’s also the top-selling category with <strong>894 units</strong>.</li><li><strong>Electronics</strong>: Priced at <strong>$181.90</strong>, they strike a balance, selling <strong>849 units</strong> and contributing <strong>34% of revenue</strong>.</li></ul><h4>Seasonal Trends</h4><ul><li><strong>May</strong> was our best month, with <strong>$53,150 in revenue</strong> and <strong>259 units sold</strong>. This spike suggests seasonal opportunities — think holiday promotions or targeted campaigns.</li></ul><h3>Recommendations: Turning Insights into Action</h3><p>Our analysis of retail sales data reveals clear trends about who’s shopping, what they’re buying, and how we can better serve them. Here’s how we can turn these insights into actionable strategies:</p><h4>1. Prioritize Female Customers</h4><p><strong>Why</strong>: Women make up <strong>51% of our customer base</strong> and contribute <strong>51% of total revenue</strong>. They’re not just frequent shoppers — they’re high spenders, likely due to their role in household purchasing decisions.</p><p><strong>How</strong>:</p><ul><li>Continue offering home-related products that appeal to women.</li><li>Expand marketing efforts to highlight convenience and quality, which resonate with this segment.</li></ul><h4>2. Don’t Overlook Male Customers</h4><ul><li><strong>Why</strong>: While men are slightly less active, they still show strong interest in categories like <strong>clothing</strong> and <strong>electronics</strong>.</li></ul><p><strong>How</strong>:</p><ul><li>Introduce more male-targeted products, especially in clothing and electronics.</li><li>Market convenience-focused solutions (e.g., fast delivery, easy returns) to appeal to their busy lifestyles.</li></ul><h4>3. Cater to the 51+ Age Group</h4><p><strong>Why</strong>: Customers aged <strong>51 and above</strong> are our most active and loyal shoppers, with the highest purchasing power. Their financial stability and free time make them a key demographic.</p><p><strong>How</strong>:</p><ul><li>Offer <strong>home delivery options</strong> to make shopping easier for them.</li><li>Create loyalty programs or exclusive discounts to reward their continued patronage.</li></ul><h4>4. Engage Younger Adults (21–30)</h4><ul><li><strong>Why</strong>: This group has high purchasing power, likely because they’re early in their careers with fewer financial responsibilities.</li></ul><p><strong>How</strong>:</p><ul><li>Launch trendy, lifestyle-oriented marketing campaigns.</li><li>Offer promotions or bundles that appeal to their desire for value and style.</li></ul><h4>5. Don’t Ignore Teenagers (15–20)</h4><p><strong>Why</strong>: While this group contributes the least to sales, they represent a future customer base. Parents may also purchase on their behalf.</p><p><strong>How</strong>:</p><ul><li>Introduce <strong>student discounts</strong> or back-to-school promotions.</li><li>Partner with schools or youth organizations to build brand awareness.</li></ul><h4>6. Optimize Product Offerings</h4><p><strong>Why</strong>:</p><ul><li><strong>Clothing</strong> is the top-selling category, driven by its affordability and necessity.</li><li><strong>Electronics</strong> are in high demand, especially for home entertainment and convenience.</li><li><strong>Beauty products</strong> have the highest average price but the lowest sales volume.</li></ul><p><strong>How</strong>:</p><ul><li>Prioritize <strong>clothing</strong> and <strong>electronics</strong>, as they consistently drive revenue.</li><li>Boost <strong>beauty product</strong> sales through discounts, bundles, or loyalty rewards.</li></ul><h4>7. Leverage Seasonal Trends</h4><p><strong>Why</strong>: Sales peak in <strong>May</strong> and <strong>October</strong>, suggesting strong seasonal demand.</p><p><strong>How</strong>:</p><ul><li>Launch targeted promotions and marketing campaigns during these months.</li><li>Offer holiday-themed discounts or limited-time deals to capitalize on increased spending.</li></ul><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=6bd7da18b540" width="1" height="1" alt="">]]></content:encoded>
        </item>
    </channel>
</rss>