Dependecy of personal income on various factors by Egor Chernyi
Introduction
Income inequality and wage disparities are widely studied in economics and social sciences. Understanding the key factors that influence personal income can help individuals make informed career choices and assist policymakers in addressing wage gaps. This study aims to analyze the dependency of personal income on various factors, including age, education level, occupation, gender, marital status, and working hours. Using the Adult Income Dataset, which contains demographic and economic attributes of individuals, we will investigate which variables have the most significant impact on income levels. The key research question we seek to answer is: “How does personal income depend on various factors?” To address this question, I will: 1. Explore the dataset – Understand its structure, clean the data, and identify relevant variables. 2. Analyze statistical relationships – Examine correlations and distributions of income across different demographic and professional groups. 3. Visualize patterns – Use graphs and charts to illustrate trends and dependencies. By the end of this project, I aim to identify the most influential factors in shaping personal income, assess whether common assumptions—such as higher education leading to higher wages—hold true, and evaluate the effectiveness of a predictive model in classifying individuals based on their income level.
Run to view results
Data collecting and cleaning
Run to view results
Run to view results
Run to view results
Run to view results
Run to view results
Run to view results
By observing the unique values in the columns, I noticed that there are values that are "?" which represents unknown value. The categorical columns values also have a space in front, so we'll remove them to reduce confusion.
Run to view results
Run to view results
Run to view results
We will replace the "?" values with the most frequent values in each of the columns.
Run to view results
Run to view results
Data Analysis
Run to view results
Run to view results
How personal income affected by age
Run to view results
Higher-income individuals tend to be older. The average age for high earners is higher (~44 years) than that of lower earners (~37 years).
Run to view results
Run to view results
T-test also shows significance of age regarding personal income/ Possible explanation: -Older indivuduals have more work experience, leading to higher salaries. -Career growth and promotions often come with age -Younger indivuduals might still be in education, internships, or entry-level jobs Key insights: Age is correlated with income, ut there may be a plateau where salary growth slows down at a certain age
What employment type is highly rewarded
Run to view results
Private sector jobs dominate the dataset, but higher salaries are more common in self-employment. Key findings: - Private-sector is the most common work class, but mostly low-icnome earners - Self-employed has a higher proportion of high-earners, possibly due to business ownership and flexible earning potential - Government jobs are more stable, often requiring higher education, leading to relatively etter pay - Unpaid workers and those who never worked are not represented. Possible explanation: - Job type and industry matter: government and self-employed individuals may have specialaozed skills or ownership benefits - Career security: Government jos may offer structured promotions and pay raises over time
Race and income
Run to view results
The distribution of income across races is uneven. The White racial group has the highes proportion of high earners Key findings: - White individuals dominate the high-income group - Other racial groups, including Black, Asian, and Native American Individuals, have fewer high-income earners Possible explanation: - Systematic factors such as access to higher education and better job opportunities could contribute to these diffrences. - Occupational disparities and historical inequalities in USA may influence wage gaps
Personal icnome dependency on education level
Run to view results
Higher education levels strongly correlate with higher income. Key findings: - Individuals with Doctorate, Masters and Bachelors degrees have the highest-percentage of high-income earners - High-School graduates and below are mostly in the low-income group - Technical and Associate degrees offer moderate improvement in income potential Possible explanation: - Higher education typically leads to specialized skills and qualifications, making individuals eligble for higher-paying jobs - Many high-income careers require advanced degrees
Hard work pays off or personal income and working hours dependency
Run to view results
High-income individuals tend to work more hours per week on average. Key findings: - Most low-income individuals work around 40 hours per week - Higher- income individuals tend to work more than 40 hours per week, often on demanding or specialized roles -- Some individuals working fewer hours ut still earning high income might be on high-paying fields or self-employed Possible explanation: - Working more hiurs can increase total earnings, especially in hourly wage jobs - Some high-income professions require long-hours - Part-time jobs are more common among low-income earners
Final thoughts
This dataset provides valuable insights into income dependency on various factors. Next steps of this research could be: 1. In-depth statisticall analysis - Correlation analysis - Chi-square test etc. 2. Predictive modelling - Logistic Regression: Since income is binary, logistic regression could be used to predict whether an individual earns a high salary based on age, education, work class, and working hours