Dr.Chandra Hariharan Iyer: December 2012

Tuesday, 25 December 2012

RM- Basic Statistics for Research in Management (Uploaded for BMS students for the statistical analysis part of project work)

1. Population: The word 'population' or 'universe' denotes the aggregate or group of individual objects of any nature whose general characteristics are studied in a statistical investigation. The population may be finite or infinite.

2. Sample: A sample is a finite subset of the population, and the number of items in a sample is called the sample size. A sample may be large or small.

3. Standard error: The standard deviation of the sampling distribution of a statistic is known as its standard error.

4. Parameter and statistic: Statistical constants of the population, such as the mean (μ) and variance (σ²), are usually referred to as parameters. The corresponding measures computed from sample observations, such as the mean (x̄), standard deviation (s) and variance (s²), are known as statistics.
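
As a rough illustration of points 3 and 4, the sketch below computes the sample mean, sample variance, sample standard deviation and the standard error of the mean. The data values are made up for illustration, and the standard error formula s/√n is the usual one for the sample mean (it is not stated in the notes above).

```python
import math
import statistics

# Hypothetical sample of 8 observations (illustrative data only).
sample = [12, 15, 11, 14, 13, 16, 12, 15]

n = len(sample)
x_bar = statistics.mean(sample)      # sample mean, an estimate of mu
s2 = statistics.variance(sample)     # sample variance (divides by n - 1)
s = statistics.stdev(sample)         # sample standard deviation
standard_error = s / math.sqrt(n)    # standard error of the sample mean

print(x_bar, round(s2, 3), round(s, 3), round(standard_error, 3))
```

The population versions (dividing by n instead of n - 1) are available as `statistics.pvariance` and `statistics.pstdev`.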

5. "A hypothesis in statistics is simply a quantitative statement about a population". It is based on assumptions.

6. Null hypothesis is the hypothesis which is tested for possible rejection under the assumption that it is true, and is denoted as Ho.

7. Alternative hypothesis is the statement about the population, which gives an alternative to the null hypothesis and is denoted by H1.

8. Type I and Type II error: Rejection of the null hypothesis when it is true (and should be accepted) is known as Type I error. Acceptance of the null hypothesis when it is false (and should be rejected) is known as Type II error.

	Accept Ho	Reject Ho
Ho is true	Correct decision	Type I error
Ho is false	Type II error	Correct decision

9. Level of significance: In testing a given hypothesis, the maximum probability with which we are willing to risk a Type I error is called the level of significance of the test.

10. Critical value: The value of the test statistic which separates the sample space into the rejection region and the acceptance region is called the critical value.

11. Procedure for testing of hypothesis: 1. Set up the null hypothesis: Ho 2. Set up the alternative hypothesis: H1 3. Choose an appropriate level of significance 4. Calculate the test statistic Z = [t − E(t)] / S.E.(t), where t is the sample statistic, E(t) is its expected value under Ho and S.E.(t) is its standard error 5. Compare the computed value with the table value. For a two-tailed test, if |Z| > table value: reject the null hypothesis; if |Z| < table value: accept the null hypothesis
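
The five steps above can be sketched for the simplest case, a two-tailed test of a single mean with known population standard deviation. The function name, the figures used and the choice of a one-sample z-test are assumptions made for illustration only; the table value is taken from the standard normal distribution.

```python
import math
from statistics import NormalDist

def one_sample_z_test(sample_mean, mu0, sigma, n, alpha=0.05):
    """Two-tailed z-test of Ho: mu = mu0 (sigma assumed known)."""
    standard_error = sigma / math.sqrt(n)            # S.E. of the sample mean
    z = (sample_mean - mu0) / standard_error         # test statistic
    critical = NormalDist().inv_cdf(1 - alpha / 2)   # table value, about 1.96 at 5%
    decision = "reject Ho" if abs(z) > critical else "do not reject Ho"
    return z, critical, decision

# Hypothetical figures: sample of 64 with mean 52, testing mu = 50, sigma = 8.
z, critical, decision = one_sample_z_test(sample_mean=52.0, mu0=50.0, sigma=8.0, n=64)
print(round(z, 2), round(critical, 2), decision)
```

Here Z works out to 2.0, which is larger than the 5% table value of about 1.96, so the null hypothesis is rejected.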

12. One-tailed test: In any test, the critical region is represented by a portion of the area under the probability curve of the sampling distribution of the test statistic. A test of any statistical hypothesis where the alternative hypothesis is one tailed (right or left tailed) is called a one-tailed test.

13. Two-tailed test: A test of a statistical hypothesis where the alternative hypothesis is two-tailed, such as Ho: μ = μo against the alternative hypothesis H1: μ ≠ μo (that is, μ > μo or μ < μo), is known as a two-tailed test. In such a case the critical region is given by the portion of the area lying in both tails of the probability curve of the test statistic.

14. Non-parametric test: Tests which do not depend upon population parameters such as the mean and the variance are called non-parametric tests. Non-parametric statistics is a collection of tools for data analysis that offers a different approach to many decision problems. Non-parametric tests are distribution free; that is, they do not require the population to follow any particular distribution. They are simple to understand and easy to apply when sample sizes are small. Non-parametric tests make fewer and less stringent assumptions than the classical procedures, and they are less time consuming. The following methods are used in non-parametric testing: 1. The sign test 2. A rank sum test 3. The one-sample runs test 4. The Kruskal-Wallis or H test 5. Spearman's rank correlation procedure

15. Correlation analysis deals with the association between two or more variables. The significance of correlation is as follows: There are relationships of some kind between variables, for example between price and supply, or income and expenditure. When two variables are closely related, we can estimate the value of one variable given the value of the other. The effect of correlation is to reduce the range of uncertainty. If two variables tend to move together in the same direction, that is, an increase in the value of one variable is accompanied by an increase in the value of the other, then the correlation is called positive or direct correlation. If two variables tend to move in opposite directions, so that an increase or decrease in the value of one variable is accompanied by a decrease or increase in the value of the other, then the correlation is called negative or inverse correlation.

16. Rank correlation coefficient: In 1904, Charles Edward Spearman, a British psychologist, developed a method of determining the coefficient of correlation by ranks. This measure is useful in dealing with qualitative characteristics such as intelligence, beauty, morality, character etc. Features of Spearman's correlation coefficient: 1. The sum of the differences of ranks between the two variables is zero, that is, Σd = 0. 2. Spearman's correlation coefficient is distribution free.
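
A small sketch of the rank correlation calculation follows. It uses the standard formula 1 − 6Σd² / [n(n² − 1)], which is not written out in the notes above, and it assumes there are no tied values. The scores of the two judges are invented for illustration.

```python
def ranks(values):
    """Rank values from 1 (largest) to n. Assumes no tied values."""
    ordered = sorted(values, reverse=True)
    return [ordered.index(v) + 1 for v in values]

def spearman_rho(x, y):
    """Spearman's rank correlation: 1 - 6*sum(d^2) / (n*(n^2 - 1))."""
    rank_x, rank_y = ranks(x), ranks(y)
    d = [a - b for a, b in zip(rank_x, rank_y)]
    assert sum(d) == 0   # feature 1: the rank differences always sum to zero
    n = len(x)
    return 1 - 6 * sum(di ** 2 for di in d) / (n * (n ** 2 - 1))

# Hypothetical scores given by two judges to five candidates.
judge_a = [86, 72, 91, 65, 78]
judge_b = [80, 75, 88, 60, 70]
rho = spearman_rho(judge_a, judge_b)
print(round(rho, 2))
```

For these figures the ranks differ for only two candidates, so the coefficient comes out close to +1.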

17. "Regression is the measure of the average relationship between two or more variables in terms of the original units of data". Uses of regression analysis: 1. Regression analysis provides estimates of the value of the dependent variable from values of the independent variable. 2. With the help of the regression coefficients, we can calculate the correlation coefficient (r) and the coefficient of determination (r²). 3. The regression line equation helps to estimate the value of the dependent variable when the values of the independent variables are substituted in the equation.
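
The uses listed above can be illustrated with a simple two-variable regression computed by least squares. The data and variable names are hypothetical. The sketch also checks use 2: the product of the two regression coefficients (y on x, and x on y) equals r².

```python
import math

def simple_linear_regression(x, y):
    """Least-squares line y = intercept + slope * x, plus correlation r."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    slope = sxy / sxx                  # regression coefficient of y on x
    intercept = mean_y - slope * mean_x
    b_xy = sxy / syy                   # regression coefficient of x on y
    r = sxy / math.sqrt(sxx * syy)     # correlation coefficient
    return slope, intercept, b_xy, r

# Hypothetical data: advertising spend (x) and sales (y).
advertising = [1, 2, 3, 4, 5]
sales = [2, 4, 5, 4, 5]
slope, intercept, b_xy, r = simple_linear_regression(advertising, sales)

predicted_sales = intercept + slope * 6   # estimate y for x = 6
print(round(slope, 2), round(intercept, 2), round(r ** 2, 2), round(predicted_sales, 2))
```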

18. A time series may be defined as a collection of readings of some economic variable or composite of variables belonging to different periods. The changes in the value of a variable over different periods of time are due to many factors. These factors are called the components of a time series. The components are: 1. Trend 2. Seasonal changes 3. Cyclical changes 4. Irregular or random fluctuations.

(Source: Text book by S.P.Gupta, Indira Gupta, Notes given by Girija vallaban sir, IGNOU study material)

RM-Statistical analysis in Excel: a few tips

Excel Central Tendency and Variability Functions
Function	What It Calculates
AVERAGE	Mean of a set of numbers
AVERAGEIF	Mean of a set of numbers that meet a condition
AVERAGEIFS	Mean of a set of numbers that meet one or more conditions
HARMEAN	Harmonic mean of a set of positive numbers
GEOMEAN	Geometric mean of a set of positive numbers
MODE	Mode of a set of numbers
MEDIAN	Median of a set of numbers
VARP	Variance of a set of numbers considered to be a population
VAR	Variance of a set of numbers considered to be a sample
STDEVP	Standard deviation of a set of numbers considered to be a population
STDEV	Standard deviation of a set of numbers considered to be a sample
STANDARDIZE	A standard score based on a given mean and standard deviation
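
For readers who want to cross-check a worksheet outside Excel, here is a rough sketch of the same measures using Python's standard `statistics` module. The pairing of each Python function with an Excel function in the comments is my own mapping, and the data are made up.

```python
import statistics

# Hypothetical data set (illustrative only).
data = [4, 8, 8, 5, 10]

mean = statistics.mean(data)                  # like AVERAGE
harmonic = statistics.harmonic_mean(data)     # like HARMEAN
geometric = statistics.geometric_mean(data)   # like GEOMEAN (Python 3.8+)
mode = statistics.mode(data)                  # like MODE
median = statistics.median(data)              # like MEDIAN
pop_var = statistics.pvariance(data)          # like VARP (divides by n)
sample_var = statistics.variance(data)        # like VAR (divides by n - 1)
pop_sd = statistics.pstdev(data)              # like STDEVP
sample_sd = statistics.stdev(data)            # like STDEV

# Like STANDARDIZE(x, mean, sd): the standard score of the value 10.
z_score = (10 - mean) / sample_sd

print(mean, median, mode, pop_var, sample_var, round(z_score, 3))
```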

Excel Relative Standing Functions
Function	What It Calculates
RANK	Rank of a number in a set of numbers
PERCENTRANK	Rank of a number expressed as a percent
PERCENTILE	The indicated percentile in a set of numbers
QUARTILE	The 1st, 2nd, 3rd, or 4th quartile of a set of numbers

Excel Correlation and Regression Functions
Function	What It Calculates
CORREL	Correlation coefficient between two sets of numbers
PEARSON	Same as CORREL. (Go figure!)
RSQ	Coefficient of determination between two sets of numbers (square of the correlation coefficient)
SLOPE	Slope of a regression line through two sets of numbers
INTERCEPT	Intercept of a regression line through two sets of numbers
STEYX	Standard error of estimate for a regression line through two sets of numbers

Excel Data Analysis Tools
Tool	What It Does
Anova: Single Factor	Analysis of variance for two or more samples
Anova: Two Factor with Replication	Analysis of variance with two independent variables, and multiple observations in each combination of the levels of the variables.
Anova: Two Factor without Replication	Analysis of variance with two independent variables, and one observation in each combination of the levels of the variables.
Correlation	With more than two measurements on a sample of individuals, calculates a matrix of correlation coefficients for all possible pairs of the measurements
Covariance	With more than two measurements on a sample of individuals, calculates a matrix of covariances for all possible pairs of the measurements
Descriptive Statistics	Generates a report of central tendency, variability, and other characteristics of values in the selected range of cells
Exponential Smoothing	In a sequence of values, calculates a prediction based on a preceding set of values, and on a prior prediction for those values
F-Test Two Sample for Variances	Performs an F-test to compare two variances
Histogram	Tabulates individual and cumulative frequencies for values in the selected range of cells
Moving Average	In a sequence of values, calculates a prediction which is the average of a specified number of preceding values
Random Number Generation	Provides a specified amount of random numbers generated from one of seven possible distributions
Rank and Percentile	Creates a table that shows the ordinal rank and the percentage rank of each value in a set of values
Regression	Creates a report of the regression statistics based on linear regression through a set of data containing one dependent variable and one or more independent variables
Sampling	Creates a sample from the values in a specified range of cells
t-Test: Two Sample	Three t-test tools test the difference between two means. One assumes equal variances in the two samples. Another assumes unequal variances in the two samples. The third assumes matched samples.
z-Test: Two Sample for Means	Performs a two-sample z-test to compare two means when the variances are known
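
Two of the tools in the table, Moving Average and Exponential Smoothing, are easy to sketch by hand. The version below is a minimal illustration and not a reproduction of Excel's exact output: the smoothing constant `alpha` and the choice to seed the first forecast with the first observation are assumptions, and the sales figures are invented.

```python
def moving_average(values, window):
    """Average of each run of `window` consecutive values."""
    return [
        sum(values[i - window + 1 : i + 1]) / window
        for i in range(window - 1, len(values))
    ]

def exponential_smoothing(values, alpha):
    """Forecast for each period from the prior actual and prior forecast."""
    forecasts = [values[0]]   # assumption: first forecast = first observation
    for t in range(1, len(values)):
        forecasts.append(alpha * values[t - 1] + (1 - alpha) * forecasts[t - 1])
    return forecasts

# Hypothetical monthly sales figures.
sales = [10, 12, 14, 16, 18]
averages = moving_average(sales, window=3)
forecasts = exponential_smoothing(sales, alpha=0.5)
print(averages)
print(forecasts)
```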

RM-Tools for statistical analysis

1) ANOVA: ANOVA can be used to examine differences among the means of several different groups at once. It is a statistical technique for assessing how nominal independent variables influence a continuous dependent variable.
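
As a rough sketch of what a single-factor ANOVA computes, the function below works out the F ratio (variation between group means divided by variation within groups). The three groups are hypothetical data; the computed F is then compared with the F table value for (k − 1, n − k) degrees of freedom.

```python
def one_way_anova_f(groups):
    """F ratio for a single-factor ANOVA on a list of groups."""
    all_values = [v for group in groups for v in group]
    n, k = len(all_values), len(groups)
    grand_mean = sum(all_values) / n
    group_means = [sum(group) / len(group) for group in groups]

    # Sum of squares between groups and within groups.
    ss_between = sum(
        len(group) * (m - grand_mean) ** 2 for group, m in zip(groups, group_means)
    )
    ss_within = sum(
        (v - m) ** 2 for group, m in zip(groups, group_means) for v in group
    )

    ms_between = ss_between / (k - 1)   # mean square between, k - 1 d.f.
    ms_within = ss_within / (n - k)     # mean square within, n - k d.f.
    return ms_between / ms_within

# Hypothetical satisfaction scores for three groups of respondents.
groups = [[4, 5, 6], [6, 7, 8], [8, 9, 10]]
f_ratio = one_way_anova_f(groups)
print(f_ratio)
```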

2) Correlation analysis: Correlation is the study of the relationship between variables. If there are only 2 variables in the study, it is called simple correlation; otherwise it is either partial or multiple correlation. In this study, simple inter-correlation analysis is performed between the selected variables and the results are presented in the form of a correlation matrix. Further, the significance of each correlation is tested at the 5% level of significance.

3) Multiple regression analysis: Multiple regression analysis establishes a functional relationship between a dependent variable and a set of independent variables. In this section the results of the multiple regression analysis between the dependent variable and the independent variables are presented.

4) Chi square analysis: The Chi square test is used in studies in social science and management for testing the independence of two attributes. Each of the personal factors is compared with the study factor, the chi square test is applied, and the results are described in terms of personal factors, chi-square values (χ²), p values and their significance (S/NS) for the factor studied. The results are presented with suitable hypotheses and relevant interpretations.
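
A minimal sketch of the chi square test of independence is given below. The 2 x 2 table (gender against satisfied / not satisfied) is invented for illustration. The statistic is the usual sum of (observed − expected)² / expected over all cells, with expected frequency = row total × column total / grand total.

```python
def chi_square_independence(observed):
    """Chi-square statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)

    chi2 = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            chi2 += (obs - expected) ** 2 / expected

    dof = (len(observed) - 1) * (len(observed[0]) - 1)
    return chi2, dof

# Hypothetical counts: rows = male / female, columns = satisfied / not satisfied.
observed = [[30, 20], [20, 30]]
chi2, dof = chi_square_independence(observed)
print(chi2, dof)
```

With 1 degree of freedom the 5% table value is 3.841, so a computed value of 4.0 would be marked significant (S) at the 5% level.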

5) Average Score analysis: Average score analysis is mainly used to assess the level of opinion/awareness/satisfaction of the different categories of respondents on the various aspects relating to the study. First the opinions of the respondents are assessed through a scaling technique, and then, based on the consolidated opinions of the respondents, the average score is calculated.
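
One common way to do this, shown here only as an assumed example, is a five-point scale where each response category carries a weight from 5 (highly satisfied) to 1 (highly dissatisfied) and the average score is the weighted mean. The weights and respondent counts below are hypothetical.

```python
def average_score(counts, weights):
    """Weighted mean score: sum(count * weight) / total respondents."""
    total_respondents = sum(counts)
    weighted_total = sum(c * w for c, w in zip(counts, weights))
    return weighted_total / total_respondents

# Hypothetical five-point scale, from highly satisfied (5) to highly dissatisfied (1).
weights = [5, 4, 3, 2, 1]
counts = [20, 30, 25, 15, 10]   # number of respondents choosing each category

score = average_score(counts, weights)
print(score)
```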

6) Percentage Analysis: It is a simple and common method of representing raw data as percentages for a better understanding of the collected data. Percentages are used in making comparisons between two or more variables to find the efficacy of each variable.

Basics of statistical analysis

Statistical analysis

This term refers to a wide range of techniques to. . . 1. (Describe) 2. Explore 3. Understand 4. Prove 5. Predict . . . based on sample datasets collected from populations, using some sampling strategy.

Why?

1. We want to summarize some data in a shorter form 2. We are trying to understand some process and possibly predict based on this understanding • So we need to model it, i.e. make a conceptual or mathematical representation, from which we infer the process. • But how do we know if the model is “correct”? * Are we imagining relations where there are none? * Are there true relations we haven’t found? • Statistical analysis gives us a way to quantify the confidence we can have in our inferences.

Populations and samples

• Population: a set of elements (individuals) * Finite vs. “infinite” • Sample: a subset of elements taken from a population * Representative vs. biased • We make inferences about a population from a sample taken from it. • In some situations we can examine the entire population; then there is no inference from a sample. Example: all pixels in an image.

Types of Variables

1. Nominal 2. Ordinal 3. Interval 4. Ratio

Data analysis strategy

1. Posing the research questions 2. Examining data items and their support 3. Exploratory non-spatial data analysis 4. Non-spatial modelling 5. Exploratory spatial data analysis
6. Spatial modelling 7. Prediction 8. Answering the research questions

RM-MM sample example

Marketing	1. Consumers outlook	1. Towards department stores in ________ city	1. Type of research 2. Sampling area: 3. population 4. sample size 5. sampling design 6. primary data 7. secondary data 8. Research instruments 9. Research analytical tools	1. To find out consumer opinion towards department store in______	CRM	Analysis of Relationship between	Gender, age, marital status, monthly income, family size, class, locality,	Customer support, price, satisfaction, feel good, budget, quality, time pass, ease, availability, fast, value for money, pride, respect, point benefits, gifts, product range, neatness, tidy, good CRM, parking, easy availability, promotional schemes, wide product range
				2. To find out the reasons for purchasing in department store
				3. To know consumers satisfaction level towards department stores

RM-FM selection one sample example

Finance	1. Mutual funds	Perception of investors	1. Type of research 2. Sampling area: 3. population 4. sample size 5. sampling design 6. primary data 7. secondary data 8. Research instruments 9. Research analytical tools	The basic objective of the study is to analyze the quality of financial institutions in ____________ giving the investment expertise to the mutual fund investors.	Factors impacting investment	There is no significant difference about investor’s perception and financial expertise of the mutual fund services providers amongst investors classified by gender/Age/Qualification/Income/	Investment factors	1. Technology: Builds the access to the knowledge portals with increased speed 2. Innovation : More creative options of investment created 3. Counseling Gives the security to the investor developed 4. Motivation Builds the positive attitude 5. Quality information Builds the empowerment 6. Collaboration Increases the learning and networking 7. Commercialization Use of knowledge in the industry 8. Information technology Builds the virtual teams and makes the investor emop 9. Marketing strategies Connects to the society and builds the faith 10. Globalization Exposure to the international environment 11. Government support Builds the quality and faith with better assurance 12. Certification Increases the credibility
				To determine the impact of mutual fund investment on investor’s perceptions and satisfaction	Impact of MF investment	There is no significant difference about investor’s perception and mutual fund performance classified by gender/Age/Qualification/Income/	Impact of MF investment	1. contribution to owners : growth, profitability, realization of objectives and goals, development and innovation of new facilities, and cources 2. to investors: value added services, optimum cost, build loyalty, trust and faith, safe and secured returns 3. to society: giving liquidity for companies for business performance, building pool of savings, better r&d to new products of investments
				To frame the suggested strategy for providing the quality investment products through the mutual funds.	Risk assessment	There is no significant difference about investor’s perception and mutual fund risks classified by gender/Age/Qualification/Income/	Risks	Country risk, Credit risk, Currency risk, Interest rate risk, Liquidity risk, Market risk
	2. Stock market	Perception of investors	1. Type of research 2. Sampling area: 3. population 4. sample size 5. sampling design 6. primary data 7. secondary data 8. Research instruments 9. Research analytical tools
	2. Stock market	Perception of Brokers
	3. Investment analysis/Portfolio management	Investors attitude
		Investment objectives
		impact of market movements
		Financial statement fraud
		Towards commodity derivatives-Futures, forward, swaps etc
		Peoples preference as to investment
		Peoples preference as to portfolio
		Peoples preference as to industry
		Corporate governance
		CSR
