Loading
- BUSINESS RESEARCH METHODS, 4TH SEMESTER BBA, BANGALORE CENTRAL UNIVERSITYVinutha T.N, Assistant Professor, MES Institute of Management1UNIT 4: STATISTICAL METHODSTabulation of data - Analysis of data –Testing of Hypothesis, Advanced techniques – ANOVA,Chi-Square - Discriminant Analysis - Factor analysis, Conjoint analysis - MultidimensionalScaling - Cluster AnalysisPROCESSING OPERATIONS IN RESEARCH METHODOLOGY1. Editing: Editing of data is a process of examining the collected raw data (specially insurveys) to detect errors and omissions and to correct these when possible. As a matter offact, editing involves a careful scrutiny of the completed questionnaires and/or schedules.Editing is done to assure that the data are accurate, consistent with other facts gathered,uniformly entered, as completed as possible and have been well arranged to facilitate codingand tabulation. With regard to points or stages at which editing should be done, one can talkof field editing and central editing. Field editing consists in the review of the reporting formsby the investigator for completing (translating or rewriting) what the latter has written inabbreviated and/or in illegible form at the time of recording the respondents’ responses.This type of editing is necessary in view of the fact that individual writing styles often canbe difficult for others to decipher. This sort of editing should be done as soon as possibleafter the interview, preferably on the very day or on the next day. While doing field editing,the investigator must restrain himself and must not correct errors of omission by simplyguessing what the informant would have said if the question had been asked.Central editing should take place when all forms or schedules have been completed andreturned to the office. This type of editing implies that all forms should get a thoroughediting by a single editor in a small study and by a team of editors in case of a large inquiry.Editor(s) may correct the obvious errors such as an entry in the wrong place, entry recordedin months when it should have been recorded in weeks, and the like. In case of inappropriateon missing replies, the editor can sometimes determine the proper answer by reviewing theother information in the schedule. At times, the respondent can be contacted forclarification. The editor must strike out the answer if the same is inappropriate and he hasno basis for determining the correct answer or the response. In such a case an editing entryof ‘no answer’ is called for. All the wrong replies, which are quite obvious, must be droppedfrom the final results, especially in the context of mail surveys.Editors must keep in view several points while performing their work: They should befamiliar with instructions given to the interviewers and coders as well as with the editinginstructions supplied to them for the purpose. While crossing out an original entry for onereason or another, they should just draw a single line on it so that the same may remainlegible. They must make entries (if any) on the form in some distinctive colour and that tooin a standardised form. They should initial all answers which they change or supply. Editor’sinitials and the date of editing should be placed on each completed form or schedule.2. Coding: Coding refers to the process of assigning numerals or other symbols to answers sothat responses can be put into a limited number of categories or classes. Such classes shouldbe appropriate to the research problem under consideration. They must also possess the
Page 1
- BUSINESS RESEARCH METHODS, 4TH SEMESTER BBA, BANGALORE CENTRAL UNIVERSITYVinutha T.N, Assistant Professor, MES Institute of Management2characteristic of exhaustiveness (i.e., there must be a class for every data item) and also thatof mutual exclusively which means that a specific answer can be placed in one and only onecell in a given category set. Another rule to be observed is that of unidimensional by whichis meant that every class is defined in terms of only one concept. Coding is necessary forefficient analysis and through it the several replies may be reduced to a small number ofclasses which contain the critical information required for analysis. Coding decisions shouldusually be taken at the designing stage of the questionnaire. This makes it possible toprecode the questionnaire choices and which in turn is helpful for computer tabulation asone can straight forward key punch from the original questionnaires. But in case of handcoding some standard method may be used. One such standard method is to code in themargin with a coloured pencil. The other method can be to transcribe the data from thequestionnaire to a coding sheet. Whatever method is adopted, one should see that codingerrors are altogether eliminated or reduced to the minimum level.3. Classification: Most research studies result in a large volume of raw data which must bereduced into homogeneous groups if we are to get meaningful relationships. This factnecessitates classification of data which happens to be the process of arranging data ingroups or classes on the basis of common characteristics. Data having a commoncharacteristic are placed in one class and in this way the entire data get divided into a numberof groups or classes. Classification can be one of the following two types, depending uponthe nature of the phenomenon involved4. Tabulation: When a mass of data has been assembled, it becomes necessary for theresearcher to arrange the same in some kind of concise and logical order. This procedure isreferred to as tabulation. Thus, tabulation is the process of summarising raw data anddisplaying the same in compact form (i.e., in the form of statistical tables) for furtheranalysis. In a broader sense, tabulation is an orderly arrangement of data in columns androws. Tabulation is essential because of the following reasons.1. It conserves space and reduces explanatory and descriptive statement to a minimum.2. It facilitates the process of comparison.3. It facilitates the summation of items and the detection of errors and omissions.4. It provides a basis for various statistical computations.Tabulation can be done by hand or by mechanical or electronic devices. The choice dependson the size and type of study, cost considerations, time pressures and the availaibility oftabulating machines or computers. In relatively large inquiries, we may use mechanical orcomputer tabulation if other factors are favourable and necessary facilities are available. Handtabulation is usually preferred in case of small inquiries where the number of questionnaires issmall and they are of relatively short length. Hand tabulation may be done using the direct tally,the list and tally or the card sort and count methods. When there are simple codes, it is feasibleto tally directly from the questionnaire. Under this method, the codes are written on a sheet ofpaper, called tally sheet, and for each response a stroke is marked against the code in which itfalls. Usually after every four strokes against a particular code, the fifth response is indicatedby drawing a diagonal or horizontal line through the strokes. These groups of five are easy to
Page 2
- BUSINESS RESEARCH METHODS, 4TH SEMESTER BBA, BANGALORE CENTRAL UNIVERSITYVinutha T.N, Assistant Professor, MES Institute of Management3count and the data are sorted against each code conveniently. In the listing method, the coderesponses may be transcribed onto a large work-sheet, allowing a line for each questionnaire.This way a large number of questionnaires can be listed on one work sheet. Tallies are thenmade for each question. The card sorting method is the most flexible hand tabulation. In thismethod the data are recorded on special cards of convenient size and shape with a series ofholes. Each hole stands for a code and when cards are stacked, a needle passes throughparticular hole representing a particular code. These cards are then separated and counted. Inthis way frequencies of various codes can be found out by the repetition of this technique. Wecan as well use the mechanical devices or the computer facility for tabulation purpose in casewe want quick results, our budget permits their use and we have a large volume of straightforward tabulation involving a number of cross-breaks.Tabulation may also be classified as simple and complex tabulation. The former type oftabulation gives information about one or more groups of independent questions, whereas thelatter type of tabulation shows the division of data in two or more categories and as such isdeigned to give information concerning one or more sets of inter-related questions. Simpletabulation generally results in one-way tables which supply answers to questions about onecharacteristic of data only. As against this, complex tabulation usually results in two-way tables(which give information about two inter-related characteristics of data), three-way tables(giving information about three interrelated characteristics of data) or still higher order tables,also known as manifold tables, which supply information about several interrelatedcharacteristics of data. Two-way tables, three-way tables or manifold tables are all examplesof what is sometimes described as cross tabulation.Generally accepted principles of tabulation: Such principles of tabulation, particularly ofconstructing statistical tables, can be briefly states as follows:1. Every table should have a clear, concise and adequate title so as to make the table intelligiblewithout reference to the text and this title should always be placed just above the body ofthe table.2. Every table should be given a distinct number to facilitate easy reference.3. The column headings (captions) and the row headings (stubs) of the table should be clearand brief.4. The units of measurement under each heading or sub-heading must always be indicated.5. Explanatory footnotes, if any, concerning the table should be placed directly beneath thetable, along with the reference symbols used in the table.6. Source or sources from where the data in the table have been obtained must be indicatedjust below the table.7. Usually the columns are separated from one another by lines which make the table morereadable and attractive. Lines are always drawn at the top and bottom of the table and belowthe captions.8. There should be thick lines to separate the data under one class from the data under anotherclass and the lines separating the sub-divisions of the classes should be comparatively thinlines.9. The columns may be numbered to facilitate reference.
Page 3
- BUSINESS RESEARCH METHODS, 4TH SEMESTER BBA, BANGALORE CENTRAL UNIVERSITYVinutha T.N, Assistant Professor, MES Institute of Management410. Those columns whose data are to be compared should be kept side by side. Similarly,percentages and/or averages must also be kept close to the data.11. It is generally considered better to approximate figures before tabulation as the same wouldreduce unnecessary details in the table itself.12. In order to emphasise the relative significance of certain categories, different kinds of type,spacing and indentations may be used.13. It is important that all column figures be properly aligned. Decimal points and (+) or (–)signs should be in perfect alignment.14. Abbreviations should be avoided to the extent possible and ditto marks should not be usedin the table.15. Miscellaneous and exceptional items, if any, should be usually placed in the last row of thetable.16. Table should be made as logical, clear, accurate and simple as possible. If the data happento be very large, they should not be crowded in a single table for that would make the tableunwieldy and inconvenient.17. Total of rows should normally be placed in the extreme right column and that of columnsshould be placed at the bottom.18. The arrangement of the categories in a table may be chronological, geographical,alphabetical or according to magnitude to facilitate comparison. Above all, the table mustsuit the needs and requirements of an investigation.ANALYSIS OF DATAData analysis embraces a whole range of activities of both the qualitative and quantitative type.It is usual tendency in behavioral research that much use of quantative analysis is made andstatistical methods and techniques are employed. The statistical methods and techniques areemployed. The statistical methods and techniques have got a special position in researchbecause they provide answers to the problems.Kaul defines data analysis as,” Studying the organized material in order to discover inherentfacts. The data are studied from as many angles as possible to explore the new facts.”Purpose:The following are the main purposes of data analysis:(i) Description: It involves a set of activities that are as essential first step in thedevelopment of most fields. A researcher must be able to identify a topic about whichmuch was not known; he must be able to convince others about its importance andmust be able to collect data.(ii) Construction of Measurement Scale: The researcher should construct a measurementscale. All numbers generated by measuring instruments can be placed into one of fourcategories:
Page 4
- BUSINESS RESEARCH METHODS, 4TH SEMESTER BBA, BANGALORE CENTRAL UNIVERSITYVinutha T.N, Assistant Professor, MES Institute of Management5(a) Nominal: The number serves as nothing more than labels. For example, no 1 wasnot less than No. 2. Similarly, No. 2 was neither more than no 1 and nor less than no3.(b) Ordinal: Such numbers are used to designate an ordering along some dimensionssuch as from less to more, from small to large, from sooner to later.(c) Interval: The interval provides more précised information than ordinal one. By thistype of measurement, the researcher can make exact and meaningful decisions. Forexample if A, B and C are of 150 cm, 145cm and 140 cm height, the researcher cansay that A is 5 cm taller than B and B is 5 cm taller than C.(d) Ratio Scale: It has two unique characteristics. The intervals between points can bedemonstrated to be precisely the same and the scale has a conceptually meaningfulzero point.(iii) Generating empirical relationships: Another purpose of analysis of data isidentification of regularities and relationships among data. The researcher has no clearidea about the relationship which will be found from the collected data. If the data wereavailable in details it will be easier to determine the relationship. The researcher candevelop theories if he is able to recognize pattern and order of data. The pattern maybe showing association among variables, which may be done by calculating correlationamong variables or showing order, precedence or priority. The derivation of empiricallaws may be made in the form of simple equations relating one interval or ratio scaledvariable to a few others through graph methods.(iv) Explanation and prediction: Generally knowledge and research are equated with theidentification of causal relationships and all research activities are directed to it. But inmany fields the research has not been developed to the level where causal explanationis possible or valid predictions can be made. In such a situation explanation andprediction is construct as enabling the values of one set of variables to be derived giventhe values of another.Functions: The following are the main functions of data analysis:(i) The researcher should analyze the available data for examining the statementof the problem.(ii) The researcher should analyze the available data for examining each hypothesisof the problem.(iii) The researcher should study the original records of the data before dataanalysis.(iv) The researcher should analyze the data for thinking about the research problemin lay man’s term.(v) The researcher should analyze the data by attacking it through statisticalcalculations.(vi) The researcher should think in terms of significant tables that the available datapermits for the analysis of data.
Page 5
- BUSINESS RESEARCH METHODS, 4TH SEMESTER BBA, BANGALORE CENTRAL UNIVERSITYVinutha T.N, Assistant Professor, MES Institute of Management6STATISTICAL CALCULATIONS:The researcher will have to use either descriptive statistics or inferential statistics for thepurpose of the analysis.The descriptive statistics may be on any of the following forms:(a) Measures of Central Tendency: These measures are mean, median, mode geometric meanand harmonic mean. In behavioral statistics the last two measures are not used. Which of thefirst three will be used in social statistics depends upon the nature of the problem.(b) Measures of Variability: These measures are range, mean deviation, quartile deviation andstandard deviation. In social statistics the first two measures are rarely used. The use of standarddeviation is very frequently made for the purpose of analysis.(c) Measures of Relative Position: These measures are standard scores (Z or T scores),percentiles and percentile ranks. All of them are used in educational statistics for data analysis.(d) Measures of Relationship: There measures are Co-efficient of Correlation, partialcorrelation and multiple correlations. All of them are used in educational statistics for theanalysis of data. However, the use of rank method is made more in comparison to Karl pearsonmethod.The inferential statistics may be in any one of the following forms:(a) Significance of Difference between Means: It is used to determine whether a truedifference exists between population means of two samples.(b) Analysis of Variance: The Z or t tests are used to determine whether there was anysignificant difference between the means of two random samples. The F test enables theresearcher to determine whether the sample means differ from one another to a greater extentthen the test scores differ from their own sample means using the F ratio.(c) Analysis of Co-Variance: It is an extension of analysis of variance to test the significanceof difference between means of final experimental data by taking into account the Correlationbetween the dependent variable and one or more Co-variates or control variables and byadjusting initial mean differences in the group.(d) Correlation Methods: Either of two methods of correlation can be used for the purpose ofcalculating the significance of the difference between Co-efficient of Correlation.(e) Chi Square Test: It is used to estimate the like hood that some factor other than chanceaccounts to the observed relationship. In this test the expected frequency and observedfrequency are used for evaluating Chi Square.(f) Regression Analysis: For calculating the probability of occurrence of any phenomenon orfor predicting the phenomenon or relationship between different variables regression analysisis cone.
Page 6
- BUSINESS RESEARCH METHODS, 4TH SEMESTER BBA, BANGALORE CENTRAL UNIVERSITYVinutha T.N, Assistant Professor, MES Institute of Management7ANOVA is essentially a procedure for testing the difference among different groups of datafor homogeneity. “The essence of ANOVA is that the total amount of variation in a set of datais brokendown into two types, that amount which can be attributed to chance and that amountwhich can be attributed to specified causes.” There may be variation between samples and alsowithin sample items. ANOVA consists in splitting the variance for analytical purposes. Hence,it is a method of analysing the variance to which a response is subject into its variouscomponents corresponding to various sources of variation. Through this technique one canexplain whether various varieties of seeds or fertilizers or soils differ significantly so that apolicy decision could be taken accordingly, concerning a particular variety in the context ofagriculture researches.TESTING OF HYPOTHESISThe word hypothesis consists of two words –Hypo+Thesis. ‘Hypo’ means tentative or subjectto the verification. ‘Thesis’ means statement about solution of the problem. Thus the literalmeaning of the term hypothesis is a tentative statement about the solution of the problem.Hypothesis offers a solution of the problem that is to be verified empirically and based on somerationale.Ordinarily, when one talks about hypothesis, one simply means a mere assumption or somesupposition to be proved or disproved. But for a researcher hypothesis is a formal question thathe intends to resolve.Quite often a research hypothesis is a predictive statement, capable of being tested by scientificmethods, that relates an independent variable to some dependent variable. For example,consider statements like the following ones.“Students who receive counselling will show a greater increase in creativity than students notreceiving counselling” Or “the automobile A is performing as well as automobile B.”These are hypotheses capable of being objectively verified and tested. Thus, we may concludethat a hypothesis states what we are looking for and it is a proposition which can be put to atest to determine its validity.BASIC CONCEPTS OF HYPOTHESIS TESTINGBasic concepts in the context of testing of hypotheses need to be explained.Null hypothesis and alternative hypothesis: In the context of statistical analysis, we oftentalk about null hypothesis and alternative hypothesis. If we are to compare method A withmethod B about its superiority and if we proceed on the assumption that both methods areequally good, then this assumption is termed as the null hypothesis. As against this, we maythink that the method A is superior or the method B is inferior, we are then stating what istermed as alternative hypothesis. The null hypothesis is generally symbolized as H0 and thealternative hypothesis as Ha. Suppose we want to test the hypothesis that the population meanbmg is equal to the hypothesised mean mH0 d i = 100.Then we would say that the null hypothesis is that the population mean is equal to thehypothesized mean 100 and symbolically we can express as:
Page 7
- BUSINESS RESEARCH METHODS, 4TH SEMESTER BBA, BANGALORE CENTRAL UNIVERSITYVinutha T.N, Assistant Professor, MES Institute of Management8If our sample results do not support this null hypothesis, we should conclude that somethingelse is true. What we conclude rejecting the null hypothesis is known as alternative hypothesis.In other words, the set of alternatives to the null hypothesis is referred to as the alternativehypothesis. If we accept H0, then we are rejecting Ha and if we reject H0, then we are acceptingHa. For H0 : m m = = H0 100 , we may consider three possible alternative hypotheses asfollows:The null hypothesis and the alternative hypothesis are chosen before the sample is drawn (theresearcher must avoid the error of deriving hypotheses from the data that he collects and thentesting the hypotheses from the same data). In the choice of null hypothesis, the followingconsiderations are usually kept in view:• Alternative hypothesis is usually the one which one wishes to prove and the null hypothesisis the one which one wishes to disprove. Thus, a null hypothesis represents the hypothesiswe are trying to reject, and alternative hypothesis represents all other possibilities.• If the rejection of a certain hypothesis when it is actually true involves great risk, it is takenas null hypothesis because then the probability of rejecting it when it is true is a (the levelof significance) which is chosen very small.• Null hypothesis should always be specific hypothesis i.e., it should not state about orapproximately a certain value. Generally, in hypothesis testing we proceed on the basis ofnull hypothesis, keeping the alternative hypothesis in view. Why so? The answer is that onthe assumption that null hypothesis is true, one can assign the probabilities to differentpossible sample results, but this cannot be done if we proceed with the alternativehypothesis. Hence the use of null hypothesis (at times also known as statistical hypothesis)is quite frequent.The level of significance: This is a very important concept in the context of hypothesistesting. It is always some percentage (usually 5%) which should be chosen with great care,thought and reason. In case we take the significance level at 5 per cent, then this impliesthat H0 will be rejected when the sampling result (i.e., observed evidence) has a less than0.05 probability of occurring if H0 is true. In other words, the 5 per cent level of significancemeans that researcher is willing to take as much as a 5 per cent risk of rejecting the nullhypothesis when it (H0) happens to be true. Thus, the significance level is the maximum
Page 8
- BUSINESS RESEARCH METHODS, 4TH SEMESTER BBA, BANGALORE CENTRAL UNIVERSITYVinutha T.N, Assistant Professor, MES Institute of Management9value of the probability of rejecting H0 when it is true and is usually determined in advancebefore testing the hypothesis.Decision rule or test of hypothesis: Given a hypothesis H0 and an alternative hypothesisHa, we make a rule which is known as decision rule according to which we accept H0 (i.e.,reject Ha) or reject H0 (i.e., accept Ha). For instance, if (H0 is that a certain lot is good(there are very few defective items in it) against Ha) that the lot is not good (there are toomany defective items in it), then we must decide the number of items to be tested and thecriterion for accepting or rejecting the hypothesis. We might test 10 items in the lot and planour decision saying that if there are none or only 1 defective item among the 10, we willaccept H0 otherwise we will reject H0 (or accept Ha). This sort of basis is known as decisionrule.Type I and Type II errors: In the context of testing of hypotheses, there are basically twotypes of errors we can make. We may reject H0 when H0 is true and we may accept H0when in fact H0 is not true. The former is known as Type I error and the latter as Type IIerror. In other words, Type I error means rejection of hypothesis which should have beenaccepted and Type II error means accepting the hypothesis which should have been rejected.Type I error is denoted by a (alpha)known as a error, also called the level of significance oftest; and Type II error is denoted by b (beta) known as b error. In a tabular form the saidtwo errors can be presented as follows:The probability of Type I error is usually determined in advance and is understood as the levelof significance of testing the hypothesis. If type I error is fixed at 5 per cent, it means that thereare about 5 chances in 100 that we will reject H0 when H0 is true. We can control Type I errorjust by fixing it at a lower level. For instance, if we fix it at 1 per cent, we will say that themaximum probability of committing Type I error would only be 0.01.But with a fixed sample size, n, when we try to reduce Type I error, the probability ofcommitting Type II error increases. Both types of errors cannot be reduced simultaneously.There is a trade-off between two types of errors which means that the probability of makingone type of error can only be reduced if we are willing to increase the probability of makingthe other type of error. To deal with this trade-off in business situations, decision-makers decidethe appropriate level of Type I error by examining the costs or penalties attached to both typesof errors. If Type I error involves the time and trouble of reworking a batch of chemicals thatshould have been accepted, whereas Type II error means taking a chance that an entire group
Page 9
- BUSINESS RESEARCH METHODS, 4TH SEMESTER BBA, BANGALORE CENTRAL UNIVERSITYVinutha T.N, Assistant Professor, MES Institute of Management10of users of this chemical compound will be poisoned, then in such a situation one should prefera Type I error to a Type II error. As a result one must set very high level for Type I error inone’s testing technique of a given hypothesis.2 Hence, in the testing of hypothesis, one mustmake all possible effort to strike an adequate balance between Type I and Type II errors.Two-tailed and One-tailed tests: In the context of hypothesis testing, these two terms arequite important and must be clearly understood. A two-tailed test rejects the null hypothesis if,say, the sample mean is significantly higher or lower than the hypothesised value of the meanof the population. Such a test is appropriate when the null hypothesis is some specified valueand the alternative hypothesis is a value not equal to the specified value of the null hypothesis.Symbolically, the two tailed test is appropriate when we havewhich may mean m > mH0 or m< mH0.Thus, in a two-tailed test, there are two rejection regions*, one on each tail of the curve whichcan be illustrated as under:Mathematically we can state:Acceptance Region A : Z < 1.96Rejection Region R : Z > 1.96
Page 10
Download this file to view remaining 3 pages
Related documents:
- American Literature Question bank with Answers - Question Bank
- Cultural Heritage of India SEMESTER III SG3CMT06 MCQs - MCQ
- MG6851 PRINCIPLES OF MANAGEMENT qbank - Question Bank
- Commerce MCQs (TYBCom Sem VI ) - MCQ
- Art Culture - Questions and answers - Notes
- Consumer Behavior MCQs - MCQ
- Physics (Paper I) 2017 Question Paper - Question Paper
- Business Law MCQs with Answers - MCQ
- Principles Of Electrical Communication MCQs - MCQ
- Sociology (Paper I) 2018 Question Paper - Question Paper
- English Prose - MCQ
- Testing of Materials Solved MCQs - MCQ
- Sociology Of Gender - Questions with answers - Question Bank
- Public Administration (Paper II) 2019 Question Paper - Question Paper
- Principles of Management QPaper NovDec 2017 - Question Paper
- Basic Electronics and Computer Fundamentals MCQs - MCQ
- Books Authors - Questions and answers - Notes
- MG6851 Principles of Management QBank - Question Bank
- SERVICES MARKETING - Notes
- CONCEPT RECAPITULATION TEST - IV - Question Paper