Estimating the neighborhood influence on decision makers : theory and an application on the analysis of innovation decisions

(1)

THEORY AND AN APPLICATION

ON THE ANALYSIS OF INNOVATION DECISIONS

NIKOLAUS HAUTSCH AND STEFAN KLOTZ

NIKOLAUS.HAUTSCH@UNI-KONSTANZ.DE STEFAN.KLOTZ@UNI-KONSTANZ.DE

CENTER OF FINANCE AND ECONOMETRICS (COFE)

UNIVERSITY OF KONSTANZ

Abstract. Whenmakingdecisions,agentstendtomakeuseofdecisionsoth-

ershavemadeinsimilarsituations. Ignoringthisbehaviorinempiricalmodels

can be interpreted as a problem of omittedvariables and mayseriously bias

parameterestimatesandharminference. Wesuggestapossibilityofintegrat-

ingsuchoutsideinuencesintomodelsofdiscretechoicedecisionsbydening

an abstract space in which agents with similar characteristics are neighbors

whopossiblyinuenceeachother. Inordertocorrectforcorrelationsbetween

the characteristics,the designofthis space allowsfornonorthogonalityof its

dimensions. SeveralMonteCarlosimulationsshowthesmallsampleproperties

of spatial models with binary choice. When applying theestimator to inno-

vation decisionsdata of Germanrms, wend evidence for the existenceof

neighborhoodeects.

JEL classication: C21,C25, C44, O33, R15

Key Words: decision models; discrete choice; neighborhood inuence; spatial

econometrics;socialspace; Euclidean measure

An earlier version of this papers was presented at the 1999 meeting of the

EconometricSocietyinSantiagodeCompostela,Spain. Wewouldliketothank

the ZEW for providing us with the Mannheim Innovation Panel. Financial

support by the CoFE and the Deutsche Forschungsgemeinschaft is gratefully

acknowledged. For helpful comments, we would like to thank Frank Gerhard,

DieterHess,JoachimInkmann,Michael Lechner,WinfriedPohlmeier, partic-

ipants of workshops at the Universities of Konstanz and St. Gallen, and an

anonymous referee. The usualdisclaimer applies.

Corresponding author: NikolausHautsch,BoxD124, UniversityofKonstanz,

78457 Konstanz, Germany.

May18,2001.

(2)

1. Introduction

Theamountofinformationisakeyfactorindecisionmakingprocesses. There-

fore, individuals try totake as much informationinto account as possible. So it

may be sensible, when uncertainty exists, also to use information about the de-

cisions of others, who have had a comparable decision to make, as a yardstick.

Furthermore, from a sociological point of view, humans often tend to seek as-

surance as to whether their thinking is reasonable. As a result, an individual's

decision for a particular alternative is more likely when he realizes that people

with similar features have come to the same conclusion. An additionalrationale

in observing the decisions of others occurs when one is held responsible for the

success of his decision - as e.g. managers are: a mistake can be more readily

excused if others have also made the same mistake. Forall of these reasons, we

might expect that the decisions of agents in similar situations should correlate

even after controlling for similarity. This paper aims to show how this interde-

pendence of decisions can bemodeled and estimated.

The benetofintegratingsuchoutsideinuencesis twofold. On theone hand,

it seems to be very interesting to check whether there is empirical evidence for

correlation between decisionmakers. On the other hand, from a more technical

point of view, ignoring existing dependencies of other people's decisions could

lead tobiased parameter estimates (cf. Anselin and GriÆth(1988))or tobiased

estimates of standard errors (McMillen(1995), for the probit case).

But, why exactly should a decision maker pay attention to the decisions that

otherindividualshave reached? Suchsocialinteractionsare rationalifthe utility

expectedtoresultfromaparticulardecisionisraisedasobeyingotherindividuals

means an enlargement of the own relevant information set (Brock and Durlauf

(1995)). BesleyandCase (1995)provideanexampleofhowasymmetricinforma-

tion can lead to such mechanisms: voters lack information which would enable

them to judge the state or county government on an absolute scale. Therefore,

theytendtomakecomparisonsbetweenincumbentsandendupwithforcingthem

intoayardstick competition,usingarelativemeasureinsteadofanabsoluteone.

(3)

Topa(1997)hintstothepointthatvaluableinformationmaybecommunicated

secretly in social networks, which may result in correlating decisions. Labor

markettheoryprovidesyetanotherreason: Insomesituations,notone'sabsolute

utilitymay beconsidered asrelevant,but hisrelativeposition. In this lineis the

arguing of the fair wage-eort hypothesis (see e.g. Akerlof and Yellen (1990)):

workers want to be paid at a `fair' level. This puts pressure on managers to

conform their wage decision to the level of comparable, i.e. social neighboring,

rms. Labor market related behavior of investment managers is as well blamed

for the so called herd behavior of professional investors on nancial markets,

according to Scharfstein and Stein (1990). As `it is better for reputation to

fail conventionally than to succeed unconventionally', few incentives are given

not to followthe herd. However, there's a general human tendency to conform

to others' behavior which can be perfectly rational, as the society rewards not

deviatingfrom accepted norms (Bernheim(1994)).

The wish for conformity may be even higher when there are only two alter-

natives, leaving noroomto opt for a compromise. However, in many situations

individualsarefaced with discretechoices, e.g. inwhich foreigncountrytobuild

a new plant, whether to do an IPO or to start an innovation project are some

examplesof binary choices.

Inthispaper,wewouldliketoshowhowsuchinuencescanbeconsideredwhen

modelling and estimating decision behavior. As we assume that such inuence

areespeciallyprevalentfordiscrete choicedecisions, weconcentrateonmodelsof

binary and multiple choice. Therefore, we suggest a modelwhich would identify

socialneighbors - that's what we calldecision makers who have to solve similar

decision problems - and would estimate the inuence neighbors have on each

other. The neighborhood dening characteristics are used as dimensions of an

abstract spaceinwhichthe individualsare located. Incontrast tothe Euclidean

assumptioncommonlyused, the space'sdesign which wesuggest ismore general

inthe sensethat the dimensions are not restricted tobeorthogonal. This allows

tocorrect forpossiblecorrelations between the socialcharacteristics. Estimating

(4)

poor,whereas the size of the distance matrix can be challenging, as it increases

by the square of the samplesize. The samplesize problemisinthe center ofour

Monte Carlo study.

We apply the suggested methodology to German innovation decision data.

Decisions whether about innovation eorts are highlylikely to be inuenced by

the behavior of rms that are competitors on the output and/or on a factor

market. The case of product innovations perfectly ts to our considerations, as

deciders can rather easy observe product innovations of their competitors when

they have been realized. In contrast, they only have an imperfect inside about

reasons and success of the innovations, and about planned innovations yet to

be realized. As deciders cannot learn by observing these more or less secret

features, they will restricted to utilize as an additional information only the

pure information of the observed decision to realize a product information. An

incentiveformimicingacompetitor'sdecisioncanalsobelabormarketpressures,

described abovefornancialinvestmentmanagers,towhichrms'managersmay

be subjects.

The remainder of the paper is organized as follows: The next section will

discuss reasonsforincludingneighborhoodinuences andpresent the modeland

the estimation strategy. In section 3, Monte Carlo studies will show how our

model works and illustrate the inuence the sample size has on the estimates'

quality. The application tothe innovation decision follows in section 4. Section

5 concludes and providessome additionalideas for research onthis topic.

2. An econometric approach to include neighborhood influences

2.1. General considerations. Technically, the interdependence of decisions

leads to correlations between some or all decision makers. Such cross section

correlationbetweenobservationsisawell-known phenomenon inregionalscience

wherecontiguousor,moregeneral,neighboringspatialunits ofteninuenceeach

other; e.g. the unemployment rates of counties are spatially correlated because

(5)

crosscorrelation isoftencalledspatial correlation. But this occurrence isnot re-

strictedto geographical spaces: observations can be thought of as being located

in an abstract space, with certain social characteristics being the dimensions.

Then,if observations are neighbors inthis abstract space,these observations are

said tobe similar.

Meanwhile, a broad range of methods exists - at least for continuous depen-

dent variables - which analyze spatial correlation 1

. However, relatively little

research has been done on limited dependent variables in the context of spa-

tialcorrelation 2

. In aninnovation adoption framework, Case (1992)presents an

estimation strategy for probit models with spatial dependence in the explained

decision result. Unfortunately, the estimation scheme is only applicable to the

unsatisfactory case of block wise dependence: the inuence parameter between

two potential adopters is set to one if they are located in the same region, and

to zero otherwise. Another feature which is questionable in the context of (in-

novation) decisions is the assumption that the dierent individuals make their

decisionsimultaneously. Often,anindividualisonlyable toobserveothers'deci-

sionsafterthey have been made,sotaking the timepatternintoconsiderationis

indispensableformanysituations. Aspatialinnovationdiusionprocessforlogit

modelsis presented by Dubin(1995): In hertwoperiodmodel, rms which have

tomakea decisiondonot inuence themselves simultaneously. Instead, if arm

did not decide to innovate in the rst period,it reconsiders whether to innovate

in the second period, and then pays attention to the decisions that other rms

have already made inthe rst period.

The binary decision y

i

of rm i in the second period depends on the latent

scalarvariabley

i

(1) y

i

=

1 ; if y

i

>0;

0 ; else;

and the latentvariablespecied as

(2) y

i

=x 0

i +

N

X

j=1

ij y

1

j +u

i

; i=1;:::;N;

1

Foranintroduction, seethetextbookof Anselin(1988),orPartIIofFlorax(1992).

2

(6)

where x

i

is a vector of the explanatory variables associated with rm i, with

as the corresponding coeÆcient vector. The outside inuence emanates from

decisionsy 1

j

whichallj rms madeinthe rstperiod,weightedbytheinuence

ij

whichrm j has onrm i, and

ij

being 0. The logistic error term(assumed

to be i.i.d.) is denoted by u

i .

Hence,Dubinmodelsinnovationdecisionsthatdependontherms'(geograph-

ical) distance to prior innovators - more specically, adoptinga new technology

becomes more likely the more rst periodadopters are in the neighborhood. In

this framework Dubin makes arestrictive assumption: Onlyrms who have not

yet adopted an innovation in the prior period are allowed to face an innovation

decision in the current period,i.e. rms can innovate only in one of the two pe-

riods. This refers to an epidemic diusion modelling which is restricted to one

period which is aected by the dispersion.

In the following section, we present a model which is more general in three

respects:

First,decisionsdependnotonlyonpriordecisionsofothersbutalsoontheown

prior decision. Herewith, we donot onlymeasure the inuence a decisionmaker

may have on himself, but we also avoid that, when estimating, this inuence is

caught by the neighborhood eect. We assume all decisions made in this rst

period asexogenously given.

Second, ourapproachisnotrestrictedtobinarychoiceproblems. However, the

multiple choice setting even providesthe possibility of estimating specic neigh-

borhood impacts, i.e. neighborhood relationships depending also on each single

alternative,oering the possibility todeal with sophisticated decisionproblems.

Third, we provide a suggestion for an advanced solution to the central prob-

lemonefaceswhenmodellingneighborhoodinuences: providinganappropriate

concept tomeasure the degreeof similarity,i.e. the 'socialdistance' between in-

dividuals. Themost convenientstrategy istodenea spacewithinwhichweare

able to measure distances between decision makers. Whereas Case (1992) and

Dubin (1995) analyze two-dimensional geographic spaces, we allow for multi-

(7)

becomes less and less important while individualsand rms pay more and more

attention to those being in a comparable situation. For similar reasons, Case

andKatz (1991)successfully use a concept of socialinsteadof geographicneigh-

borhood. (See Akerlof (1997), for a model of social space.) Griliches (1992)

established the innovation space, usingcertain characteristics of rms asdimen-

sions for the abstract space and measures these abstract distances by using an

Euclidean measure. The drawback of using an Euclidean space is the implied

assumption of uncorrelatedness of the social characteristics, which corresponds

toorthogonal dimensions spanning the space. We develop this concept by drop-

ping this unrealistic assumption of an Euclidean space. Instead, we allow the

dimensions of our space to be not orthogonal according to correlations between

the characteristics.

2.2. The basic model of binary choice decisions. We inspect the decision

makingprocess ofN individualswho areconfronted withacertainbinary choice

problem. Theirdecisionsdependontheirowncharacteristics,ontheowndecision

withrespecttothesameprobleminapriorperiod(whichisincontrasttoDubin's

approach), andondecisionsof theother decisionmakers inthis priorperiod(for

which we use an alternative modelling). In the binary setup, the decision y

i of

decisionmaker i depends, as inequation (1), on the latent variable y

i

which we

model as

(3) y

i

=+x 0

i +y

1

i

+A 1

i +u

i

; i=1;:::;N;

where is a scalar constant. x

i

is a K 1 vector of K explanatory variables

associated with i, and is the corresponding coeÆcient vector. y 1

i

describes

the decision outcome of individual i in the prior period (i.e., 0 or 1), with the

scalar parameter measuring the impact this own decision has on the current

period. A 1

i

isascalarreectingtheneighborhoodimpactofallotherindividuals

on decision maker i, and u

i

is an i.i.d. N(0;

2

) error term, where E [A 1

i u

i ] =0

is assumed. We model A 1

i

as the weighted sum of the decisions y 1

j

which the

individuals j, that are i's neighbors, made in the prior period. The weights

used for A 1

depend on the distance D

between i and j, and on parameters

(8)

determiningthescale(a

1

;a

0

)andthedecay(b

1

;b

0

)oftheneighborhoodinuence

(4) A

1

i

= N

X

j=1

j6=i

a

1

exp ( D

ij b

1 )y

1

j

+

a

0

exp ( D

ij b

0

)(1 y 1

j ) :

Thisspecicationallowsfordierentspatialdependence patterns,conditionalon

the decisionofthe particularneighbor. The spatialparameters(a

1

;a

0

;b

1

;b

0 )are,

together with ; and ,the parametersto be estimated.

Inthe followingwediscussthecalculation ofthe neighborhoodimpactA 1

i . A

prerequisite forthis istodeterminethe distance D

ij

betweenallpairs ofdecision

makersiandj. Asthedistanceshallbesmallifthedecisionsmakersareinsimilar

situations(i.e. the decision makers shall, in this case,betreated as`neighbors'),

the distance measure has todepend on characteristicsthat describe the decision

makers' situations. Therefore, we dene the DN matrix Z of (socialdistance

dening) characteristics by

Z = 0

@ z

11

::: z

1N

.

z

D1

::: z

DN 1

A

;

where z

di

denotes the social characteristic d (d =1;:::;D)of individuali. Fur-

thermore, we dene a D 1 vector

ij

of all the (social) dierences between

an individual i and an arbitrary other decision makers j (j 6= i) within the D

characteristics by 3

ij

0

B

@ z

1i z

1j

z

2i z

2j

.

z

Di z

Dj 1

C

A :

Vector

ij

can be interpreted as being located in a D-dimensional abstract

space, which is spanned by the D characteristics. The most popular strategy is

tomeasuredistancesinanEuclideanspacewheretheEuclidean Distanceisgiven

by

(5) D

ij

= q

0

ij

ij :

3

Foreaseofnotationweassumethattheparticularsocialcharacteristicshaveameanequal

(9)

Theassumption ofanEuclidean spaceis,inthis context, equivalenttoassuming

uncorrelatedness between the characteristics as well as equal variances, i.e. the

correlationmatrix of the socialcharacteristics isassumed tobe the identity ma-

trix. This is a serious restriction to the general case of arbitrary correlation

between the characteristics: In most applications, like for innovation decisions,

such characteristics will be correlated, e.g. the age of a rm and its size. Fur-

thermore, there may be information concerning the size in terms of the number

of employees and the turnover as well. If the correlation willbe considered, the

informationof bothcharacteristics canbeused withoutoveremphasizingthe size

aspect. Therefore, we propose a transformationof the dierences

ij

given by

(6)

ij

p

2NP

ij

;

whereP is obtained by the Cholesky factorization of ZZ 0

, i.e.

4

(7) P

0

P =(ZZ 0

) 1

:

Nevertheless, the characteristics ofdierent decisionmakers are stillassumed to

be independent. The transformation of

ij

into

ij

is, of course, equivalent to

transforming Z by Z

p

NZP, and calculating

ij

based on Z

.

This modication procedure ensures that the modied dierences in

ij are

orthogonal,thus, theynowsuittobetransformedintodistancesbytheEuclidean

measure. Hence, our modied distance measure is dened by

(8) D

ij

= q

0

ij

ij :

Thisconceptprovides,inviewofthestochasticalpropertiesofthecharacteristics,

a more general and more realistic measure of distances in an abstract space. If

the characteristics are uncorrelated and the variances of all characteristics are

equal, P p

N is the identity matrix, thus

ij

=

ij

. Hence, our modied distance

measure nests the Euclidean distance measure as a special case.

4

IfthematrixZZ 0

doesnothavefullranktheCholeskyfactorsP havetobecomputedon

thegeneralizedMoore-Penroseinverse. Hence,ifonecharacteristicvectorofthespacecanbe

writtenasalinearcombinationofothervectorsthenumberofthedimensionsofthespacewill

(10)

After determining the distance between allpairs of decision makers, the next

stepconsistsindeninganadequatedistancedecayfunction(DDF)whichtrans-

lates thesocialdistance intoavalue expressing the strengthof theneighborhood

impact. Asthisinuenceofjonishalldiminishwithdistance,aDDF=(D

ij )

has to be dened which (i) is decreasing in the argument, (ii) returns positive

values for positive arguments and (iii)returns a nite positivevalue for zero. A

functional formfulllingthis requirement isthe DDF

(9) (D

ij

)=aexp D

ij b

;

which oersthe advantage that two parameters are enough to describe a rather

wide range of dierent decay patterns: The parameter a measures the strength

of the neighborhoodinuence onthe decisionoutcome, whereas the parameterb

allows ustoinvestigatetheratebywhichthe impactofpriordecisionsattenuates

with the distance. Obviously, the parameter b is only identied if a 6= 0. If the

parameters a and b are jointly signicant, empirical evidence for neighborhood

inuences is found.

5

Areasonableextensionofthemodelismotivatedbythepossibilitythatmakers

of dierent decisions may have a dierent distance decay. Such dierences are

particularly likely if one of the both alternativesis opted for rarely, sodecisions

for this alternativeare noticed carefullywithlittlerespect tothe distance to the

respective decisionmaker. For example, one observed such asymmetricpatterns

in the early days of the internet, when the launch of any rm's web site was

exceedinglynoticed. Therefore, introducingtwodierentDDFs

1 and

0

(which

are associated to dierent parameters a

1

and b

1

and, resp., a

0

and b

0

) may be

sensible for certaincircumstances.

Theneighborhoodimpactofj onthedecisionoutcome ofiisthenj'sweighted

decision outcome

(10)

ij

=

1 (D

ij )y

1

j

+

0 (D

ij

)(1 y 1

j )

:

5

Ofcourse,therearemanyotherfunctionalformssuitableasDDF.FortheMonteCarloas

wellas forthe application study, wealso employed alternative functions which failed to give

(11)

Aggregating the spatial impacts

ij

of all units j 6=i leadsto

A 1

i

= N

X

j=1

j6=i

ij

;

which isequivalent toequation (4).

Neglectingneighborhooddependencies,i.e., droppingA 1

i

inequation(3), cor-

respondstotheproblemofomittedvariables. Thus,thecoeÆcientvectorwould

beestimatedwithabiaswhichdependsonthe correlationsbetween theexplana-

tory variables x on the one hand and the distances within the characteristics Z

and prior period decisions y 1

j

on the other hand. Additionally, the estimated

standard errors of would be wrong.

2.3. Multiple Choice. Often decision makers have todecide not onlybetween

`yes' or`no' but between several dierent alternatives. Therefore, amore general

specication is obtained by regarding more than two alternatives. For this pur-

pose, thebinary decisionmodelofsubsection 2.2can begeneralizedby including

spatialdependencies inmultinomialmodels. Byusing alternativespecicspatial

parameters,itispossibletoestimatethe particularimpactthat decisionmakers,

whochoseinthepriorperiodacertainalternativer,haveonindividualschoosing

between S alternatives inthe next period.

Thegeneralsettingofthemultiplechoicemodelisaccordingtothebasicbinary

choicemodel: An individualican decideamong S alternatives, dependingagain

onhisowncharacteristics,andonthe(exogenously given) decisionsmadebyhim

andhissocialneighborsj inacertainpriorperiod. Eachalternativesprovidesan

outcome (utility) y

is

; s =1;:::;S, which is not observable. The decisionmaker

chooses the alternative s if y

is

> y

is 0

; 8s 0

6= s: Then, the observable variable y

i

takes the value y

i

= s. By separating the impact of the own prior decision we

denethe latent modelassociated with the choice of s by

(11) y

is

=x 0

i

s +

R

X

r=1 y

1

ir

rs +A

1

is +u

is

; i=1;:::;N;

where y 1

ir

= 1l

(y 1

i

=r) . x

i

denotes, as in section 2.2, a vector of explana-

(12)

current period, with

s

asthe correspondingcoeÆcient vector.

rs

measures the

strengthoftheownpriordecision's impactifthedecisionmakerchosealternative

r(r = 1;:::;R ) in the prior period. Thus, this specication allows quantifying

the particular impacts of the own priordecisions dependingon the chosen alter-

natives in the two periods. In the prior period, the number R of alternatives

may dier from the alternatives' number S in the current period. For the error

terms u

i (u

i1

;:::;u

iS )

0

we assume u

i

i.i.d. N(0;) and E[A 1

is u

is

]=0. A 1

is

denotes the aggregatedspatial impactsof allprior decisionsona decisionmaker

i choosing the alternatives:

(12) A

1

is

= N

X

j=1

j6=i

"

R

X

r=1 a

rs

exp D

ij b

rs

y 1

jr

#

;

beingastraightforwardgeneralizationof(4),witha

rs andb

rs

measuringscaleand

pattern of the inuence that prior decision makers who chose alternative r have

on decision makers choosing alternative s. As before, these spatial parameters

are tobe estimated.

Again, an estimation of

s

that ignores the impact of prior decisionswill gen-

erallybebiased.

3. Monte Carlo Studies

The inclusion of aspatial distance matrix intodiscrete choice models leads to

highlynonlinear models which poses the question afterthe smallsample proper-

tiesofourestimationscheme. Especiallywithregardtotheapplicationinsection

4,wewant togain insight intohowthe quality of estimates depends ondierent

sample sizes.

Weanalyze the smallsampleproperties for a correctlyspecied binary choice

modeldenedaccordingtoequations(3)and(4)whichisestimatedbyMaximum

Likelihood. For the exogenously given decision outcomes y 1

i

from the prior

period we use random draws v

i

from a uniformly U[0;1] distribution coded by

y 1

i

=1l

(v

i

>0:5)

. We choose a parameter constellationwhich ensures that the

ratio between decisions of both outcomes (0and 1)is about balanced. We xed

(13)

the parameter for the impact of the own prior period decisions is = 1. The

spatial parameters are chosen to be a

1

= 1 and a

0

= 3 as scaling and b

1

= 3

and b

0

=5 asdecay parameters. The social space is spanned by two orthogonal

vectors obtained by random draws of the U[0;1] distribution. As we want to

concentrate on the estimation quality dependent subject to the sample size, we

do not utilize the correction described in 6. The error term is drawn from a

standard normaldistribution,whilethe exogenous variablesx

i

are sampledfrom

the uniformlyU [0;1]distribution.

Weanalyzeeightsamplesizesfrom50to400with1,000replicationseach. The

results are presented by means of box plots in appendix 6.1. There, the gures

1 till 5 show the empirical distributions of the respective parameters depending

on the sample size. The horizontal linewithin each box indicates the empirical

distribution'smedian. The box itself is dened by the upperand lowerquartile.

Adding three halfs of the interquartile range into both directions from the box

yieldsthe position of the so-called fences. Adjacent values, which lie out of the

box-fences-formation,are denoted byasmallcircle.

6

The respectivetrue valueis

located in the middle of each gure and marked by ahorizontalline. Note that

thescalesofthey-axesdier. Concerningthespatialparameters,weonlypresent

the twoones whichdescribethe inuence of theformer decisionsy 1

i

=1,as the

estimatesof the twootherparameters with respect toy 1

i

=0were qualitatively

identical.

A general nding of our study is that the challenge, which seven parameters

and a rather complicated likelihood mean at least for a sample size of 50, has

been met: there are no convergence problems tonotice and no sensitivity to the

start values even for this smallest sample.

For all parameters, the variance of the estimates' empirical distributions re-

duces remarkably over the sample size range. For n = 50, there are, for any

parameter,ahandful ofextreme outlierswhich cannot be drawn intothegure.

Butforrisingsamplesize, thevarianceconvergesathigh speed,and forn=400,

6

(14)

the reduction of the estimator`s variance achieved by additional observations is

only considerablefor the two spatialparameters.

Again, allve types of parameters show a clear small samplebias. This is no

shiftoftheempiricaldistributionbutanasymmetrywhichleadstooverestimation

intermsofthe absolutevalues. However, thisbiasdisappearstodierentdegrees

for rising sample sizes and does not vanish completelyfor any parameter under

consideration. It reveals tobestrongest forthe own impactparameter and the

extremely asymmetric spatialscaling parameter a

1

, while the decay parameter's

b

1

estimates are the least biased ones.

As a result, despite some shortcomings, the Maximum Likelihood estimation

ofour probitmodelprovestobepossibleand meaningful. The shortcomingsare,

for allkindsofparameters, aratherhigh variancewhenthe samplesize doesnot

exceed100,andaconsiderablebiaswhichdoesnotvanishevenforsomehundreds

observations.

4. Application to Innovation Data

Research on R&D and innovation activities is familiar with circumstances in

whichdecisions ofarm -namelyto aimforand toadoptanew technology-do

not onlydepend onthe rm'sownsituation(seee.g. Geroski (1995)orEncaoua,

Hall, Laisney, and Mairesse (1998)). Reasons for that are mainly, but not only,

eectsofnetworksandofstandardization(forareview,seee.g. KatzandShapiro

(1985)) and, of course, of research spillovers (the broad discussion about this

subject has been established by Griliches (1992)). Further reasons, quite inline

with those we discuss in our introduction, are compiled by Baptista (2000) and

Manski(2000). Consequently, weapply our modeltodata ofproduct innovation

activities of German rms because the features of this decision - to innovate or

not -suitswelltoourgeneralmodelsetting: Itisabinary decision,the degreeof

uncertainty isratherhigh whichmakesthe informationhowothersdecided more

valuable, the decision makers are under the labor market's conformity pressure

described in the introduction, and the competition between rms urges to keep

(15)

The data we use is the rst wave of the Mannheim Innovation Panel (MIP),

provided by the Centre forEuropean EconomicResearch (ZEW, Mannheim; see

Harho and Licht (1994) for details on the data composition.). The MIP is a

survey of approximately 3000 German rms which has been annually collected

since1993. Its questionnairefollows the guidelinesof the OSLO-manual(OECD

(1992))andcontainsdetailedinformationonpotentialdeterminantsofinnovative

activity. In contrast to our (implicit) assumption, our data does not provide

informationabout all rms potentiallyinuencing the rm under consideration.

Havingthisdrawback inmind,weassumethatthesampleunits,whichhavebeen

randomly selected from the total population when the MIP has been designed,

can be used to predict all outside inuence a rm receives. Because the MIP

providesbinaryinformationineachoftheprecedingthreeyearsastowhether an

innovationhas taken place,we apply the binary choice framework of section2.2.

Byusingtheoutcomesfrom1990and1992,weanalyzeasampleofallrmsinthe

MIP for which the relevant information is available, in total 1380 observations.

Table 1 (section 6.2) shows the number of the adopted product innovations in

the years 1990 and 1992. We use the entirety of the MIP data instead of single

industries'data becausedoing the latterwould leadto samplesizesof about 100

or less. As the Monte Carlo study in the previous chapter shows, such samples

are too smallforreliable estimations.

It would be beyond the scope of our paper to derive an estimation equation

fromastructuralapproach. Forourreducedformapproach,weuse twogroupsof

variablesasdeterminants ofthe innovationdecisions. Whilethe rst onereects

the abilityof arm tocope withthe eort necessary fora successfulinnovation,

the second group provides information on the subjective view managers have

on theirs rms' situation. The rst group of `hard' variables encompasses the

employees' number, the square of this number, and the ratio of own capital to

totalsales. Thelatterservesasanindicatorfortherm'snancialstrength. The

`soft' variables are two expectations a rm's manager will have concerning the

relevant product market: the developments of the demand for their products on

(16)

In order to measure the distance (i.e., to dene neighborhood) between the

rms onehas todeterminesuitabledimensionsforthe abstract space. Weutilize

sixvariablesasthesecharacteristicscomposingthematrixZ,which,again,canbe

divided in `hard' variables that describe the rm's situation,and `soft' variables

that allowto see into the managers' motivation. Forthe rst group of variables,

we utilizethe rm's age, the shareits exports have initssales, the sales share of

its most important product, and whether it is located in Eastern Germany. For

the latter group, we use two questions in which rms have been asked to weigh

theimportanceofpossibleobjectivesfortheirinnovationdecisiononascalefrom

1 (no importance) to 5 (high importance). We employ `Decreasing the impact

on the environment' and `Lowering the energy consumption'. Table 2 shows

the correlation matrices of the chosen six social characteristics. Considerable

correlations between particular characteristics point out the necessity to correct

for it by using the modied distance measure asdened inequation (6).

The results of the Maximum Likelihood estimations are presented in table 3.

In paerticular, we employ four specications.

The rst specication is a common probit without any own or neighborhood

inuence from the previous period. The estimation yields coeÆcients with the

expected sign which are signicant atleast on the 10%-level, with the exception

ofown capitalpersale. The sizeofarm,expressed bythe numberofemployees,

hasadecreasinglypositiveeect ontheinnovationprobability,withaprobability

maximizingrmsizeof roughly100employees. The higherthe expecteddemand

inthecorrespondingproductmarketsthehighertheinnovationprobability,which

islittlesurprising. Thetendencytoinnovatealsoriseswiththecompetitioninthe

product marketa rm expects; supposable, a product innovation is, atleast for

somerms,animportantstrategicmoveinthemarketcompetition. Remarkably,

the estimated coeÆcients for the two `soft' variables expected demand and the

expectedcompetitionprovetobeverystablewithintheotherthreespecications

(b) till(d) whichis not similarlytrue for the three `hard' variables.

Moreprecisely,theinclusionoftheownpriordecisioninspecication(b)aects

(17)

due to the mostly low changes of employment gures within two years, while

the opinion concerning the demand and the competitors' behavior may alter

more quickly. The variable measuring the own prior eect is strongly signicant

positive,as expected.

The third specication (c) includes the neighborhood dependence eect, al-

lowing for positive inuence exerted by innovators, and for negative inuence

exerted by the non-innovators, assumingan identical decay for both kinds of in-

uence. Thedecay parameterb isstronglysignicant,whichexpresses thehigher

relevance neighbors have on a decision. The two scale parameters, being both

signicant on the 5 %-level, clearly reveal the assumed inuence pattern. The

Wald test on joint signicance of the three spatial parameters is signicant, as

well.

The full model, which allows for dierent decays for innovators' and non-

innovators' inuences, is presented in the last specication (d). It states the

results found before. The scale parameters only change mildly and keep their

signicance level. The two values for the decays are roughly of the same size.

As allfour spatialparameters are signicantat least onthe 5%-level, the Wald

statisticfor their joint signicance is,again, persuasively signicant.

5. Conclusions and outlook

In this paper, we model and estimate discrete choice problems of individuals

underthe assumptionthatdecisionswhichothershavemadeinsimilarsituations

inuence the individual decision maker. According to the distinction described

e.g. byManski(1993),these outsideinuences are interactionsof anendogenous

kind,asthepropensityforacertaindecisionoutcomedependsontheoutcomesof

otherdecisionmakers whoare relevantdue totheirsocialsimilarity. Tomeasure

the degree of social similarity between individuals we dene an abstract space

so that agents with similar decision problems are neighbors in this space. Our

approachtakescorrelationsbetweenindividuals'characteristics,whichdenethe

dimensions of the abstract space,intoaccount. Wepresent MonteCarlo studies

(18)

meaningful results. Nevertheless, unbiasedness and variance of the empirical

distribution of the estimates heavily improve for a sample of, say, n = 200. An

applicationtoGermaninnovationdata,whichestimatestheprobabilitytodecide

foraproductinnovation,revealsstrongevidencefortheexistenceofthedescribed

neighborhoodinuence.

The next research steps should explore certain aspects of the presented ap-

proach. A promising possibility seems to be the widening of the time horizon:

with panel data, the innovation decision pattern over several periods could be

described. Another aspect worth examining is the design of the distance decay

function. For concrete applications, it remains demanding to detect not only

the degree of importance which an outside inuence has on a decision maker

but especially the factors on which this inuence depends. The crucial point

is, technically, to x how the distances between decision makers can be prop-

erly determined,whichwill,inmost cases, raise the question astowhich factors

express similaritybetween theagents'decisions. Againstthatbackground,learn-

ing about social interactions could benet from data containing informationon

subjective views of the decision makers, as e.g. Manski (2000)emphasizes.

References

Akerlof, G.(1997): \Social DistanceandSocialDecisions,"Econometrica,65, 1005{1027.

Akerlof, G., and J. Yellen (1990): \The Fair Wage-Eort Hypothesis and Unemploy-

ment,"Quartely JournalofEconomics,105,255{283.

Anselin,L.(1988): \SpatialEconometrics: MethodsandModels,"Kluwer,Dordrecht.

Anselin, L., and D. Griffith (1988): \Do Spatial Eects Really Matter in Regression

Analysis?,"Papers ofthe Regional ScienceAssociation,65,11{34.

Baptista, R.(2000): \DoInnovationsDiuse FasterWithinGeographicalClusters?,"Inter-

nationalJournalof IndustrialOrganization,18,515{535.

Bernheim,B.(1994): \ATheoryofConformity,"JournalofPolitical Economy,102,841{877.

Besley, T., and A. Case (1995): \Incumbent Behaviour: Vote-Seeking, Tax-Setting, and

YardstickCompetition,"American Economic Review,85,25{45.

Brock, W., and S. Durlauf(1995): \DiscreteChoice withSocialInteractionI: Theory,"

DiscussionPaper5291,NBERWorkingPapers.

Case, A.(1992): \NeighborhoodInuence andTechnologicalChange,"Regional Scienceand

Urban Economics,22, 491{508.

Case, A., and L. Katz (1991): \The Company You Keep: The Eects of Family and

NeighborhoodonDisadvantagedYouths,"DiscussionPaper3705,NBERWorking Papers.

Dubin, R. (1995): \Estimating Logit Models with SpatialDependence," L. Anselin and R.

Florax, "NewDirectionsin Spatial Econometrics",Springer, Berlin,pp.229{242.

Encaoua,D.,B.Hall,F.Laisney,andJ.Mairesse(1998):\TheEconomicsandEcono-

(19)

Florax,R.(1992): \TheUniversity: ARegionalBooster?,"Avebury,Aldershot,pp.189{228.

Geroski, P.A. (1995): \Markets forTechnology: Knowledge, Innovationand Appropriabil-

ity,"inHandbook oftheEconomicsof InnovationandTechnological Change,ed.byP.Stone-

man,chap.4,pp.90{131.Blackwell,Oxford.

Griliches,Z.(1992): \TheSearchforR&DSpillovers,"ScandinavianJournalofEconomics,

94(supplement),29{47.

Harhoff,D.,andG.Licht(1994):\DasMannheimerInnovationspanel,"U.Hochmuthand

J. Wagner, "Firmenpanelstudien in Deutschland", Tuebinger Volkswirtschaftliche Schriften,

Tuebingen,pp.255{284.

Katz,M.,andC.Shapiro(1985): \NetworkExternalities,CompetitionandCompatibility,"

AmericanEconomic Review, 75,424{440.

Manski,C.F.(1993): \IdenticationofEndogenousSocialEects: TheReectionProblem,"

Review ofEconomic Studies,60,531{542.

(2000): \Economic Analysis of Social Interactions," Discussion Paper 7580, NBER

WorkingPapers.

McMillen, D. (1992): \Probit with SpatialAutocorrelation," Journal of Regional Science,

32,335{348.

(1995): \SpatialEectsinProbitModels,"L.AnselinandR.Florax,"NewDirections

inSpatial Econometrics",Springer,Berlin, pp.189{228.

OECD(1992): \ProposedGuidelinesforCollectingandInterpretingTechnologicalInnovation

Data-OSLOManual,"OECD/DG(92),26.

Pinkse,J.,andM.Slade(1998): \ContractinginSpace: AnApplicationofSpatialStatistics

toDiscrete-ChoiceModels,"JournalofEconometrics,85,125{154.

Poirier, D., and P. Ruud(1988): \Probitwith DependentObservations,"Review of Eco-

nomicStudies,55,593{614.

Scharfstein,D.,andJ.Stein(1990):\HerdBehaviourandInvestment,"QuarterlyJournal

of Economics,105,255{283.

Topa,G.(1997): \SocialInteractions,LocalSpilloversandUnemployment,"Discussionpaper,

DepartmentofEconomics,NewYork University.

Tukey,J. W.(1977): ExploratoryData Analysis.Addison-Wesley,Reading,MA.

(20)

6. Appendix

6.1. Monte Carlo Simulation Results. The following box plot gures show the

empiricaldistributionsoftheestimatedparametersfordierentsamplesizes,whichre-

sultedfromprobitestimationswith1,000replicationseach. Seesection3foradetailed

descriptionof thestudy'sdesign.

Figure 1. Empiricaldistributionsof constant's estimates.

Estimation of the con-

stant .

(True value: =1)

Figure 2. Empiricaldistributions of slopeparameter's estimates.

Estimation of the slope

parameter.

(Truevalue: = 1)

(21)

Figure 3. Empiricaldistributions of own impact parameter'sestimates.

Estimation of the

parameter which

reects the impact

from the own prior

period's decision.

(Truevalue: =1)

Figure 4. Empirical distributionsof scaling parameter's estimates.

Estimation of the spa-

tial scaling parameter

a

1

which expresses

the size of that neigh-

bors' inuence who

had decided posi-

tively (i.e., y 1

i

= 1)

in the prior period.

(Truevalue: a

1

=1)

Figure 5. Empiricaldistributions of decay parameter'sestimates.

Estimation of the spa-

tial decay parameter

b

1

which expresses

the decay of that

neighbors' inuence

who had decided pos-

itively (i.e., y 1

i

= 1)

in the prior period.

(Truevalue: b

1

=3)

(22)

6.2. Estimation of Product Innovation Decisions in Germany.

Table 1. Number of product innovators in the years 1990 and

1992. Basedon data of the Mannheimer Innovationspanel.

1992

No Yes

1990

No 186 375 561

Yes 69 750 819

255 1125 1380

Table2. Correlationmatricesofsocialcharacteristics. Thechar-

acteristics are denoted by

A: importanceof objection `less impacton environment' and of

B: `lower energy consumption'for the own innovation decision,

C: rm age,

D: share of exports,

E: located ineastern Germany,

F: sales share of most importantproduct.

A B C D E F

A 1.0000

B 0.3023 1.0000

C 0.0337 -0.0350 1.0000

D 0.0265 -0.0382 0.1765 1.0000

E -0.0507 0.1572 -0.4193 -0.2953 1.0000

F -0.0021 0.0552 -0.0355 0.0030 0.0353 1.0000

(23)

Table 3. Estimationresults of product innovation decisions

n=1380 (a) (b) (c) (d)

Mean Variables

Constant 0:2758

0:3115

0:4033

0:4090

Employment 0:0957

0:0420 0:0498 0:0512

(Employment) 2

0:0005

0:0002 0:0002 0:0002

Own CapitalperSales 0:0123 0:0315 0:0263 0:0265

Expected Demand 0:0924

0:1350

0:1251

0:1232

Expected Competition 0:0682

0:0693

0:0699

0:0690

Own Impact

- 0:9597

0:9677

0:9732

Spatial Parameters

a

1

- - 0:2165

0:1934

a

0

- - 0:1193

0:1789

b

1

- -

3:5275

3:5050

b

0

- - 4:1308

2

(3)resp.

2

(4) - - 59.40

72.29

logLikelihood -651.96 -585.01 -581.72 -581.41

1380 rms of the MIP sample 1990/1992. Dependent variable: Product innovation

realized in 1992 (yes/no). Inference based on robust standard errors. Wald test for

joint signicanceof thethree resp. fourspatialparameters.

: signicant on the1%-level

nostar : no signicance