• Keine Ergebnisse gefunden

This creates a need for efficient methods for organizing, storing, and summarizing the data

N/A
N/A
Protected

Academic year: 2021

Aktie "This creates a need for efficient methods for organizing, storing, and summarizing the data"

Copied!
4
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

EISENSMITH, Scott P.: Acquisition, storage, and retrieval of epldemologlcal data in plant pathology

When studying plant disease epidemics, large environmental and biological data bases are accumulated. This creates a need for efficient methods for organizing, storing, and summarizing the data. Computers are suited for storing and m a n i p u l a t i o n large, heterogeneous data bases, and Computer facilities are a v a i l a b l e in many parts of the world. Powerful statistical packages (l, 4, 12, 14) are a v a i l a b l e but do not meet a l l the needs of the p l a n t pathologist.

Although Computer programs have been written for specific teaching (3, 9, 10), extension (7, 8, 13), and research purposes (11), l i t t l e has been done to provide Computer Software for h a n d l i n g general epidemiological data. The Computer program EPISTAR was developed to f i l l this need and reduce the time researchers must spend in data organization and retrieval. This program was developed in conjuntion with Drs. Rosemary Loria, Brian 01son and Alan Jones at Michigan State University. Our approach and several of its advantages are outlined below.

EPidemiological Information STorage And Retrieval (EPISTAR) is a computer-based System designed to assist researchers in collection, reduction, storage, and analysis of host, pathogen, and environmental data. EPISTAR consists of:

o data collection forms for organizing data into a Standard format

o an interacitve FORTRAN V Computer program that w i l l display, summarize, and perform coincidence analysis (11) on data

o a us-er's manual e x p l a i n i n g how to use the System

o a programmer1s manual which discusses program modification and transportability.

The EPISTAR Computer program i s well commented and easily modified to allow for new data processing commands or additional data types. For example, at M i c h i g a n State University, algorithms developed at Cornell University (5) that allow transcription of hourly values from hygrothermograph or De Wit leaf wetness (M.

De Wit, H e n g e l o , H o l l a n d ) charts have been added to our version of EPISTAR.

The program w i l l accomodate both environmental and b i o l o g i c a l data, collected by automatic or manual Systems. Currently EPISTAR w i l l handle ten environmental Parameters (air temperature, barometric pressure, leaf wetness, r a i n f a l l , rela- tive humidity, soil moisture, solar radiation, soil temperature, wind direction, and wind speed) with up to 24 observations per day for each parame- ter. A l l Output i s in the same units äs the o r i g i n a l data.

B i o l o g i c a l data types processed by EPISTAR include: BURKARD spore trap (Burkard Scientific Sales Ltd., Rickmansworth, Hertfordshire, England) catches, Rotorod spore trap (Ted Brown Associates, Los Altos H i l l s , CA 94022) catches, water trap catches, disease ratings, and stages of host phenology. Burkard spore trap catches may be recorded up to 24 times per day, w h i l e other biological data types may be recorded, at most, once daily. Data may be taken from m u l t i p l e environmental sensors, spore traps, or plots in the same time period.

The Computer program performs three basic operations: displays stored data for examination, calculates summary statistics, and searches for possible relationships among variables. The display command allows for convenient access EISENSMITH, Gießen 293

(2)

to portions of the original data and facilitates the creation of subfiles for plotting or statistical analyses. To obtain the data, the user specifies the data type(s) and the date ränge. Then the program searches the data f i l e and prints out the results of the search. The summary command provides various statistics that are useful in analyzing epidemiological Information. Each biological and environmental factor is summarized on a daily basis. Summary calculations include m i n i m u m , m a x i m u m , and average values; number of time periods above or below user-specified thresholds; and number of m i s s i n g values.

The user may select subsets of data for summarization, i.e. data from an environmental sensor, plot, plant, shoot, shoot type, or leaf. The coincidence analysis command permits the location of time periods w i t h i n the data file with conditions that match user-specified values. The conditions are specified with the FORTRAN relational operator codes 'LT', 'LE1, 'EQ', ' N E1, 'GT1 and 'GE1 in reference to a user-supplied value. Series of comparisons may be joined together with the logical operators 'AND1 or 'OR1.

Coincidence analysis may be used on all environmental data and Burkard spore trap catches, and i s particularly useful in detecting associations between environmental conditions and biological events. For example, it may be desirable to locate time periods with greater than 0,02 cm of r a i n f a l l , tempe- ratures less than 23 C, and a spore trap catch of at least 10 spores/hour.

Once the EPISTAR program i s accessed, a l l the information necessary to run the program is requested from the user. The user may choose to display, summarize, perform coincidence analysis, change the placement of the Output, select a new data f i l e , or terminate the program by entering the appropriate mnemonic command.

EPISTAR accesses the data f i l e and writes out information about the data f i l e

— i n c l u d i n g the data collection date ränge, collector, collection location, and the data types present in the file. If the user chooses to display, summarize, or perform coincidence analysis; the desired data types and probe, plot, or trap number(s) must be selected from those listed äs a v a i l a b l e . If the data are to be summarized, appropriate thresholds should be provided for environmental data.

Relational operators, relational values, and possibly a logical operator must be chooses for coincidence analyses. When the EPISTAR program has completed the data processing, it w i l l again ask the user which procedure is desired — display, summary, coincidence analysis, new data file, or program termination.

User responses are checked for errors, and i n v a l i d responses are rejected with a diagnostic message. No knowledge of FORTRAN programming is necessary to use EPISTAR.

The form of Output depends on the kind of processing and on the type(s) of data selected. Output from the program may be printed at a Computer terminal, or /ritten onto a disk f i l e for use with statistical analysis (l, 12) and plotting (2, 6) programs. In display Output, each l i n e represents one day, and begins with the date; data type code; and the probe, plot, or trap number. This is followed by the raw data, with decimal places inserted according to the Stan- dard formats.

Each l i n e of summary Output begins with the date and the day of the year, followed by the summarized data values for the date. Summary values are calculated from data sets with missing data, but the number of missing values is indicated in the Output. If all data values are m i s s i n g , m i s s i n g value indicators are given instead of the summarized values. Each data type is separated by a column of stars. Each l i n e of coincidence analysis begins with the date and day of the year.

294

(3)

Those time periods, represented in the title äs the first through the 24th hours, for which the user-specified requirements have been satisfied, are indicated with stars. The total number of time periods for which condition where satisfied for a day, and for a particular time period, are given äs the sums of the rows and columns in the table, respectively.

The EPISTAR System facilitates collection, storage, and analysis of epidemiological data, and may serve äs a model for other standardized epidemiological data processing Systems. Data forms allow for immediate organization of the information into computercompatible format, thus facilitating the use of other Computer Software for data analysis on plotting.

Subsets of the master data f i l e may be easily selected by date ränge and data type. M u l t i p l e observations for a s i n g l e day may be summarized into daily values. Those data meeting user-specified conditions can be quickly located within the data file.

The Computer program has been written in a modular format to facilitate modification, i n c l u d i n g the addition of new data types or processing routines.

We hope it may thus serve äs a framework for additional epidemiologically-oriented Computer Software.

Literature Cited

1. ANONYMOUS: M i c h i g a n State University STAT System: User1s guide VIII.

Computer Laboratory, M i c h i g a n State University, East Lansing, MI 48824, 1974

2. ANONYMOUS: Statistical p l o t t i n g o n - l i n e command System: user's guide. Pest Management Technical Report 13. Dept. of Entomology, M i c h i g a n State University, East Lansing, MI 48824, 54 p p . , 1977

3. ARNESON, P.A., OREN, T.R., LORIA, R., JENKINS, J.J., GOODMAN, E.D. and COOPER, W.E.: APPLESCAB: A pest management game. Dept. of Entomology, M i c h i g a n State University, East L a n s i n g , MI 48824, 1978

4. BARR, A.J., GOODNIGHT, J.H., SALL, J.P. and HELWIG, J.T.: A user's guide to SAS-76 Institute, P.O.Box 10066, R a l e i g h , NC 27605, 1976

5. BLUME, M.C., SEEM, R.C. and BARNARD, J.: Two Computer programs used in the analysis of rectangular and circular charts from continously recording weather instruments. Search 9 (1) Plant Pathology Dept. No. 5, N.Y. Agric.

Expt. Stn., Geneva, NY. 11 pp., 1978

6. BOBZIN, J., EISENSMITH, S.P. and JONES, A.L.: Plotting of environmental and biological data: User's guide. Pest Management Technical Report 16, Dept.

of Botany and Plant Pathology, M i c h i g a n State University, East Lansing, MI 48824, 82 pp., 1978

7. COTTER, H.V.T., MAC HARDY, W.E. and WARREN, J.A.: Storing, retrieving, and reporting plant disease occurence data using a Computer data base manage- ment System. Plant Dis. Rept. 63 (2), 117-121, 1979

8. CROFT, B.A., HOWES, J.L. and WELCH, S.: A Computer based extension pest ma- nagement delivery System. Envir. Ent. 5 (1), 20-34, 1976 DIMOFF, K. and KILMER, N.: SAMPLE user's manual. Dept. of Entomology, M i c h i g a n State University, East Lansing, MI 48824, 21 pp., 1978

EISENSMITH, Gießen 295

(4)

9. LOGAN, P.: Orchard: A serious predator-prey interaction game. Dept. of Entomology, Michigan State University, East Lansing, MI 48824, 26 pp., 1978 10. MOGK, M.: Automatic data processing in analysis of epidemics. Pages 55-77 in: J. Kranz, ed. Epidemics of Plant Disease. Springer-Verlag, New York, NY. 170 pp., 1974

11. NIE, H.C., HÜLL, H., JENKINS, J.G., STEINBRENNER, K. and BENT, D.H.:

Statistical Package for the Social Sciences. MC Graw-Hill, Co. Inc., New York, NY. 675 pp., 1978

12. PENNYPACKER, S.P., KNOBLE, H.D. and LONGENECKER, J.L.: Plant disease survey information (PDSI): a Computer data storage and retrieval System. (Abst.

No. 192), Proc. Am. Phytopahtol. Soc. 3:246, 1976

13. RYAN, T.A., JOINER, B.L. and RYAN, B.F.: MINITAB: A Student Handbook.

Duxbury Press Div. of Wadsworth Pub. Co., North Scituate, MA. 341 pp., 1976

296

Referenzen

ÄHNLICHE DOKUMENTE

His research focuses on fast access to big data including in particular: data analytics on large datasets, Hadoop MapReduce, main-memory databases, and database indexing. He has been

Author contributions BB has led overall research activities from proposal development to data compilation, data entry and processing, data analysis, and interpretation of the result

By focusing on the (expert) user and her task, we need to think beyond natural language processing and closely cooperate with computer vision to enable multimodal systems to

The second theme under this topic centered on the challenges to information being viewed as a strategic asset, particularly the fact that there is difficulty in sharing

The aim of the BONN-IIASA Research Project is to analyse and forecast economic growth and economic structural change of Lhe most important countries (or groups of countries) in the

Table 21-1 Comparison between primary and secondary temperature sensors Table 21-2 Salinity comparison between primary and secondary sensors Table 21-3 CTD salinity from

This review has lead to: a summary of commercially available integrated thermochemical databanks; a bibliography of thermodynamic databases available in electronic format; a list

The present Policy pertains to the ownership of, the curation of and access to experimental data and Metadata collected and/or stored by PSI