Skip to content

Using FME to analyze and predict dengue outbreaks in Malaysia

 

This work was done in collaboration with Safe Software partner GIS Innovation Sdn Bhd 

FME was fundamental in the study of effectively predicting dengue outbreaks. The research aimed to develop an index to evaluate dengue outbreaks and predict when the next outbreak will occur in Malaysia by analyzing diverse data variables. Over 62,000 records were analyzed from Peninsular Malaysia, covering dates, addresses, and total case numbers across districts and localities over a five-year period. The integration of moon phase data, weather variables, and land use data provided a comprehensive understanding of the factors influencing dengue transmission.

FME played a critical role in cleaning and processing the raw data and analyzing it to create predictive models, enabling the prediction of when the next outbreak will occur in Malaysia.

 

Preparing Dengue Data with the StatisticsCalculator Transformer

Dengue data were collected from the Malaysian Ministry of Health website. This data, available as PDF files, required additional processing to suit our research requirements. Retrieved data included addresses, years, weeks, total case numbers, and weekly average new case numbers for dengue in Malaysia.

The StatisticsCalculator was the first transformer employed after data retrieval and cleaning. It facilitated data classification by state, year, and week, exporting the data in Excel format. The chosen output format was primarily for charting in MS Excel. The same transformer computed values such as mean, sum, and standard deviation.

Geocoding the Dengue Data with GoogleGeocoder Transformer

All dengue data were collected and compiled into a Microsoft Excel file. Using the GoogleGeocoder transformer, the data was geocoded and converted into point objects stored as Esri Shapefiles.

Processing Moon Phase Data with DateTimeCalculator and DateTimeConverter Transformers

Moon phase data was downloaded from the “Time and Date” website (freely available for download). As the moon phase data start from a specific date, converting the dengue cases from their actual dates and grouping them by moon phase was challenging. Processing the data required tedious arrangement of elements and standardizing temporal data.

Utilizing the DateTimeCalculator and DateTimeConverter tremendously aided in data classification and conversion of dates to days of the week.

The moon phase data comprises dates marking the beginning of each phase (New Moon, First Quarter, Full Moon, Last Quarter). Employing FME, the moon phase for the first week of each month was retrieved. This technique enabled the calculation of the week number for each moon phase. Refer to Table 1 below.

Weather Data

Precipitation and temperature were the weather variables considered in this study. According to the literature study, temperatures between 28 °C and 32 °C were associated with the highest odds of dengue transmission. Since Malaysia’s temperatures are typically over the threshold, it was assumed that this variable had to be ignored because Malaysia’s climate is conducive to mosquito development and activity. If the model needs to be utilized in other nations, such as Hong Kong, where temperatures fluctuate greatly, this temperature might be taken into account. For such data filtering, the Tester is always the transformer of choice.

Land Use Data

Land Use data was considered, and it goes without saying that FME assisted in the creation of maps with simple operations like the Clipper transformer or even the Reprojector to reproject data from MRSO to WGS84. Other geospatial processing transformers were utilized by fanning out the data, which is a method of breaking the data down into smaller data depending on a field’s unique values.

Parameters Calculation

There are different ways to manipulate data from one field using another or just to compare data, the transformer primarily used to divide data is the Tester. Other transformers such as the AttributeManager can be used alongside the ExpressionEvaluator which was used for different data manipulation and calculation. The ExpressionEvaluator is a powerful transformer that helps to input your own algorithm to process numbers and get results  no matter what your methodology would be. 

Conclusion

This study demonstrated how FME helped in the research for dengue outbreak predictions by processing and analyzing large datasets. The integration of various data types and the use of specific transformers facilitated a comprehensive analysis, leading to the development of predictive models. FME’s capabilities in data cleaning, processing, and manipulation were crucial in achieving the research objectives, showcasing its endless possibilities.

“FME has always been my go-to tool for dealing with data. It allows you to solve complex data problems by connecting a few transformers. ”
Lyes Mokraoui,
Geospatial Consultant GIS Innovation Sdn Bhd
Safe product icons
Reach out and get started with FME today

Real change is just a platform away.