Sage Journals: Discover world-class research

Abstract

Sustainable planning of waste management is contingent on reliable data on waste characteristics and their variation across the seasons owing to the consequential environmental impact of such variation. Traditional waste characterization techniques in most developing countries are time-consuming and expensive; hence the need to address the issue from a modelling approach arises. In modelling the complexity within the system, a paradigm shift from the classical models to the intelligent models has been observed. The application of artificial intelligence models in waste management is gaining traction; however its application in predicting the physical composition of waste is still lacking. This study aims at investigating the optimal combinations of network architecture, training algorithm and activation functions that accurately predict the fraction of physical waste streams from meteorological parameters using artificial neural networks. The city of Johannesburg was used as a case study. Maximum temperature, minimum temperature, wind speed and humidity were used as input variables to predict the percentage composition of organic, paper, plastics and textile waste streams. Several sub-models were stimulated with combination of nine training algorithms and four activation functions in each single hidden layer topology with a range of 1–15 neurons. Performance metrics used to evaluate the accuracy of the system are, root mean square error, mean absolute deviation, mean absolute percentage error and correlation coefficient (R). Optimal architectures in the order of input layer-number of neurons in the hidden layer-output layer for predicting organic, paper, plastics and textile waste were 4-10-1, 4-14-1, 4-5-1 and 4-8-1 with R-values of 0.916, 0.862, 0.834 and 0.826, respectively at the testing phase. The result of the study verifies that waste composition prediction can be done in a single hidden-layer satisfactorily.

Keywords

Municipal solid waste model architecture backpropagation seasonal variation physical composition ANN

Introduction

The upsurge in the rate of solid waste generation is an unavoidable repercussion of production and consumption activities, and urbanization expansion consequent upon population growth in developing countries (Gallardo et al., 2018; Pathak et al., 2020). Sustainable waste management has been prioritized in South Africa to ensure that all generated waste does not necessarily end up in landfills, because most landfill sites are reported to be running out of space for waste disposal. Well-informed decision making regarding effective collection and disposal strategic planning is contingent on possession of reliable information on characteristics, composition, generation and sources of municipal solid waste (MSW) (Bernstad et al., 2012; Kamran et al., 2015). The variations in MSW composition make it difficult to measure and quantify waste composition, while at the same time making it critical and necessary (Abylkhani et al., 2019; Gidarakos et al., 2006).

Factors such as employment status, household size, seasons, income level and population influence the variation in the composition of MSW waste streams (Intharathirat et al., 2015). Changes in weather conditions at different seasons in a year affect consumption pattern and human activities and have impacted the fractions of the waste stream such as plastics, paper, metal, textile and organic waste (Denafas et al., 2014). The study by Kamran et al. (2015) in city of Lahore, Pakistan revealed that the highest fraction of food and yard waste was generated in spring while the winter season had the highest fraction of plastic and textile waste. Jadoon et al. (2014) analysed variability in waste composition and rate of generation of MSW in Gulberg town of Lahore, Pakistan over four different seasons and gave a result similar to Kamran et al. (2015) in the same case study. Aslani and Taghipour (2018) reported that the fraction of all the waste streams in three Iranian cities were found to vary across the winter, autumn, spring and summer seasons. Winter produced the highest organic waste fraction while the summer season produced the highest paper fraction. Similar studies were extended to four European cities (Denafas et al., 2014), Island of Crete (Gidarakos et al., 2006), Chihuahua, Mexico (Gómez et al., 2009), and Columbia, Missouri (Zeng et al., 2005). Packaging waste increased in the summer season on the Island of Crete (Gidarakos et al., 2006). Seasonal variation does not only influence the physical waste stream, discrepancies were reported in the elemental composition of waste in the three Iranian cities (Aslani and Taghipour, 2018) and in moisture content in Wroclaw (Boer et al., 2010) over four different seasons.

Most of these studies are focused on experimental quantification of the MSW composition and generation in different seasons of the year while there has been little attention to developing mathematical models to quantitatively predict the extent of the effect of the seasonal changes on MSW fractions. Very few studies such as that of Denafas et al. (2014) have developed a non-parametric time series model such as simple exponential smoothing (SimpleES), double exponential smoothing (DES), seasonal exponential smoothing (SES) and linear exponential smoothing (LES) to predict monthly waste fractions. The expression in equation (1) was formulated to predict the monthly fraction of waste at a time t using the SES method.

Y_{t} = L_{t} + S_{t - p} + ε_{t}

(1)

where $Y_{t} =$ monthly MSW fraction, $S_{t - p} =$ fraction at a time $t$ with one of $p$ season factors, and $ε_{t} =$ error terms.

Sustainable planning of waste management is contingent on proper knowledge of the trend in the variation of the physical composition of MSW owing to the consequential environmental impact of such variation. Traditional waste characterization techniques in most developing countries are time-consuming and expensive; hence the need to address the issue from a modelling approach arises. The knowledge of the fact that variations in waste composition further impact the environment and the energetic content of waste, necessitates the modelling approach to the issue (Denafas et al., 2014). More so, in modelling the complexity within the system, a paradigm shift from the classical models to the intelligent models such as artificial neural network (ANN), adaptive neuro-fuzzy inference system (ANFIS), support vector machine (SVM), genetic algorithm (GA), among others, has been observed.

Due to the ability of ANN to model non-linear time series problems it has be found useful in a wide range of applications such as in energy systems (Panapakidis and Dagoumas, 2016; Wang et al., 2016), finance (Chen and Du, 2009), traffic (Slimani et al., 2019) and even in waste management (Oliveira et al., 2019; Solano et al., 2019). Its flexible computational framework allows the users to vary its topology such as numbers of layers and neurons in the layers and this has made it suitable for many time series prediction applications (Çavu, 2019). The application of artificial intelligence modelling in waste management has been gaining traction globally. The literature is replete with several studies which applied ANN for modelling different components of waste management such as waste generation forecast (Noori et al., 2010a; Singh and Satija, 2018), leachate formation and control (Bayar et al., 2009; Karaca and Özkaya, 2006), heating value prediction (Ozveren, 2016), bin-level monitoring (Hannan et al., 2016; Islam et al., 2014), process output, biogas generation and energy recovery (Ozkaya et al., 2007; Qdais et al., 2010), waste collection truck routing (Vu et al., 2019), and automated waste sorting (Vrancken et al., 2019). The larger percentage of these studies are applied for forecasting MSW generation.

Table 1 summarizes some selected studies from literature which used an ANN model for specific applications in waste management. However, it was observed that no study was found in the literature that applied ANN for the prediction of the physical composition of waste based on meteorological parameters. A comprehensive review by Abdallah et al. (2020) on the application of artificial intelligence in waste management also revealed this gap. Predictability of physical waste streams is crucial to sustainability of MSW management. This study therefore attempts to fill this gap by building an optimal neural network model to predict the physical composition of MSW using meteorological parameters. This study aims at investigating the optimal combinations of network architecture, training algorithm (TA) and activation functions (AF) that can accurately predict the physical composition of MSW and also evaluate the impact of seasonal variation on the fractions of physical waste using the city of Johannesburg as a case study. Significant meteorological parameters such as maximum and minimum temperatures, wind speed and humidity were set as input variables to predict the fraction of organic, paper, plastics and textile waste streams. Waste characterization data in Johannesburg reveals that the waste streams considered in this study are the ones with significant variation in different seasons; there is negligible impact on other waste streams, which is also the case in the waste characterization study of Kamran et al. (2015).

Table 1.

Some selected studies which used ANN for specific applications in waste managements.

Case study	Application	Performance metrics	Reference
Istanbul, Turkey	Leachate flow control in landfill sites	R² = 0.84, MSE = 0.00168	Karaca and Özkaya, 2006
Logan city, Australia	Waste generation forecast	R² = 0.46, RMSE = 290.55	Abbasi and Hanandeh, 2016
Mashhad, Iran	Weekly waste generation forecast	R² = 0.80, AARE=3.12	Noori et al., 2010a
Faridabad city, India	Waste generation forecast	R² = 0.839, RMSE = 0.001927	Singh and Satija, 2018
Jordan	Modelling and optimization of biogas production in a waste bio-digester	R² = 0.8703	Qdais et al., 2010
China	Prediction of heating value of waste	MAPE = 8%	Ozveren, 2016
Malaysia	A MLP classifier was used to detect waste bin level	–	Islam et al., 2014
Austin City, Texas, USA	Optimization of vehicle route for waste collection	MAPE = 10.92–16.51%	Vu et al., 2019
Alborz Industrial City, Iran	Identification of the effect of MSW compost and phytoremediation of the contaminated soil	R² = 0.98	Roohi et al., 2020

MSW: municipal solid waste; MLP: multilayer perceptron; AARE: Average Absolute Relative Error.

Materials and method

Data set

In this study, the model was developed using waste characterization data obtained in summer 2015 and winter 2016 in the city of Johannesburg comprising the percentage composition of organic, paper, plastic and textile waste streams. Four significant meteorological parameters, namely maximum temperature, minimum temperature, humidity and wind speed, for the city of Johannesburg were extracted from South Africa Weather Service for the respective periods of study in 2015 and 2016. Due to the unavailability of experimental waste characterization data in the spring and autumn seasons, the impact of changes in the weather conditions for these two seasons on the physical composition of waste was not considered in this study. Waste collection in Johannesburg is from two different two sources: daily non-compacted (DNC) waste collected from hotels, restaurant and food stores and the round collected refuse (RCR) collected weekly from residential households (Ayeleru et al., 2018). Table 2 presents the statistical properties of the input and output data. The statistical correlation (R²-value) between all the independent variables of the input data and each of the output variables is presented in Table 3.

Table 2.

Statistical description of the input and output parameters.

Statistical characteristics	Input variable				Output variable
Statistical characteristics	Maximum temperature (°C)	Minimum temperature (°C)	Wind speed (m s⁻¹)	Humidity (%)	Organics (%)	Paper (%)	Plastic (%)	Textile (%)
Maximum	38.3	20.3	7.5	83.0	40.1	27.8	38.9	15.5
Minimum	10.1	−2.9	0.0	56.0	11.6	11.2	13.2	0.0
Mean	24.2	8.7	3.75	69.5	26.7	19.9	27.2	7.7
Standard deviation	19.9	16.4	5.3	19.1	20.2	11.8	18.3	10.9

Table 3.

Statistical correlation (R²-value) between the independent variables and each of the output variables.

Variable	Organics (%)	Paper (%)	Plastic (%)	Textile (%)
Maximum temperature (°C)	0.252	0.098	0.212	0.094
Minimum temperature (°C)	0.396	0.228	0.187	0.061
Wind speed (m s⁻¹)	0.432	0.035	0.207	0.108
Humidity (%)	0.072	0.031	0.044	0.038

Study area

The city of Johannesburg is the constitutional headquarters of South Africa located in the Witwatersrand range of hills (Bwalya, 2019). The city is geospatially located at latitude 26°12’08” S and longitude 28°02’37” E with an area of 1645 km² and an elevation of 1767 m. The sub-tropical highland weather in Johannesburg produces a mild sunny climate in winter and moderately warm climate in summer. The four major seasons in South Africa generally are winter, summer, autumn and spring. The warmest and wettest month of the year is January, which is in summer, while July, which is in winter, is the driest and coldest month of the year with temperature dropping as low as 4.1°C. Waste management services in the city are operated by the municipality-owned Pikitup Company whose operation capacity is 1.6 million tonnes of MSW collection per annum, with four functional landfill sites (Mbuli, 2015). Figure 1 presents a map of Gauteng showing the major landfill sites in Johannesburg. Based on information available on south database, Statistics South Africa (STATSA), the population and the quantity of waste generated in the city of Johannesburg from 2010 to 2019 is presented in Figure 2. In addition the average values of some of the characteristics of waste generated in the city are presented in Table 4.

Figure 1.

Map of Gauteng showing major landfill sites in Johannesburg.

Figure 2.

Population and waste quantity generated in the city of Johannesburg (2010–2019).

Table 4.

Some characteristics of waste generated in the city of Johannesburg.

Waste characteristics	Range of values	Average
Recyclables (w/w)%	26.0–34.0	30.0
Non-recyclables (w/w)%	18.0–21.0	19.5
Moisture content (%)	60.9–67.1	63.9
Volatile matter (%)	21.8–23.0	22.9
Fixed carbon (%)	4.4–11.9	8.2
Organic (%)	13.9–28.7	21.4
Paper (%)	13.5–18.9	16.2
Plastic (%)	18.2–26.9	22.6
Textile (%)	4.9–7.8	6.4
Metals (%)	4.9–8.5	6.7

Source: adapted from Ayeleru et al. (2018).

Artificial neural network

Unlike classical programming techniques, ANN works in a similar manner to the human brain by learning from example, making it an excellent self-learning and self-adapting tool which does not require a user-defined solving algorithm (Yaghini et al., 2013). Owing to its approximation capabilities, it is used as an appropriate tool for universal function estimators (Bahrami et al., 2019). ANN is used to approximate functions by adopting iterative procedures focused on error minimization by assigning a weight matrix through the correct choice of AF for solving non-linear processes (Chattopadhyay and Chattopadhyay, 2018). The AF represents the rate of firing in the cell and determines the output from a set of inputs. The neural network can be represented mathematically using equation (2) where the weights and bias assigned to each layer are adjusted.

y = F (\sum_{i = 0}^{m} w_{i} . x_{i} + b)

(2)

where $x_{i} =$ input value, $w_{i} =$ weight value, $b =$ bias, $y =$ output, $F =$ AF.

The learning rate of the neural network and consequently its performance to a large extent is affected by the AF selected (Ebrahimpour et al., 2008), which could be a linear or non-linear function. Figure 3 shows the architectural structure of the model used in this study

Figure 3.

Model architecture consisting of four inputs, several neurons in the hidden layers and four outputs.

Building the optimal neural network

The optimal integration of ANN architecture, AF and TA relies on factors such as the complexity of the desired functions, the size of the input–output datasets and the expected model accuracy and precision, making the choice of best network a difficult task (Bahrami et al., 2019). In this study, the training process was done using ten different TA: namely, Levenberg-Marquardt (LM), scaled conjugate gradient backpropagation (SCG), gradient descent with adaptive algorithm (GDA), Broyden -Fletcher -Goldfarb -Shanno quasi-Newton (BQN), resilient backpropagation (RP), conjugate gradient with Powell/Beale restarts (CGB), conjugate descent backpropagation with Fletcher-Reeves restarts (CGF), conjugate descent with Polak-Ribiere (CGP), one step secant (OSS), variable learning rate backpropagation (VLRB). The AF used at the hidden layers and the output layers are softmax, logsig and tansig and purelin. Table 5 summarizes the parameters of the network which were continuously varied to obtain the optimal network. The dataset was divided into 70% for training and the remaining for testing. The training data was normalized before building the model using equation (3) to ensure that it falls in the same range.

Table 5.

Network parameter specifications for building the optimal network.

Parameters	Values	Remarks
Neurons in the hidden layer	1–15	A range of 1–15 neurons was tested in a single hidden layer
Activation functions in the input layer	4	The best combination of four functions in the hidden and output layer was investigated in each topology
Activation functions in the hidden layer	4	The best combination of four functions in the hidden and output layer was investigated in each topology
Training algorithm	9	All nine training algorithms were tested at each topology

y_{n o r m} = \frac{x - x_{m i n}}{x_{m a x} - x_{m i n}}

(3)

where $x$ is the mean of the variable, $x_{m i n}$ is the minimum variable and $x_{m a x}$ is the maximum variable and $y_{n o r m}$ is the normalized data.

To obtain the optimal network, several sub-models were stimulated with different topology ranging from 1 to 15 neurons in a single hidden layer. In each topology, 36 sub-models were stimulated by a trial-and-error method through several combinations of the nine TA with all the AF at the hidden and output layers; however the optimal combination for each topology was selected based on minimum error criteria. A single hidden layer was selected because previous research with ANN has proven that a single layer is enough for complex functions approximation (Noori et al., 2010b) and more than one hidden layer is unnecessary (Noori et al., 2011)

Evaluating the model performance

The eligibility and accuracy of the model developed in this study was evaluated using some statistical metrics with the 30% hold-out data for testing. The following statistical metrics were used to evaluation the performance of the models developed for each of the waste streams: root mean square error (RMSE), mean absolute deviation (MAD), mean absolute percentage error (MAPE) and correlation coefficient (R) represented in equations (4) to (6). The RMSE and MAD measures the variability between observed and the predicted values and determines eligibility of the developed model to predict physical waste streams (Olatunji et al., 2019). The correlation coefficient (R) evaluates the agreement between the observed and the predicted waste streams.

M A P E = \frac{1}{N} \sum_{i = 1}^{N} | \frac{O_{i} - P_{i}}{O_{i}} | \times 100 %

(4)

R M S E = {(\sum_{i = 1}^{N} \frac{{(P_{i} - O_{i})}^{2}}{N})}^{\frac{1}{2}}

(5)

M A D = \sum_{i = 1}^{N} \frac{(O_{i} - P_{i})}{N}

(6)

Results and discussion

Performance evaluation result

The performance of ANN is influenced by careful choice of hidden layer and neuron numbers, AF and TA. This study has investigated the effect of these parameters on the performance of the models developed and to select the optimal network. The result of the simulation shows that satisfactory models were obtained between 1 and 15 neurons as the model’s performance showed no significant improvement above 15 neurons. More so, a decline was observed in the models performance at two hidden layers, this verifies that waste composition prediction can be done in a single hidden layer. In each topology, 36 sub-models were stimulated with 1–15 neurons in the hidden layer and varied combinations of AF and TA. However, the optimal sub-models in each topology were selected and are presented in this section.

Organic

Performance metrics values of optimal sub-models in each topology are presented in Table 6. It was observed that the performance of these sub-models based on RMSE, MAD and MAPE do not follow a regular trend as the neurons in hidden layer increased from 1 to 15; however an improvement was observed in the R-values as neuron numbers increased up to 10, above which no significant improvement occurred. The optimal network that predicted the organic waste stream was obtained at 10 neurons with the lowest error values (RMSE=3.9293, MAD=2.5738, MAPE=13.0087) and highest R-value of 0.9162 in testing with the combination of softmax and logsig at the hidden and output layer, respectively, and LM. The optimal network is italicized in Table 6. The softmax function outperformed others as its combination mostly at the hidden layer produced more optimal sub-models.

Table 6.

Performance metrics for sub-models in all topology for organic waste.

Neurons in the hidden layer	AF in the hidden layer	AF in the output layer	Training algorithm	Performance metrics
Neurons in the hidden layer	AF in the hidden layer	AF in the output layer	Training algorithm	RMSE	MAD	MAPE	R
1	tansig	purelin	SCG	7.562	4.245	17.524	0.644
2	tansig	logsig	LM	5.432	5.330	15.203	0.634
3	softmax	purelin	GDA	6.429	5.224	14.570	0.652
4	purelin	softmax	RP	4.235	4.035	14.962	0.739
5	softmax	logsig	RP	5.640	3.246	13.246	0.782
6	logsig	logsig	CGF	7.084	3.132	14.326	0.635
7	logsig	softmax	LM	6.346	2.547	16.240	0.786
8	tansig	logsig	LM	5.324	2.836	18.345	0.845
9	tansig	tansig	SCG	4.804	2.245	21.045	0.869
10	softmax	logsig	LM	3.929	2.574	13.009	0.916
11	softmax	tansig	LM	4.105	2.362	14.260	0.894
12	softmax	softmax	LM	4.305	2.732	13.708	0.889
13	purelin	tansig	SCG	5.384	3.256	13.908	0.893
14	softmax	tansig	OSS	8.530	2.884	14.320	0.902
15	tansig	softmax	CGF	6.430	2.741	14.219	0.865

AF: activation function; RMSE: root mean square error; MAD: mean absolute deviation; MAPE: mean absolute percentage error; R: correlation coefficient; SCG: scaled conjugate gradient backpropagation; LM: Levenberg-Marquardt; GDA: gradient descent with adaptive algorithm; RP: resilient backpropagation; CGF: conjugate descent backpropagation with Fletcher-Reeves restarts; OSS: one step secant.

The accuracy of the optimal model is 87% (MAPE=13.008) showing an acceptable fit between the observed and the predicted organic waste while the lower error values of RMSE and MAD show the eligibility of the optimal model in predicting organic waste. The R-value of 0.9162 shows a good agreement between the observed and predicted values. Figure 4 is the test plot of the observed and the predicted organic waste fraction. It further depicts a strong agreement between the observed and predicted values of waste streams with a similar trend between the observed and predicted values. However some under-predictions and over-predictions are observed in the model prediction outcome which are exhibited by some marginal variations in some test samples. This could be attributed to the sensitivity and the response of the model to the extreme and unusual weather parameters recorded on the respective days which represent points of mis-predictions.

Figure 4.

Observed and predicted test sample plot for organic waste.

Paper

Table 7 presents the performance metrics of the optimal models in each topology for paper waste. An unexpected early convergence at a smaller number of neurons was noticed for prediction of paper waste. The optimal network was obtained at four neurons in the hidden layer with the combination of logsig and purelin at the hidden and output layer and SCG. It was observed that the performance of the sub-models in terms of RMSE, MAD, MAPE and R began to decline at neuron numbers above four. The performance metrics of the optimal network are RMSE=3.243, MAD=3.162, MAPE=17.936 and R=0.862. The accuracy of the optimal network is 82.1% (MAPE=17.936); this depicts a reasonable fit between the observed and predicted paper waste stream. The optimal network was selected based on minimum error values. Based on its RMSE and MAD values showing the variability between the observed and predicted values of paper waste streams, the optimal network model selected is eligible to predict paper waste. The network structure with logsig combinations at either hidden or output layers produced more optimal models in each topology than other functions. All nine TA produced at least one optimal sub-model in each topology; however the SCG-trained network had a higher number of optimal sub-models in each topology. Shown in Figure 5 is the test plot of the observed and the predicted paper waste stream fraction with the optimal model selected. A similar trend is noted between the observed and predicted percentage composition; however, some test samples exhibit marginal variations. This could be attributed to the sensitivity and the response of the model to the extreme and unusual weather parameters recorded on the respective days in the season under study which represent points of over-fitting.

Table 7.

Performance metrics for sub-models in all topology for paper waste.

Neurons in the hidden layer	AF in the hidden layer	AF in the output layer	Training algorithm	Performance metrics
Neurons in the hidden layer	AF in the hidden layer	AF in the output layer	Training algorithm	RMSE	MAD	MAPE	R
1	tansig	tansig	RP	6.875	4.525	21.934	0.725
2	logsig	tansig	SCG	6.824	4.436	21.430	0.735
3	softmax	purelin	LM	6.148	4.214	22.706	0.763
4	tansig	purelin	LM	5.391	3.512	21.706	0.754
5	logsig	tansig	SCG	3.243	3.162	17.936	0.862
6	logsig	logsig	SCG	3.452	3.111	17.304	0.878
7	softmax	logsig	OSS	3.178	3.456	17.658	0.843
8	tansig	logsig	CGB	3.952	4.141	18.960	0.849
9	tansig	softmax	LM	4.680	4.884	19.637	0.793
10	tansig	logsig	GDA	4.960	4.568	20.735	0.754
11	logsig	softmax	VLRB	5.349	4.773	21.682	0.775
12	tansig	purelin	BQN	5.491	5.125	21.953	0.672
13	softmax	purelin	SCG	5.580	5.087	22.453	0.610
14	logsig	logsig	CGF	4.840	5.218	22.236	0.584
15	logsig	purelin	SCG	5.048	4.630	23.157	0.595

Figure 5.

Observed and predicted test sample plot for paper waste.

Plastic

Similar procedures for obtaining the optimal network was followed for plastics waste. Table 8 presents the performance metrics values of the optimal sub-models selected in each topology based on minimum error value and maximum R-values for plastic waste. The optimal model with the minimum error values is a network with 14 neurons in the hidden layer trained with RP algorithms and with tansig and softmax function in the hidden and output layer, respectively. The RMSE and MAD values of the optimal model are 5.126 and 4.471 while the MAPE is 12.124, presenting a model which is 87.9% accurate in mapping an output to the input in the test samples. It was observed that the performances of the sub-models were better at higher numbers of neurons on the hidden layer; however, the performance of the optimal sub-model in each topology does not follow a regular trend as the neuron numbers increase. The RP algorithm trained best to give the overall best network despite the fact that it did not produce the highest number of optimal sub-models. The optimal model is eligible in predicting plastic waste fraction based on the RMSE and MAD values, the MAPE values also depict an acceptable agreement between the observed and predicted plastic waste stream. The observed and predicted value of the plastic waste stream follows a similar trend with no significant variation as presented in Figure 6. The discrepancies at some test samples as earlier noted could be due to the sensitivity and the response of the model predicting plastic waste fraction to the extreme and unusual weather parameters recorded on the respective days in the season under study which represent points of over-fitting and under-fitting.

Table 8.

Performance metrics for sub-models in all topology for plastic waste.

Neurons in the hidden layer	AF in the hidden layer	AF in the output layer	Training algorithm	Performance metrics
Neurons in the hidden layer	AF in the hidden layer	AF in the output layer	Training algorithm	RMSE	MAD	MAPE	R
1	tansig	purelin	GDA	7.676	6.243	13.663	0.736
2	logsig	tansig	LM	6.813	6.036	16.641	0.798
3	tansig	purelin	LM	10.341	6.347	14.462	0.713
4	logsig	tansig	LM	10.463	8.843	16.725	0.607
5	tansig	purelin	SCG	9.578	9.421	14.256	0.549
6	softmax	logsig	LM	11.244	8.438	12.723	0.613
7	logsig	tansig	RP	8.631	6.372	13.321	0.724
8	softmax	tansig	CGB	7.932	5.243	13.127	0.803
9	softmax	tansig	BQN	8.212	6.376	15.631	0.764
10	logsig	tansig	CGF	7.345	5.313	17.751	0.817
11	logsig	purelin	OSS	6.889	8.165	14.632	0.738
12	softmax	tansig	SCG	6.323	7.331	17.361	0.823
13	tansig	logsig	GDA	7.648	7.734	14.751	0.795
14	tansig	softmax	RP	5.126	4.471	12.124	0.834
15	tansig	logsig	BQN	6.230	5.348	13.255	0.801

Figure 6.

Observed and predicted test sample plot for plastic waste.

Textile

The statistical metrics of sub-models in each topology are presented in Table 9. The performance of the optimal sub-model in each topology was found to improve steadily from 1 to 10 neurons; however an unexpected decline was noticed above 10 neurons. Lower error values, RMSE and MAD were noticed in the model developed for textile waste compared to other outputs; this is because of the relatively lower fraction of textile waste in the total waste. The optimal model based on minimum error values and highest R-values was obtained at eight neurons with tansig and logsig combination at the hidden and output layer and LM algorithm. The RMSE and MAD values of the optimal model are 2.115 and 2.196 while the MAPE is 14.453, presenting a model which is 85.6% accurate in mapping an output to the input in the test samples and depicting an acceptable agreement between the observed and predicted textile waste stream. The observed and predicted value of textile waste fraction in the test plot in Figure 7 follows a similar trend with no significant variation. The response of the models to usual weather parameters on some days in the season could also be accountable for the over-prediction and under-prediction for some test samples. Table 10 compares the performance results of all the optimal models for organic, paper, plastic and textile waste.

Table 9.

Performance metrics for sub-models in all topology for textile waste.

Neurons in the hidden layer	AF in the hidden layer	AF in the output layer	Training algorithm	Performance metrics
Neurons in the hidden layer	AF in the hidden layer	AF in the output layer	Training algorithm	RMSE	MAD	MAPE	R
1	tansig	purelin	OSS	2.971	2.607	14.347	0.809
2	logsig	logsig	VLRB	2.637	3.016	15.016	0.763
3	softmax	tansig	LM	3.703	3.107	14.679	0.793
4	logsig	purelin	GDA	3.240	2.516	15.526	0.806
5	softmax	logsig	LM	3.107	3.211	14.515	0.822
6	purelin	logsig	RP	2.524	2.776	14.214	0.814
7	purelin	tansig	CGB	2.245	2.647	14.734	0.768
8	tansig	logsig	LM	2.115	2.196	14.453	0.826
9	softmax	softmax	SCG	3.207	2.273	14.446	0.814
10	logsig	purelin	VLRB	2.434	2.991	14.785	0.809
11	tansig	softmax	LM	4.296	3.563	15.641	0.735
12	logsig	logsig	BQN	5.536	3.371	16.017	0.694
13	purelin	tansig	RP	5.601	3.264	16.846	0.718
14	tansig	logsig	LM	5.304	3.463	17.012	0.677
15	logsig	softmax	SCG	5.642	2.699	15.094	0.635

AF: activation function; RMSE: root mean square error; MAD: mean absolute deviation; MAPE: mean absolute percentage error; R: correlation coefficient; SCG: scaled conjugate gradient backpropagation; LM: Levenberg-Marquardt; GDA: gradient descent with adaptive algorithm; RP: resilient backpropagation; OSS: one step secant; CGB: conjugate gradient with Powell/Beale restarts; VLRB: variable learning rate backpropagation; BQN: BFGS quasi-Newton.

Figure 7.

Observed and predicted test sample plot for textile waste.

Table 10.

Comparison of the optimal model and parameters for each output.

Output	Optimal topology	Optimal network parameters combination	RMSE	R²
Organic	4-10-1	LM, softmax in the hidden layer and logsig in the output layer	3.929	0.916
Paper	4-5-1	SCG, logsig in the hidden layer and tansig in the output layer	3.246	0.862
Plastic	4-14-1	RP, tansig in the hidden layer and softmax in the output layer	5.126	0.834
Textile	4-8-1	LM, tansig in the hidden layer and logsig in the output layer	2.115	0.826

RMSE: root mean square error; LM: Levenberg-Marquardt; SCG: scaled conjugate gradient backpropagation; RP: resilient backpropagation.

Discussion

Waste is collected from two different sources in the city of Johannesburg. DNC waste is collected daily from hotels, restaurants and food stores and the RCR waste is collected weekly from residential households. It was observed that the pattern of variation in the fraction of waste streams from DNC and RCR sources at different seasons vary slightly. This marginal difference can be generally attributed to the different consumption lifestyle of the waste generators from the RCR and DNC sources. The highest observed fraction of organic waste is about 40% which is from the RCR sources in the winter season, while the lowest observed fraction of organic waste was about 12% obtained from the DNC source in the summer season; predicted as 45.1% and 9.8%, respectively. It was observed that the variation in the fractions of organic waste from the DNC source across the seasons is greater than that of the RCR sources. Therefore we can conclude that seasonal variation has more effect on the organic waste fraction of DNC sources than the RCR sources. This is because DNC sources are generated directly from hotels, restaurants and food shops and are collected daily; apparently the daily consumption pattern which produces food waste, fruit and vegetable, and composite waste at those points varies significantly in different seasons.

Waste from the DNC produced the highest paper waste fraction of 27.8% in summer and was predicted as 29.7% while the lowest fraction of paper waste generated was 11.2% from the RCR source in winter and predicted as 10.4%. A wider variation is observed in the fractions of RCR paper waste fractions during the two seasons. It is therefore reasonable to conclude that the changes in climatic conditions in winter and summer influence paper waste streams from the RCR source more than the DNC source. The significant changes in the residential household consumption pattern which affects the quantity of paper packaging, tissue paper and other paper waste generated in different seasons could be attributed to this.

The highest fraction of plastic waste generated was 38.9% from the DNC source in summer and was predicted as 40.5% while the lowest fraction of plastic waste generated was 13.2% from the RCR source in summer and predicted as 15%. Although more plastic waste is generated in summer than in winter from both sources, it was observed that the seasonal variation influences the plastic waste from both DNC and RCR sources in the same manner.

Generally more textile waste is expected to be generated in winter. The highest fraction of textile waste generated was 15.5% from the RCR source in winter and was predicted as 13.8% while the lowest fraction of textile waste generated was 0% from the DNC source in summer and predicted as 2.9%. More textile waste will always be produced from RCR and in winter. This is because more clothing, head coverings and gloves are used for keeping warm in residential spaces, which consequently results in more textile waste. Therefore, the difference in the textile waste from the RCR source in winter and summer is wider than the difference from the DNC source, implying a stronger impact of the changes in weather parameters on RCR textile waste than the DNC textile waste in different seasons.

Conclusion

This study has presented a neural network model to predict the percentage composition of MSW in the city of Johannesburg. Influence of the choice of several network architectures, training algorithms and activation functions on the performance of the models that predict the variability of organic, paper, plastics and textile waste in the winter and summer seasons was evaluated. The best prediction outcome was obtained with a topology 4-10-1, 4-14-1, 4-5-1 and 4-8-1 for organic, paper, plastic and textile waste, respectively. R-values of the optimal network in each topology with the best combinations of AF and TA were 0.916, 0.862, 0.834 and 0.8616, respectively for organics, paper, plastics and textile at the testing phase. Generally, the LM, SCG and RP algorithm had the best performance as they produced at least one optimal model in each output. The result of the study verifies that waste composition prediction can be done in a single hidden layer. The variations in the waste streams for all seasons were generally attributed to the change in consumption patterns, lifestyle adjustment and change in the activities of an individual, household and the municipality at large. It was further revealed that the changes in seasonal weather conditions had more effect on the DNC organic waste than the RCR, while paper from RCR was more impacted by seasonal variation than that from DNC, plastics waste is impacted in the same manner for the DNC and RCR sources while textiles from RCR had a wider difference in percentage composition in both seasons than textiles from DNC.

Footnotes

Acknowledgements

The authors appreciate the management of the Department of Mechanical Engineering Science, University of Johannesburg, South Africa for providing workspace and research facilities for this research.

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Oluwatobi Adeleke

References

Abbasi

El Hanandeh

(2016) Forecasting municipal solid waste generation using artificial intelligence modelling approaches. Waste Management 56: 13–22.

Abdallah

Abu Talib

Feroz

, et al. (2020) Artificial intelligence applications in solid waste management: A systematic research review. Waste Management 109: 231–246.

Abylkhani

Aiymbetov

Yagofarova

, et al. (2019) Seasonal characterisation of municipal solid waste from Astana city, Kazakhstan: Composition and thermal properties of combustible fraction. Waste Management and Research 37: 1271–1281.

Aslani

Taghipour

(2018) Seasonal characterization and quantification of municipal solid waste: Energy content and statistical analysis. Journal of Advanced Environmental Health Research 6: 34–43.

Ayeleru

Okonta

Ntuli

(2018) Municipal solid waste generation and characterization in the City of Johannesburg: A pathway for the implementation of zero waste. Waste Management 79: 87–97.

Bahrami

Akbari

Amin

(2019) Develop 24 dissimilar ANNs by suitable architectures & training algorithms via sensitivity analysis to better statistical presentation: Measure MSEs between targets & ANN for Fe–CuO/Eg–Water nanofluid. Physica A: Statistical Mechanics and its Applications 519: 159–168.

Bayar

Demir

Engin

(2009) Modeling leaching behavior of solidified wastes using back-propagation neural networks. Ecotoxicology and Environmental Safety 72: 843–850.

Bernstad

Jansen

Aspegren

(2012) Local strategies for efficient management of solid household waste: The full-scale Augustenborg experiment. Waste Management and Research 30: 200–212.

Boer

Jedrczak

Kowalski

, et al. (2010) A review of municipal solid waste composition and quantities in Poland. Waste Management 30: 369–377.

10.

Bwalya

(2019) The smart city of Johannesburg, South Africa. In: Anthopoulos

(ed.) Smart City Emergence. 1st edn. London: Elsevier.

11.

Çavu

(2019) Improving forecasting accuracy of time series data using a new ARIMA-ANN hybrid method and empirical mode decomposition. Neurocomputing 361: 151–163

12.

Chattopadhyay

(2018) Conjugate gradient descent learned ANN for Indian summer monsoon rainfall and efficiency assessment through Shannon-Fano coding. Journal of Atmospheric and Solar Terrestrial Physics 179: 202–205.

13.

Chen

(2009) Using neural networks and data mining techniques for the financial distress prediction model. Expert Systems with Applications 36: 4075–4086.

14.

Denafas

Ruzgas

Martuzevičius

, et al. (2014) Seasonal variation of municipal solid waste generation and composition in four East European cities. Resources, Conservation and Recycling 89: 22–30.

15.

Ebrahimpour

Noor

Raja

, et al. (2008) A modeling study by response surface methodology and artificial neural network on culture parameters optimization for thermostable lipase production from a newly isolated thermophilic Geobacillus sp. strain ARM. BMC Biotechnology 8: 96.

16.

Gallardo

Carlos

Colomer

(2018) Analysis of the waste selective collection at drop-off systems: Case study including the income level and the seasonal variation. Waste Management and Research 36: 30–38.

17.

Gidarakos

Havas

Ntzamilis

(2006) Municipal solid waste composition determination supporting the integrated solid waste management system in the island of Crete. Waste Management 26: 668–679.

18.

Gómez

Meneses

Ballinas

, et al. (2009) Seasonal characterization of municipal solid waste (MSW) in the city of Chihuahua, Mexico. Waste Management 29: 2018–2024.

19.

Hannan

Arebey

Begum

, et al. (2016) Content-based image retrieval system for solid waste bin level detection and performance evaluation. Waste Management 50: 10–19.

20.

Intharathirat

Abdul Salam

Kumar

, et al. (2015) Forecasting of municipal solid waste quantity in a developing country using multivariate grey models. Waste Management 39: 3–14.

21.

Islam

Hannan

Basri

, et al. (2014) Solid waste bin detection and classification using Dynamic Time Warping and MLP classifier. Waste Management 34: 281–290.

22.

Jadoon

Batool

Chaudhry

(2014) Assessment of factors affecting household solid waste generation and its composition in Gulberg Town, Lahore, Pakistan. Journal of Material Cycles and Waste Management 16: 73–81.

23.

Kamran

Chaudhry

Batool

(2015) Effects of socio-economic status and seasonal variation on municipal solid waste composition: A baseline study for future planning and development. Environmental Sciences Europe 27: 16.

24.

Karaca

Özkaya

(2006) NN-LEAP: A neural network-based model for controlling leachate flow-rate in a municipal solid waste landfill site. Environmental Modelling and Software 21: 1190–1197.

25.

Mbuli

(2015) Alternative waste treatment technology project Ingwenyama Resort, Mpumulanga Province. A waste report of the city of Johannesburg waste management. City of Johannesburg.

26.

Noori

Karbassi

Mehdizadeh

, et al. (2011) A framework development for predicting the longitudinal dispersion coefficient in natural streams using an artificial neural network. Environmental Progress and Sustainable Energy 30: 439–449.

27.

Noori

Karbassi

Sabahi

(2010a) Evaluation of PCA and Gamma test techniques on ANN operation for weekly solid waste prediction. Journal of Environmental Management 91: 767–771.

28.

Noori

Khakpour

Omidvar

, et al. (2010b) Comparison of ANN and principal component analysis-multivariate linear regression models for predicting the river flow based on developed discrepancy ratio statistic. Expert Systems with Applications 37: 5856–5862.

29.

Olatunji

Akinlabi

Madushele

, et al. (2019) Estimation of municipal solid waste (MSW) combustion enthalpy for energy recovery. EAI Endorsed Transactions on Energy Web 19: 1–9.

30.

Oliveira

Sousa

Dias-Ferreira

(2019) Artificial neural network modelling of the amount of separately-collected household packaging waste. Journal of Cleaner Production 210: 401–409.

31.

Ozkaya

Demir

Bilgili

(2007) Neural network prediction model for the methane fraction in biogas from field-scale landfill bioreactors. Environmental Modelling and Software 22: 815–822.

32.

Ozveren

(2016) An artificial intelligence approach to predict a lower heating value of municipal solid waste. Energy Sources, Part A: Recovery, Utilization and Environmental Effects 38: 2906–2913.

33.

Panapakidis

Dagoumas

(2016) Day-ahead electricity price forecasting via the application of artificial neural network based models. Applied Energy 172: 132–151.

34.

Pathak

Mainali

Abo

, et al. (2020) Quantification and characterization of the municipal solid waste for sustainable waste management in newly formed municipalities of Nepal. Waste Management and Research 38: 1007–1018.

35.

Qdais

Hani

Shatnawi

(2010) Modeling and optimization of biogas production from a waste digester using artificial neural network and genetic algorithm. Resources, Conservation and Recycling 54: 359–363.

36.

Roohi

Jafari

Jahantab

, et al. (2020) Application of artificial neural network model for the identification the effect of municipal waste compost and biochar on phytoremediation of contaminated soils. Journal of Geochemical Exploration 208: 106399.

37.

Sibiya

Olukunle

Okonkwo

(2017) Seasonal variations and the influence of geomembrane liners on the levels of PBDEs in landfill leachates, sediment and groundwater in Gauteng Province, South Africa. Emerging Contaminants 3: 76–84.

38.

Singh

Satija

(2018) Prediction of municipal solid waste generation for optimum planning and management with artificial neural network—case study: Faridabad City in Haryana State (India). International Journal of Systems Assurance Engineering and Management 9: 91–97.

39.

Slimani

Sbiti

, et al. (2019) Traffic forecasting in Morocco using artificial neural networks. Procedia Computer Science 151: 471–476.

40.

Solano Meza

Orjuela Yepes

Rodrigo-Ilarri

, et al. (2019) Predictive analysis of urban waste generation for the city of Bogotá, Colombia, through the implementation of decision trees-based machine learning, support vector machines and artificial neural networks. Heliyon 5: e02810.

41.

Vrancken

Longhurst

Wagland

(2019) Deep learning in material recovery: Development of method to create training database. Expert Systems with Applications 125: 268–280.

42.

Bolingbroke

KTW

, et al. (2019) Assessment of waste characteristics and their impact on GIS vehicle collection route optimization using ANN waste forecasts. Waste Management 88: 118–130.

43.

Wang

Liu

Song

, et al. (2016) A novel model: Dynamic choice artificial neural network (DCANN) for an electricity price forecasting system. Applied Soft Computing Journal 48: 281–297.

44.

Yaghini

Khoshraftar

Fallahi

(2013) A hybrid algorithm for artificial neural network training. Engineering Applications of Artificial Intelligence 26: 293–301.

45.

Zeng

Trauth

Peyton

, et al. (2005) Characterization of solid waste disposed at Columbia Sanitary Landfill in Missouri. Waste Management and Research 23: 62–71.

Application of artificial neural networks for predicting the physical composition of municipal solid waste: An assessment of the impact of seasonal variation

Abstract

Keywords

Introduction

Materials and method

Data set

Study area

Artificial neural network

Building the optimal neural network

Evaluating the model performance

Results and discussion

Performance evaluation result

Organic

Paper

Plastic

Textile

Discussion

Conclusion

Footnotes

Acknowledgements

Declaration of conflicting interests

Funding

ORCID iD

References