Abstract
The electrical conductivity of mould flux with chemical constitution of CaO-SiO2-Al2O3-NaO-K2O-MgO-CaF2-Cr2O3-FeO-MnO has been investigated. The assessed database contains one unitary, five binary, nine ternary, four quaternary, two quinary, two senary and one octonary subsystems. Each constitutional component is in connection to another via some direct or indirect links. A multilayer artificial neural network method was developed and implemented in the database. The work provides a method to calculate the relationships between the composition, temperature and electrical property of the mould flux within the defined parameter ranges. The results have been validated against those experimental data that are not included in the training of the neural networks.
Introduction
Many steelmaking companies are using a mould flux with chemical constitution of CaO-SiO2-Al2O3-NaO-K2O-MgO-CaF2-Cr2O3-FeO-MnO to cast stainless steels in continuous casting mould. The mould slag contains high concentration of fluorine to reduce viscosity and solidification temperature [1]. Cr2O3-FeO-MnO can either pre-exist in mould powder or enter to liquid slag from oxidation of alloying elements in liquid stainless steel at casting mould and tundish [2]. The electrical conductivity of this system has never been assessed systematically although the data for its subsystem are notably rich.
The primary driving force to study the electrical properties of mould powder is for electroslag remelting processing [3,4]. This has, therefore, attracted considerable experimental measurement activities [3-6], theoretical modelling [7,8] and data assessment [9] for several subsystems. The recent environment regulation on carbon neutral steelmaking promotes the electrification of continuous casting. Electric field affects materials segregation [10], viscosity [11], distribution of oxide inclusions [12] and surface roughness in the cast mould [13]. This demands an analytical expression to represent the relationship between the chemical constitution, temperature and electrical conductivity of the system. The aim of the present work was to provide a mean to calculate the constitution and temperature-dependent electrical conductivity of the mould flux materials.
This work uses artificial neural network to approach the target. The method relies on available data to extrapolate values in the unknown parameters’ range [14]. Ideally, a relationship between the electrical conductivity, constitution and temperature should be derived from micro-mechanisms [15]. The previous theoretical modelling for the electrical conductivity of mould slag has been based largely on an assumption that electrical conduction is carried out by the moving ionised atoms. Obviously, viscosity affects the mobility of ionised atoms and hence plays an important role in electrical conduction. The basicity of mould slag affects the amount and length of silicate chains. This lays a foundation for the optical basicity model to calculate the electrical conductivity of mould slag [7]. On the other side, the interactions between constitutional components influence the mobility of atoms. This forms a basis to use particle interaction to calculate electrical conductivity [8]. For the oxides which can conduct electricity by electronic means, such as FeO, the above-mentioned theoretical models stray away [2]. The data-based artificial neural network has proved to be an effective solution to provide alternative solution [9].
The artificial neural network is different from data fitting such as the least square method. The former prevents overfitting but the latter seeks best fitting to data. The recent development of artificial intelligent learning enables the method to reduce its dimension according to the conservative laws that are hidden in the data [16,17]. Artificial neural network has potential to indicate some physics natures buried in the big data. This work intends to provide a method to calculate the constitution and temperature-dependent electrical conductivity of the mould flux.
Artificial neural network
A supervised multilayer artificial neural network with back propagation learning algorithm has been coded for the present purpose. The network has an input layer containing 11 units to record the constitutional compositions and temperature. The output layer has 1 unit to provide the computational result for electrical conductivity. There are T-1 hidden layers each containing numerous units. The architecture of 2-layer network is illustrated schematically in Figure 1, where the hidden layer has Schematic diagram illustrates 2-layer neural network.




The mapping function for >2 layers neural network can be manipulated in the same way. A code package to calculate up to 4-layer neural network has been developed by the author.
is the weight factor between
is the bias for
is the activation of
is its activation function.
is the activation of output layer. The electrical conductivity
is the activation function of
.
The activation functions in Equations (1.2) and (1.4) are nonlinear and with output values between 0 and 1. Many other neural network models choose other functions such as
[14], which has value between −1 and 1, or Gaussian distribution [16]. It is important to normalise the data according to the format of activation functions. In the present model, all the electrical conductivity data are normalised to a value between 0 and 1. The numerical results are denormalised to get the true value.
Following the standard method in neural network calculation [18], the total difference is defined as

and
are the
. The overall target function is defined as [18]


is the learning rate. Equation (4) defines one of the learning methods that always works but unnecessarily the most effective method. In the following section, other learning methods will be used and compared with Equation (4). The analytical expression for each weight factor and bias can be obtained by back propagation of the partial difference of target function according to Equation (4).
Data assessment
The subsystems in the electrical conductivity database.
The range of parameters’ values in the database.
Neural network computation and results
The activation functions, as illustrated in Equations (1.2) and (1.4), have an output value between 0 and 1 for the activation between -∞ and +∞. However,
at
and
at
, which indicate an extremely slow approximation to either 0 or 1. Based on this consideration, the training data for electrical conductivity is not normalised by the true minimum and maximum values in the database but multiplied by 0.7 to the minimum value and 1.3 to the maximum value. The input parameters for composition and temperature are all normalised to a value between 0 and 1 to ensure every input parameter has the same weight of contribution. The denormalisation and normalisation procedure followed the following equations


and
are the normalised electrical conductivity and composition for
and temperature when
. The initial values for all the weight factors and biases are assigned to a random float value between −5 and 5. A high-quality random number generator was coded according to a probability theory developed by Marsaglia et al. [25].
In neural network calculation, it has been noted that the artificial learning by means of Equation (4) in every time iteration does not help to find the weight factors and biases to achieve minimum total differences between the target values and calculated value. The weight factors soon adjust their values to minimise the overall target function ( The evolution of total difference (
) rather than the total difference (
). To overcome this problem, the regularisation term (
) is not included in each time iteration but replaced at the final step assessment. It is also found that the convergence rate is almost doubled by the following artificial learning mechanism, which agrees with the suggestion from Rumelhart et al. [19]

and
, the evolution of total difference, regularisation term and target function vs iteration steps are demonstrated in Figure 2. It shows that that total difference drops sharply in the early stage (labelled by A), followed by slow drops (labelled by B) until a flat stage (labelled by C) to fluctuate around a minimum value. However, the regularisation term was increased slowly but monotonically until stage C. This is due to the early mentioned decision of not to include the minimisation of regularisation term in the time iteration. The target function has been reduced monotonically until the flat stage.

), regularisation term (
) and target function (
) vs. iteration steps for 2-layer neural network with hidden layer containing 16 units.
To determine the optimum number of units in the hidden layer in 2-layer neural network calculation, one has calculated the change of differences until 1.6 × 107 iteration steps for various number of units. The results are plotted in Figure 3. Although The evolution of total difference ( The optimised weight factors and biases obtained by artificial neural network calculations.
decreases when the number of units increases,
demonstrates some optimised values.
increases sharply when the number of units is away from the optimised one. The target function,
, reveals an optimised value for the number of units in the hidden layer of the 2-layer neural network. Based on the results,
is chosen. It is worth mentioning that the local minimum at
for the curve of
is out of expectation. To double check whether it is a numerical coincidence, the code and parameters were run at three different workstations but the results were very similar, given the fact that the initialisation of weight factors involves a random number generator which should be different at different computers. The smallest
appeared at 9,012,000th iteration step. The values for the weight factors and biases at this optimised condition are listed in Table 3. These values can be used to calculate the electrical conductivity of the system at any composition and temperature in the parameters’ range.

), regulation term (
) and target function (
) vs. number of units in hidden layer after 1.6 × 107 iteration steps.
The optimised weight factors and bias values have been implemented to calculate the electrical conductivity for 752 sets of compositions and temperature. The results are plotted in Figure 4 and compared with the values in the database. Figure 4(a) shows the comparison at linear scale coordinates. The 45° line indicates the perfect agreement between the artificial neural network computational results and the value in database, where majority data are from experimental measurement and the rest from assessment based on experimental values. The figure shows good agreement. Owing to the wide range distribution of the electrical conductivity values from 0.016 to 23.771, which across three orders of magnitude, the comparison in logarithmic scale is shown in Figure 4(b). The data shows some almost evenly distribution around the 45° line, majority with absolute error below 5%. The largest absolute discrepancy appears in the lowest electrical conductivity end, as is circled in Figure 4(b). Those data are found all belong to CaO-SiO2-Al2O3 subsystem at a temperature either in 1623 K or 1673 K, and was reported in one paper.
The electrical conductivity from numerical results by artificial neural network calculation vs. the value in the database: (a) in linear scale plotting, and (b) in logarithmic scale plotting.
To validate the artificial neural network calculations, the optimised weight factors and bias values have been implemented to calculate two binary systems CaF2-Al2O3 and Cao-CaF2 at different temperature. The results have been compared with the experimental results reported in various literature [6,26,27]. Figure 5 presents the results for (a) CaF2-Al2O3 at 1773K, (b) CaF2-Al2O3 at 1973K, (c) Cao-CaF2 at 1773K and (d) Cao-CaF2 at 1873K. These experimental data are not in database during the training of neural network. Figure 5 shows that the electrical conductivity obtained in the neural network calculations are within the fluctuation of various experimental measurements. It proves that the artificial neural network prediction for the electrical conductivity can be used to predict the change of electrical conductivity at various subsystems in different compositions and temperatures.

The artificial neural network and machine learning have many potential applications in steel metallurgy [28]. In the future, more works will be done to include other components to the system, such as NiO, TiO2, MgF2, BaF2, BaO, ZrO, CaS. The availability of the new experimental measurement method for electrical conductivity enables to get more accurate data in other systems [29], which will help to build up database for training and validation of the neural networks. The future work can, hopefully, also include the effort to use the data and machine learning method to identify the main oxides that control the electrical conductivity of mould flux and the influence of temperature on the electrical properties, and to compare the results with the theoretical predictions [7,8,15].
Conclusions
An electrical conductivity database for CaO-SiO2-Al2O3-NaO-K2O-MgO-CaF2-Cr2O3-FeO-MnO liquid mould slag system has been built up. The database has been implemented to train an artificial neural network. It is found that the two-layer neural network with 16 units in hidden layer provides the minimum difference in target function. The optimised weight factors and bias values can be used to calculate the electrical conductivity of the system in a wide range of compositions and temperature. The numerical prediction has been validated by the experimental results reported in literature. Excellent performance of artificial neural network derivation has been proved.
Footnotes
Acknowledgements
This work was financially supported by the European Commission RFCS (No. 847269) and Engineering and Physical Sciences Research Council in UK (No. EP/R029598/1).
Disclosure statement
No potential conflict of interest was reported by the author(s).
