Development of A Predictive Model For Minimizing Ladle Desulfurization Cycle Time and Associated Costs
Development of A Predictive Model For Minimizing Ladle Desulfurization Cycle Time and Associated Costs
Development of a Predictive Model for Minimizing Ladle Desulfurization Cycle Time and
Associated Costs
Keywords: Desulphurization, Online Process Control, Ladle Resource Optimization , Ridge Regression, Neural Network
INTRODUCTION
High Sulphur content is detrimental to the mechanical properties of steel. Though the degree of undesirability varies over
different grades, the general attempt is to minimize it and avoid the phenomenon of ‘hot shortness’. This is caused by the
formation of the eutectic Iron Sulphide (FeS). Segregating to the iron grain boundaries, Iron Sulphide decreases the inter-
granular strength. As a consequence, by raising the stress levels, it leads to processing related issues such as the formation of
cracks during solidification.
Desulphurization or sulphur removal is carried out at multiple stages in the steelmaking process, as depicted in Figure 1. The
primary source of Sulphur in the refining procedure is coking coal. A significant percentage (~75%) of the initial Sulphur is
removed by moving to the slag phase during reduction carried out in the Blast Furnace. Following the Blast Furnace
treatment, an exclusive hot metal desulphurization step is executed in the torpedo. Here, the reagents lime and/or calcium
carbide are injected via a lance with nitrogen as a carrier gas[2]. This aids in dropping the Sulphur percentage of the bath, to
approximately the desired level for several high-Sulphur tolerant grades but not all.
Figure 1. Sources and sinks of sulphur across the BF-BOF steelmaking route at Tata Steel Ijmuiden in 2015. [2]
Figure 2. Ladle furnace station showing gas stirred ladle supplied with electric arc heating and power supply systems (right)
and alloying addition systems (left) [12]
Ladle operators across the spectrum require some method to track the progress of a heat and take decisions accordingly.
Currently, they rely on heuristics and sparse sampling procedures to assess the steel bath. The objective of this paper is to put
forward a data-driven predictive model and help operators monitor the heat. The existing practice relies on sampling the heat
and determining the composition via chemical analysis. This method is both time-consuming and not precisely on-line. It
indicates the bath state at the time of sampling, which lags behind analysis by around 2- 4 minutes. The model suggested
helps to bridge this time lag and gain a reasonable understanding of the current bath state. Other compelling reasons for
deploying this model in steel plants include:
METHOD
where ( ) indicates species in the slag phase, and [ ] for species in the steel bath. In practice, this oxide ion is provided by the
Calcium Oxide from the slag. Sulphur dissolved in the bath displaces oxygen in lime during slag-metal emulsification driven
by argon purging. Hence, sulphur is absorbed by the slag as it forms Calcium Sulphide (CaS). From the Eq. (1) and (2), the
thermodynamics of the reaction is influenced by both the slag basicity and the bath’s oxygen potential. While higher basicity
(not too high either as it would affect slag fluidity) provides a higher driving force, a higher oxygen potential has a tendency
to re-sulphurize the bath. Therefore the following derived quantities were considered for determining their cumulative
influence on the desulphurisation process.
Basicity
where Λ is the optical basicity of the slag and Λi is the optical basicity of individual oxides and
Through Eq. (8), Bulko et al.[13] showed how, statistically, the sulphur distribution ration (Ls) decreases with rise in the
weight percentage of MnO, which signifies an increase in the oxygen potential of the bath. Bearing this in mind, we took into
consideration the weight of these oxides in the slag for including the role played by oxygen potential.
Rate Constant
From the past work, it is well known that the kinetics of the process is largely governed by the fluid flow phenomenon.
Argon stirring of the bath greatly increases the slag-metal interfacial area through slag entertainment, leading to enhanced
kinetics[14]. Accordingly in Figure 3, it is seen that the rate parameter (mass transfer capacity coefficient) is mostly a function
of stirring power (due to gas injection), despite the scatter in the plot, which can be attributed to the variations in steel grade
and mass of carry over slag.
Figure 3. Relationship between mass transfer capacity coefficient and effective stirring power [14]
Model Development
The predictive model developed is based on the 41 heat dataset from PhD thesis of Graham[14]. Further, Graham had
developed a rigorous kinetic model based on first principles to predict the trajectory of bath and slag components. He
computed those based on the argon purging profile, flux additions and the electrical energy provided via arcing. Here,
however, to further enhance the predictability and make it robust for multiple grades, a data-driven approach has been
adopted. The 41 heat data set was split into a training set comprising 32 heats and a test set comprising eight heats after
eliminating an outlier (based on the initial and final bath composition). This data was recorded at the Steel Melt Shop of
ArcelorMittal, Dofasco. Furthermore, the dataset is not unique to a particular grade of steel instead; it accommodates multiple
grades. An exploratory data analysis of the initial and final bath composition reveals the same.
Figure 5. Violin plots of the initial and final levels of aluminium and silicon in the steel bath
Figure 6. Violin plots of the initial levels of CaO, Al2O3, SiO2, MgO, MnO and FeO in slag.
Figure 7. Argon purging, flux addition and electrical heating energy profile [14]
Figure 8. (a) Element-wise composition details of the liquid steel bath (markers depict the sampling results throughout the
heat. (b) Composition of individual components of the slag[14]
Model Flow
As mentioned earlier an arbitrary set of eight heats was shortlisted and isolated from the training heats for testing the model's
performance. From the thirty-two training heats, six (~20%) were utilized for cross-validation. The sampling results of both
steel and slag phase were then piece-wise linearly interpolated for the 32 training heats giving us trajectories for training the
model. Following this interpolation procedure, each training set heat was sub-divided into one minute time steps giving
approximately 40 steps per heat i.e. around 1200 steps in total. Derived quantities listed above were computed (for all the
time steps) and appended to the set of predictors. Each predictor variable from the training set was then normalised (mean-
centred & standard deviation scaled) to address the distortion in the predictors’ vector space. Later however, normalisation
for the test heats, was carried out using the metrics (mean and standard deviation) generated from the training set.
The model was then trained based on two machine learning algorithms discussed in the 'Algorithm Selection' Section.
Dynamic prediction of the bath’s sulphur content necessitates an assessment of both the current time step and the previous
one for gauging the extent of desulphurization reaction. Marching with time, after training the model with 5-fold cross-
validation, the sulphur trajectory for test heats was predicted and augmented with every time step.
Beginning with the initial composition (bath and slag) for each test set heat, the model first computed the change in sulphur
level for the first minute. This value was used to calculate the current step’s final sulphur level and the next step’s initial
sulphur level, both of which are the same. Similarly, for the step that followed, change in sulphur was computed with the
initial sulphur value obtained from the step that preceded it. In this fashion, as the heat progressed, the model gave its
approximation of the path traced for determining the steel bath's sulphur level.
Algorithm Selection
Two different methods were used to predict the sulphur trajectory of the test heats: 1. Ridge Regression (Multivariate Linear
Regression with Regularization) and 2. Neural Networks.
Ridge Regression
Essentially, ridge regression is a multivariate-linear regression technique that is coupled with l1 regularization[17]. It penalizes
coefficients of predictor variables thereby assigning them weights based on their influence on the optimization process.
(9)
(10)
where, wi represents the coefficients of each of the features of the multivariate linear model, xij the value of the jth predictor
variable of the ith data point, yi the real value of the target,𝑦^ the predicted value of the target and λ, the regularization
parameter . So, while Eq. (9) gives the cost function for a simple linear model, Eq. (10) shows the method for computing the
cost function for ridge regression. The minimization of this cost function gives us an optimized set of coefficients for
regression.
Neural Networks
Based on the architecture of neural connections in our brains, neural networks are believed to be a simulation of the way
humans learn from experience. They are capable of capturing non-linearity in better ways than Ridge regression.
Ridge Regression
A grid of values of the regularization parameter lambda (λ) was created to ascertain the optimum value which gave minimum
average cross-validation scores. Figure 12(a) shows the variation in the CV scores with lambda (λ), described in Eq. (10).
On the other hand, Figure 12 (b) is a scatter plot of the predicted and real Sulphur differences. The sulphur differences plotted
are over all the one minute time steps of the test heats. The predicted Sulphur trajectories for some test heats are shown in
Figure 13.
Figure 12. (a) Variation of Cross-Validation Score with the regularization parameter. (b) Scatter plot of predicted and real
Sulphur differences of all 1min time steps from the test heats.
Figure 12(b) show the degree of accuracy of the model’s predictions on the test set. Best results were obtained with a value of
10-2(0.01) for the regularization parameter λ. Among the eights test heats chosen, predictions for the three depicted in Figure
13 gave the best results. Despite being reasonable in its approximations (average 5-fold cross-validation score of 0.66 for
sulphur differences over the test heats’ time steps), the trajectories computed by the model have certain shortcomings: 1.
They tend to show deviation (a higher gradient than expected) towards the end of the process and 2. The Sulphur differences
show a lag before sudden change in gradient. These errors can be attributed partially to the nature of the dataset. The forty
heats analysed are not only a few in number but also comprise a variety of steel grades. Information regarding the grade
Figure 13.Sulphur difference(1) over each time step and sulphur trajectory (predicted v/s real)(2) for heat numbers 23, 4 and
37 marked as a, b and c respectively as predicted by Ridge Regression
Neural Networks
As Table 2 and Figure 15 depict, an average 5-fold cross-validation score of 0.675 was obtained for sulphur differences over
the test heats’ time steps. This is marginally better than what we observed for Ridge Regression. Contrary to the conclusions
from the R2 values for the two models, we see that the trajectories are better than Ridge Regression. This should not misguide
us for the overall performance of the neural networks because the Sulphur curves shown here are for three best heats from an
arbitrary test set. They do not reflect the model’s accuracy over all 5 arbitrary test sets based on which the R2 value has been
computed.
Model R2_1 R2_2 R2_3 R2_4 R2_5 Avg CV Score R2 for Test set
Figure 14. Sulphur difference(1) over each time step and sulphur trajectory (predicted v/s real)(2) for heat numbers 23, 4 and
37 marked as a, b and c respectively as predicted by Neural Networks
Investigating the sulphur trajectories in Figure 14 further reveals that the model suffers from the same issues as the previous
learning technique. The predictions are more chaotic and higher than expected towards the end of a heat. There are two things
that could possibly be causing this. One reason here could be the fact that as a heat progresses, the driving force for
desulphurization reaction drops because of lower sulphur content in the bath. Since the change in sulphur values is modest,
the author's believe that it tends to the order of magnitude of the noise that this process is characterized by. As a consequence,
it gives fluctuating and relatively higher values. Another issue observed during exploratory data analysis was that some heats
Figure 15. Scatter plot of predicted and real Sulphur differences of all 1min time steps from the test heats.
Other than that, the neural networks were reasonably successful in predicting the sulphur differences over the time steps of a
heat despite their magnitudes being of the order of 10-4. Based on such qualified predictions, the model performed well in
capturing the shape of the sulphur curves. Here again, we believe that a larger dataset with additional features (flux addition
weights, quality of flux, grade of steel, etc.) can enhance the predictability of the model.
Economic Assessment
Integration of this online predictive model with the ladle operations system can help monitor and thus control the evolution of
a heat more reliably. For the operators, this model would assist them in ensuring the achievement of quality targets (based on
stringent composition demands) under minimum energy, material and production costs. Not only would the ladle run at
maximum productivity but also have indirect advantages in the logistics of the plant. Most importantly, the plant would not
have to bear the economic burden of downgrading due to a violation of the Sulphur target limits.
Based on the best results from the model, cost-benefit analyses were performed for a 4.5 mtpy steel melt shop having 3 x
150t BOF and 2 x 150t ladle furnaces. As per the current scenario, the absence of an online monitoring system often results in
higher re-processing time, frequent caster sequence miss, additional flux material, increased argon consumption, increased
electricity consumption(arcing), electrode wear due to longer use and higher percentages of off-grades. Considering the
following assumptions of: (a) 0.2% yield improvement in caster with higher sequence (b) average profit margin of 200
US$/ton of additional slab and (c) reduction of processing time by 2-3 minutes with an average of 40 minutes d)
improvement in average specific consumption for energy, argon, lime, electrode and refractory, the potential cost savings can
range from 5-7 million US$ per annum.
CONCLUSIONS
A dynamic predictive model has been developed for online monitoring of the desulphurisation process in a ladle. Machine
learning techniques, Ridge Regression and Neural Networks, were used for analysing a dataset from ArcelorMittal,
Dofasco[14]. For ensuring robustness of approach, the idea was to come up with a tool which can be utilised by operators for
exercising better control over the steel bath composition. The model, therefore, aims at eliminating intermediate sampling of
the bath for probing the progress of a heat. The model is found to give reasonably accurate predictions for sulphur differences
over one minute time steps despite their order of magnitude being as low as 10-4. Based on the precise computation of sulphur
differences, the model was successful in capturing the shape of sulphur trajectories.
In addition, upon integration with real-time operations, the suggested model holds tremendous potential to optimize ladle
heats by reducing the processing time, product off-grades and consumption of flux, argon and electricity. Economic
assessment showed that cumulatively these factors can contribute to significant savings on the production front.