Comparative Analysis of Gradient Boosting and Transformer-Based Models for Binary Classification in Tabular Data
Jebaraj Vasudevan
Visa Inc., Atlanta, GA, USA
Abstract: This study compares the classification performance of Gradient Boosting (XGBoost) and a Transformer-based
model with multi-head self-attention on tabular data. While the two methods exhibit broadly similar overall performance, the
Transformer model achieves about 8% higher Recall, suggesting that it is better suited to applications such as
fraud detection in payment processing and medical diagnostics.
How to Cite: Jebaraj Vasudevan (2025). Comparitive Analysis of Gradient Boosting and Transformer Based Models for Binary
Classification in Tabular Data. International Journal of Innovative Science and Research Technology,
10(3), 466-470. https://doi.org/10.38124/ijisrt/25mar416
As evident from the table shown above, the models have very similar overall performance, consistent with what [2] observed in their results. However, the Transformer model outperforms the Boosting method in Recall on the positive examples by about 8%. This matters in scenarios where the cost of missing a true positive far outweighs the inconvenience or cost of incorrectly flagging a negative instance as positive. For instance, in medical diagnostics, such as screening for cancer or infectious diseases, failing to identify a diseased patient (a false negative) can have severe or even fatal consequences, whereas a false positive might lead to further testing that, while potentially anxiety-inducing and costly, is comparatively less harmful. Similarly, in fraud detection systems, overlooking a fraudulent transaction could result in substantial financial loss, making it preferable to flag more transactions for review even if some are false alarms. In these circumstances, the Transformer-based model can be preferred over Gradient Boosting (XGBoost).
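
The recall trade-off described above can be made concrete with a small cost calculation. The following is a minimal sketch, not the study's evaluation code: the confusion-matrix counts, the `Confusion` helper, and the 10:1 cost ratio are all hypothetical, chosen only to illustrate how a higher-recall model can come out ahead once missed positives are priced above false alarms.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Confusion:
    """Counts from a binary confusion matrix (hypothetical helper)."""
    tp: int
    fp: int
    fn: int
    tn: int

    def recall(self) -> float:
        # Share of actual positives that the model caught.
        return self.tp / (self.tp + self.fn)

    def precision(self) -> float:
        # Share of flagged instances that were truly positive.
        return self.tp / (self.tp + self.fp)

    def expected_cost(self, cost_fn: float, cost_fp: float) -> float:
        # Total misclassification cost under asymmetric error costs.
        return self.fn * cost_fn + self.fp * cost_fp


# Hypothetical counts, NOT the paper's results: model_b trades extra
# false positives for fewer false negatives (higher recall).
model_a = Confusion(tp=80, fp=10, fn=20, tn=890)
model_b = Confusion(tp=88, fp=18, fn=12, tn=882)

# Assumed costs: a missed positive is 10x as costly as a false alarm.
COST_FN, COST_FP = 10.0, 1.0

for name, m in (("model_a", model_a), ("model_b", model_b)):
    print(
        name,
        f"recall={m.recall():.2f}",
        f"precision={m.precision():.2f}",
        f"cost={m.expected_cost(COST_FN, COST_FP):.0f}",
    )
```

With these made-up counts, the higher-recall model has lower precision but also a lower total cost (138 versus 210). If both error types were priced equally, the two models would tie at a cost of 30, which is why the preference depends on how asymmetric the application's costs are.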
V. CONCLUSION
REFERENCES