Author
Listed:
- Stephanie Ng
(Deakin University)
- James Zhang
(Deakin University)
- Samson Yu
(Deakin University)
- Asim Bhatti
(Deakin University)
- Kathryn Backholer
(Deakin University)
- C. P. Lim
(Deakin University)
Abstract
Hansard, or the official verbatim transcripts of parliamentary debates, contains rich information for analysing discourse and political activities on a wide range of policy issues. A fundamental task in political text analysis is to predict whether a speaker takes on a positive or negative view about a debate topic. Unlike social media data, which has received extensive attention for political text mining, stance analysis on Hansard data remains understudied. The main distinctions between the two include longer text and context dependency related to a motion in the Hansard data. As a result, it is difficult to devise a text mining model for parliamentary debates based on existing studies of other applications. This raises the question of the generalisability of prominent methods for cross-domain classification under low-resourced data situations. To address this issue, we construct and compare various state-of-the-art natural language processing techniques and machine learning models for stance classification, using two benchmark datasets from the UK Hansard. To improve the model accuracy, a hybrid approach is designed, which leverages both text and numerical features in the classification process. The devised method achieves 15–20% improvement in accuracy compared to the baseline methods. Transfer learning of pre-trained language models is further investigated for political text representation and domain adaptation in a new stance classification task: Australian Hansard with debates focusing on the public health issue of obesity and related junk food marketing policies. Then, a feature augmentation technique is employed to optimise the learning model from the source domain for prediction on unseen test data in the target domain. This approach results in approximately 10% improvement in accuracy compared to those from the baseline methods. Finally, an error analysis is conducted to gain further insights into the devised model, which reveals the characteristics of commonly misclassified samples and suggestions for future work.
Suggested Citation
Stephanie Ng & James Zhang & Samson Yu & Asim Bhatti & Kathryn Backholer & C. P. Lim, 2025.
"Stance classification: a comparative study and use case on Australian parliamentary debates,"
Journal of Computational Social Science, Springer, vol. 8(2), pages 1-37, May.
Handle:
RePEc:spr:jcsosc:v:8:y:2025:i:2:d:10.1007_s42001-025-00366-y
DOI: 10.1007/s42001-025-00366-y
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jcsosc:v:8:y:2025:i:2:d:10.1007_s42001-025-00366-y. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.