IDEAS home Printed from https://ideas.repec.org/a/igg/jthi00/v12y2016i2p83-95.html
   My bibliography  Save this article

A Message Topic Model for Multi-Grain SMS Spam Filtering

Author

Listed:
  • Jialin Ma

    (Huaiyin Institute of Technology, Huaian and College of Computer and Information, Hohai University, Nanjing, China)

  • Yongjun Zhang

    (Huaiyin Institute of Technology, Huaian and College of Computer and Information, Hohai University, Nanjing, China)

  • Zhijian Wang

    (College of Computer and Information, Hohai University, Nanjing, China)

  • Kun Yu

    (Huaiyin Institute of Technology, Huaian, China)

Abstract

At present, content-based methods are regard as the more effective in the task of Short Message Service (SMS) spam filtering. However, they usually use traditional text classification technologies, which are more suitable to deal with normal long texts; therefore, it often faces some serious challenges, such as the sparse data problem and noise data in the SMS message. In addition, the existing SMS spam filtering methods usually consider the SMS spam task as a binary-class problem, which could not provide for different categories for multi-grain SMS spam filtering. In this paper, the authors propose a message topic model (MTM) for multi-grain SMS spam filtering. The MTM derives from the famous probability topic model, and is improved in this paper to make it more suitable for SMS spam filtering. Finally, the authors compare the MTM with the SVM and the standard LDA on the public SMS spam corpus. The experimental results show that the MTM is more effective for the task of SMS spam filtering.

Suggested Citation

  • Jialin Ma & Yongjun Zhang & Zhijian Wang & Kun Yu, 2016. "A Message Topic Model for Multi-Grain SMS Spam Filtering," International Journal of Technology and Human Interaction (IJTHI), IGI Global, vol. 12(2), pages 83-95, April.
  • Handle: RePEc:igg:jthi00:v:12:y:2016:i:2:p:83-95
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJTHI.2016040107
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jthi00:v:12:y:2016:i:2:p:83-95. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.