Informasi Umum

Kode

17.05.121

Klasifikasi

C -

Jenis

Karya Ilmiah - Thesis (S2) - Reference

Subjek

Text Mining

Dilihat

163 kali

Informasi Lainnya

Abstraksi

Semantic Argument Classification is the process of analyzing the sentence to investigate the pattern of WHO did WHAT to WHOM, WHEN, WHERE, WHY, HOW, from a structure text. Research on the classification of semantic arguments requires semantically labeled data in large numbers, called corpus. Because building a corpus is costly and time-consuming, recently many studies have used existing corpus as the training data to conduct semantic argument classification research on new domains without the need to build a new corpus for those new domains. This study carries on semantic argument classification on a new domain that is Quran English Translation by utilizing Propbank corpus as the training data. Previous studies have proven that there is a significant decrease in performance when classifying semantic arguments on different domain between the training and the testing data. The main problem is when there is a new argument that found in the testing data but it is not found in the training data. To recognize the new argument in the training data, extending the argument features in the training data to accommodate the new features of the new argument becomes one of the solutions. By using SVM Linear, the experiment has proven that augmenting the proposed features to the baseline system with some combinations option improve the performance of semantic argument classification on Quran data using Propbank Corpus as training data. When tested on auto labeled data, the augmentation of PTO+SP features to the baseline system improve the accuracy by 1.25% and F-1 score by 1.30%. When tested on hand-labeled data, the augmentation of combination PO+PTO features to the baseline system improve the accuracy by 0.47% and F-1 score by 0.40%.

  • CSG5G3 - DATA MINING LANJUT
  • CSG5H3 - TOPIK KHUSUS DALAM PENAMBANGAN DATA B
  • CSG613 - TESIS
  • CSH623 - TESIS
  • IEH6B6 - TESIS
  • CII733 - TESIS
  • IMI2B6 - TESIS
  • CII733 - TESIS
  • TTI7Z4 - TESIS
  • CII9H5 - PENELITIAN DISERTASI DAN SEMINAR 1
  • CII9J5 - PENELITIAN DISERTASI DAN SEMINAR 2
  • CII9L5 - PENELITIAN DISERTASI DAN SEMINAR 3
  • CII9I1 - PENULISAN PUBLIKASI ILMIAH 1
  • CII9K2 - PENULISAN PUBLIKASI ILMIAH 2
  • CII9M3 - PENULISAN PUBLIKASI ILMIAH 3

Koleksi & Sirkulasi

Tersedia 1 dari total 1 Koleksi

Anda harus log in untuk mengakses flippingbook

Pengarang

Nama DINA KHAIRA BATUBARA
Jenis Perorangan
Penyunting Arif Bijaksana, Adiwijaya
Penerjemah

Penerbit

Nama Universitas Telkom, S2 Magister Teknik Informatika
Kota Bandung
Tahun 2017

Sirkulasi

Harga sewa IDR 0,00
Denda harian IDR 0,00
Jenis Non-Sirkulasi