Authors
Maike Erdmann, Duc Dung Nguyen, Tomoya Takeyoshi, Gen Hattori, Kazunori Matsumoto, Chihiro Ono
Publication date
2012
Conference
PRICAI 2012: Trends in Artificial Intelligence: 12th Pacific Rim International Conference on Artificial Intelligence, Kuching, Malaysia, September 3-7, 2012. Proceedings 12
Pages
27-39
Publisher
Springer Berlin Heidelberg
Description
The abundance of information published on the Internet makes filtering of hazardous Web pages a difficult yet important task. Supervised learning methods such as Support Vector Machines can be used to identify hazardous Web content. However, scalability is a big challenge, especially if we have to train multiple classifiers, since different policies exist on what kind of information is hazardous. We therefore propose a transfer learning approach called Hierarchical Training for Multiple SVMs. HTMSVM identifies common data among similar training sets and trains the common data sets first, in order to obtain initial solutions. These initial solutions then reduce the time for training the individual training sets without influencing classification accuracy. In an experiment, in which we trained five Web content filters with 80% of common and 20% of inconsistently labeled training examples, HTMSVM was able to …
Total citations
201320142015221
Scholar articles
M Erdmann, DD Nguyen, T Takeyoshi, G Hattori… - PRICAI 2012: Trends in Artificial Intelligence: 12th …, 2012