{"id":204,"date":"2012-06-28T17:13:29","date_gmt":"2012-06-28T17:13:29","guid":{"rendered":"http:\/\/garysieling.com\/blog\/?p=204"},"modified":"2012-06-28T17:13:29","modified_gmt":"2012-06-28T17:13:29","slug":"a-brief-introduction-to-weka","status":"publish","type":"post","link":"https:\/\/www.garysieling.com\/blog\/a-brief-introduction-to-weka\/","title":{"rendered":"A brief introduction to Weka"},"content":{"rendered":"<p>Weka is a GPL data mining tool written in Java, published by the University of Waikato. It includes an extensive series of pre-implemented machine learning algorithms, including well known classification and clustering algorithms. If you&#8217;ve ever been curious how Bayes Theorem works, this is a great tool to get up and running.<\/p>\n<p><a href=\"http:\/\/172.104.26.128\/wp-content\/uploads\/2012\/06\/weka.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-206\" title=\"weka\" src=\"http:\/\/172.104.26.128\/wp-content\/uploads\/2012\/06\/weka.png\" alt=\"\" width=\"361\" height=\"248\" srcset=\"https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/weka.png 361w, https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/weka-300x206.png 300w\" sizes=\"(max-width: 361px) 100vw, 361px\" \/><\/a><\/p>\n<p>Weka uses a custom data format, called ARFF files (Attribute Relation File Format). This, in essence, specifies a table of data, along with a CSV style data listing. The data types are scaled down from a type of database: numerics, strings, dates, and nominal attributes (i.e. an equivalent to enumeration or pick list).<\/p>\n<p><a href=\"http:\/\/172.104.26.128\/wp-content\/uploads\/2012\/06\/explorer.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-209\" title=\"explorer\" src=\"http:\/\/garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-300x225.png\" alt=\"\" width=\"300\" height=\"225\" srcset=\"https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-300x225.png 300w, https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-768x576.png 768w, https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer.png 800w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><\/p>\n<p>You can connect to any database with a JDBC connection string, provided the appropriate jars are on the classpath. Weka ships with a file called\u00a0DatabaseUtils.props, which maps database types to the Weka types listed above.<\/p>\n<p><a href=\"http:\/\/172.104.26.128\/wp-content\/uploads\/2012\/06\/explorer-data1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-211\" title=\"explorer-data\" src=\"http:\/\/garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-data1-300x216.png\" alt=\"\" width=\"300\" height=\"216\" srcset=\"https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-data1-300x216.png 300w, https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-data1-768x553.png 768w, https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-data1.png 901w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><\/p>\n<p>Once you get some data in, you can try out different algorithms (and see how difficult it is to build predictive systems!)<\/p>\n<p><a href=\"http:\/\/172.104.26.128\/wp-content\/uploads\/2012\/06\/explorer-prediction.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-213\" title=\"explorer-prediction\" src=\"http:\/\/garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-prediction-300x216.png\" alt=\"\" width=\"300\" height=\"216\" srcset=\"https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-prediction-300x216.png 300w, https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-prediction-768x553.png 768w, https:\/\/www.garysieling.com\/blog\/wp-content\/uploads\/2012\/06\/explorer-prediction.png 901w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Weka is a GPL data mining tool written in Java, published by the University of Waikato. It includes an extensive series of pre-implemented machine learning algorithms, including well known classification and clustering algorithms. If you&#8217;ve ever been curious how Bayes Theorem works, this is a great tool to get up and running. Weka uses a &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/www.garysieling.com\/blog\/a-brief-introduction-to-weka\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;A brief introduction to Weka&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"om_disable_all_campaigns":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[5,6],"tags":[80,115,147,157,300,352,437,532,595],"aioseo_notices":[],"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/www.garysieling.com\/blog\/wp-json\/wp\/v2\/posts\/204"}],"collection":[{"href":"https:\/\/www.garysieling.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.garysieling.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.garysieling.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.garysieling.com\/blog\/wp-json\/wp\/v2\/comments?post=204"}],"version-history":[{"count":0,"href":"https:\/\/www.garysieling.com\/blog\/wp-json\/wp\/v2\/posts\/204\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.garysieling.com\/blog\/wp-json\/wp\/v2\/media?parent=204"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.garysieling.com\/blog\/wp-json\/wp\/v2\/categories?post=204"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.garysieling.com\/blog\/wp-json\/wp\/v2\/tags?post=204"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}