INCLUDE_DATA
Wed Sep 8, 10:45:29 UTC 2010



Posts Tagged ‘ai’

Would you participate in this kind of contest?

Tuesday, January 19th, 2010

I’m thinking, along w/ some people I’m working on a start-up with, of running a contest to help up “train” the back end “artificial intelligence engine” which is used in our software.

Here’s the gist: you would log-in to a website, and be presented w/ an “article” – this would be a blog, website, etc. you would then rate it as positive, negative, & so forth. Anyone who rated 1000 articles in a month (each takes about 1-2 seconds) would be eligible to win a prize, which would either be an xbox 360, or a playstation 3.

So: Three questions:

  • Would you do something like this?
  • Would you be more inclined to do it for an xbox or a ps3
  • If you would not be inclined, what could we change to make you more inclined to do it?

    Thanks so much!

  • Playing with Bayes

    Saturday, April 4th, 2009

    For the last day or so I’ve been playing with moving over my simple word-count-analysis of blogs to actually creating a database with manually ranked training data and extrapolating from that. There were some hiccups and I’ve still got to go back and replace a lot of code, but it’s effectively categorizing new blog entries based on previous rankings. YAY! I’ve been using the perl Algorithm::NativeBayes cpan module, and it’s pretty great – although the the documentation is really poor. The main thing to get is that it returns a hash reference, which means you end up referring to your result as something like:

    <pre>print “Sport’s ranking: \t ${$result}{sports} \n” ; </pre>

    Which, lets face it, is kinda ugly, but it’s really the only good way to do it, really. It should really be better documented, though. Aside from that, as long as you get the back-end math, you’re pretty OK. Just because you’re doing AI stuff doesn’t mean that you’re automatically familiar with how perl handles references to hashes, though. One of my to-do items is to go back and update the perldoc on it. Anyway, with that in effect I’ve gone and updated the database and I’m now able to get positivity over time. This means I’m actually getting closer to building an internet happiness index, and prediciting how “happy” the internet is as a whole. The next steps are:

    • incorporate new bayes functions into existing codebase
    • add more sources to the rss feedlist scripts
    • optimize the blogparser
    • put a nice (fusioncharts?) front end together
    • add more hardware for doing the catagorization
    • get more people to do more training data
    • ???
    • profit!

    Actually, the “???” is pretty well defined, but honestly, this project will have been fun even it it doesn’t make a dime. Anyway, one step closer.