Friday, May 3, 2013

Naive Bayes: a simple classifier







      The purpose of this blog post is to introduce a probabilistic classifier called “Naive Bayes,” which is usually implemented in software and is used for pattern recognition within a data set. Most of my understanding for this post comes from this video.



   He begins the video by explaining the structure of the data set needed to apply the Naïve Bayes classifier. According to the video, the data should be set up in the following form.
$D = ((x^{(1)}, y^{(1)}), \ldots, (x^{(n)}, y^{(n)}))$

D is an algebraic expression for the data set. To make that clearer, $x^{(1)}$ represents a coordinate vector $(x_1^{(1)}, \ldots, x_d^{(1)})$, where the superscript shows which point the coordinate belongs to and the subscript indexes the coordinate within that point. Each $x^{(i)}$ is a point in the space $\mathbb{R}^d$, and each $y^{(i)}$ belongs to some finite set. In the video, he states that this finite set will be the integers from 1 to n.
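To make the notation concrete, here is a minimal sketch of how such a data set could be laid out in Python. The numbers, the two class labels, and the names `xs`, `ys`, and `coordinate` are made up for illustration and do not come from the video.

```python
# Hypothetical toy data set with n = 4 points in R^d, d = 2.
xs = [(1.0, 2.0), (1.5, 1.8), (5.0, 8.0), (6.0, 9.0)]
# One label per point, drawn from a finite set of classes (here {1, 2}).
ys = [1, 1, 2, 2]

# D = ((x^(1), y^(1)), ..., (x^(n), y^(n)))
D = list(zip(xs, ys))

n = len(D)          # number of points
d = len(D[0][0])    # number of coordinates per point


def coordinate(i, j):
    """Return x_j^(i): coordinate j of point i (1-based, as in the math)."""
    return D[i - 1][0][j - 1]
```

The superscript picks the point and the subscript picks the coordinate, so `coordinate(3, 2)` reads the second coordinate of the third point.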
He mentions in the video that several assumptions are made when taking a Naïve Bayes approach to classification. Those assumptions are as follows.
1)      We assume we have a family of distributions parametrized by Θ with the following properties. Each of these is a joint distribution on x and y, where x is in $\mathbb{R}^d$ and y is a class.
2)      $P_\Theta(x, y) = P_\Theta(x \mid y)\, P_\Theta(y) = P_\Theta(x_1 \mid y) \cdots P_\Theta(x_d \mid y)\, P_\Theta(y)$
He mentions in the video that this second assumption is key to the Naïve Bayes classification approach. What it essentially means is that the first expression in assumption 2, $P_\Theta(x \mid y)\, P_\Theta(y)$, factors to equal the second expression. What I draw from this is that we assume $P_\Theta(x \mid y)$ factors into the product of the probability of the first coordinate of x given y, through the probability of the last coordinate of x given y.
3)      Assume that the points are independent and identically distributed according to the parameter Θ. He mentions that in this context, the coordinates $x_1, \ldots, x_d$ are independent given y if $(X, Y) \sim P_\Theta$.
He finally makes things a bit clearer at 7:35 in the video, where he mentions that the main assumption is a conditional independence assumption: once y is known, the coordinates of x are independent of one another.
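The factorization in assumption 2 can be checked on a small example. The sketch below assumes three binary coordinates and two classes, with made-up probabilities standing in for Θ; the tables `prior` and `cond` and the function `joint` are hypothetical names, not something from the video.

```python
from itertools import product

# Hypothetical parameters for d = 3 binary coordinates and 2 classes.
prior = {1: 0.6, 2: 0.4}              # P(y)
cond = {1: [0.9, 0.2, 0.5],           # cond[y][j] = P(x_j = 1 | y)
        2: [0.1, 0.7, 0.5]}


def joint(x, y):
    """P(x, y) = P(x_1 | y) ... P(x_d | y) P(y) under conditional independence."""
    p = prior[y]
    for j, xj in enumerate(x):
        pj = cond[y][j]
        p *= pj if xj == 1 else 1.0 - pj
    return p


# Summing the joint over every (x, y) should give 1, up to rounding.
total = sum(joint(x, y) for y in prior for x in product([0, 1], repeat=3))
```

Each coordinate contributes its own factor that depends only on y, which is exactly what the conditional independence assumption allows.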

At this point, he explains the “goal” of Naïve Bayes. Essentially, when some new x arrives, we want to “predict” its y. The algorithm begins by estimating the parameter Θ of the distribution that the (x, y) pairs are believed to follow. Θ is estimated from the data, and we then compute the prediction of y by maximizing, over all possible classes, the probability of that class given the new x. Since the probability of x alone does not depend on y, this is the same as maximizing PΘ(x|y) PΘ(y). Because we assume that PΘ(x|y) PΘ(y) factors into PΘ(x1|y) … PΘ(xd|y) PΘ(y), we evaluate this product for each candidate class y and predict the class that gives the largest value. For a better understanding, please watch the video.
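The two steps described here, estimating Θ and then maximizing over classes, can be sketched in a few lines of Python. This is only an illustration under an assumption of my own: each coordinate is modeled as Gaussian within each class, which is one common choice and not something taken from the video. The function names and the toy data are also made up.

```python
import math
from collections import defaultdict


def fit(data):
    """Estimate theta from (x, y) pairs: a prior per class, plus a mean and
    variance per class and coordinate (Gaussian assumption, for illustration)."""
    by_class = defaultdict(list)
    for x, y in data:
        by_class[y].append(x)
    theta = {}
    for c, points in by_class.items():
        d = len(points[0])
        means = [sum(p[j] for p in points) / len(points) for j in range(d)]
        # A small constant keeps every variance strictly positive.
        variances = [
            sum((p[j] - means[j]) ** 2 for p in points) / len(points) + 1e-9
            for j in range(d)
        ]
        theta[c] = (len(points) / len(data), means, variances)
    return theta


def log_joint(theta, x, c):
    """log P(y = c) + sum over j of log P(x_j | y = c)."""
    prior, means, variances = theta[c]
    score = math.log(prior)
    for xj, m, v in zip(x, means, variances):
        score += -0.5 * math.log(2 * math.pi * v) - (xj - m) ** 2 / (2 * v)
    return score


def predict(theta, x):
    """Return the class that maximizes P(x_1 | y) ... P(x_d | y) P(y)."""
    return max(theta, key=lambda c: log_joint(theta, x, c))


# Hypothetical training data: two well-separated classes in R^2.
data = [((1.0, 2.0), 1), ((1.5, 1.8), 1), ((1.2, 2.2), 1),
        ((5.0, 8.0), 2), ((6.0, 9.0), 2), ((5.5, 8.5), 2)]
theta = fit(data)
label_near_first = predict(theta, (1.1, 2.0))
label_near_second = predict(theta, (5.8, 8.7))
```

Working with sums of logarithms instead of the raw product gives the same maximizing class and avoids very small numbers when d is large.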
