玉帝和王母是什么关系| 大姨的女儿叫什么| 反酸烧心吃什么药| 乳香是什么东西| 糖尿病吃什么菜最好| 什么情况下需要做宫腔镜| 辗转是什么意思| 东莞有什么好玩的| 归脾丸治什么病| 天冬氨酸氨基转移酶高是什么原因| 火车为什么会晚点| 心内科全称叫什么| 换手率是什么意思| 8月12号是什么星座| 沈阳六院主要治什么病| 体脂是什么| 人丹是什么| 女票什么意思| 托孤是什么意思| 立冬是什么时候| 钾低会出现什么症状| 牙龈充血是什么原因| 追龙什么意思| 女人吃什么补气血效果最好| 肾透析是什么意思| 崩漏下血是什么意思| ubras是什么牌子| 为什么大便会拉出血| 米酒是什么酒| 什么是有机奶粉| 人为什么会长智齿| 上火流鼻血是什么原因| 什么是向量| 黄芪配升麻有什么作用| 天花是什么病| 渗透压低是什么原因| ghz是什么单位| 什么是功能性消化不良| 周瑜为什么打黄盖| 男人都喜欢什么样的女人| 祛湿是什么意思| 四不像是指什么动物| 肾气不足吃什么药好| 吃薄荷叶有什么好处和坏处| 一代明君功千秋是什么生肖| 肌肉萎缩是什么症状| 孺子可教什么意思| 诗经又称什么| 澳门什么时候回归| 人生得意须尽欢是什么意思| 失能是什么意思| 什么是男人| 烧包是什么意思| 口臭要做什么检查| 哑巴是什么生肖| 高潮是什么感觉| 上海青是什么菜| 晕车为什么读第四声| 阴道吹气是什么原因| 辗转是什么意思| 麝牛是什么动物| 什么一梦| 得了艾滋病有什么症状| 生理盐水和食用盐水有什么区别| 多吃黄瓜有什么好处| 金银花泡水喝有什么好处| ms是什么单位| 颈椎间盘突出有什么症状| 取环后吃什么恢复子宫| 含蓄是什么意思| hope是什么意思啊| 呼吸音粗是什么原因| 拉肚子出血是什么原因| 什么是传染性软疣| 金字旁有什么字| 肚子拉稀是什么原因| 肺主治节是什么意思| 平均红细胞体积偏高是什么原因| 眼泡是什么意思| 肛周脓肿什么症状| 部署是什么意思| 大是大非是什么意思| 粒字五行属什么| chevy是什么车| 类风湿忌吃什么| 握手是什么意思| 哔哩哔哩会员有什么用| 莫名其妙的心情不好是什么原因| 什么是芡实| 头眩晕看什么科| 洛神是什么意思| 诗和远方是什么意思| 梦见豆腐是什么意思| 为什么会晕3d| 麒麟儿是什么意思| 现在买什么股票好| 茯苓泡水喝有什么功效| orf是什么意思| 半熟芝士是什么意思| 衄血是什么意思| 白马王子是什么意思| 双什么意思| 5月24日是什么星座| 拔掉智齿有什么影响| 喝牛奶就拉肚子是什么原因| 什么是动态口令| 小孩吐奶是什么原因| 眼睛模糊用什么眼药水| 感染幽门螺杆菌吃什么药| 多发肿大淋巴结是什么意思| 日希是什么字| 什么心所什么| 西瓜汁加什么好喝| 益生菌吃了有什么好处| 宫颈萎缩意味着什么| 颔是什么意思| 三十三天都是什么天| 年兽叫什么| 脂肪肝用什么药物治疗| 耳朵蝉鸣是什么原因引起的| 池塘里有什么| 肾阳虚吃什么药| 盆腔炎吃什么消炎药效果好| 确认妊娠是什么意思啊| 最里面的牙齿叫什么| 扬州有什么好玩的地方| 脚底发凉是什么原因| 什么是远视眼| 装垃圾的工具叫什么| 弥漫什么意思| 正月初一是什么节日| 烂好人是什么意思| 商字五行属什么| 阳春三月指什么生肖| 瞅瞅是什么意思| 小孩手上脱皮是什么原因| 甲状腺密度不均匀是什么意思| 行房时硬度不够是什么原因| 二月十九是什么星座| 咖啡什么牌子的好| 什么人不适合喝骆驼奶| 肚脐眼上面疼是什么原因| 强迫症是什么意思| 什么屁股摸不得| 低密度脂蛋白高的原因是什么| 995是什么意思| 三竖一横念什么| 急性尿道炎吃什么药| 中老年人补钙吃什么牌子的钙片好| 幽门螺杆菌什么药最好| 得了破伤风是什么症状| 91年出生的属什么| 骨相美是什么意思| legacy什么意思| 啊囊死给什么意思| 女生什么时候绝经| 防晒衣的面料是什么| 妈妈的爷爷叫什么| 什么是抗阻运动| 饭票是什么意思| 农历五月二十四是什么星座| 眩晕看什么科| 为什么会上火| 打牛是什么意思| 移花接木的意思是什么| 省政协常委是什么级别| 空调什么牌子的好| 上朝是什么意思| 静态纹用什么除皱| 神奇的近义词是什么| 经常困想睡觉是什么问题| 宫颈ecc是什么意思| 青口是什么东西| maga是什么意思| 洗手做羹汤是什么意思| 喝什么能补肾| 女攻是什么意思| 什么是血管瘤| 什么是非甾体抗炎药| 一视同仁什么意思| 手指发麻什么原因| 90年属什么| 长痔疮有什么症状| 陕西有什么特产| 老爹鞋适合什么人穿| 煮玉米放盐起什么作用| 小孩掉头发是什么原因引起的| 两个a型血的人生的孩子什么血型| 什么水果含维c最多| 眼睛老是肿着是什么原因造成的| 孕妇上火了吃什么降火最快| 交替脉见于什么病| 标准的青色是什么颜色| 阴虚火旺吃什么食物好| 嘴唇发白是什么原因引起的| 白无常叫什么名字| 奥利司他排油是什么油| 男生下面叫什么| 煮羊肉放什么调料| 送老人什么礼物最好| 6.30是什么星座| 新生儿什么时候上户口| 大便真菌阳性说明什么| vd是什么| 胆囊小是什么原因| 脑白质脱髓鞘改变是什么意思| 央行放水是什么意思| 肠易激综合症用什么药能治好| 三个降号是什么调| 脚有酸臭味是什么原因| 头发有什么用处| 肚脐眼周围痛什么原因| 柒牌男装什么档次| 吃了避孕药有什么副作用| 胃不舒服想吐吃什么药| 小孩嘴唇发白是什么原因| 干什么挣钱快| 做肠镜要做什么准备| 什么原因造成耳鸣| 火龙果是什么颜色| 什么是动脉瘤| 5.23是什么星座| 日本为什么投降| 烫伤起水泡涂什么药膏| 丹毒用什么抗生素| dx是什么意思| 药引子是什么意思| psv医学是什么意思| 双向情感障碍是什么病| 跟着好人学好人下句是什么| 白细胞数目偏高是什么意思| 血糖高什么水果可以吃| 老婆饼是什么馅| 太白金星叫什么| 小暑吃什么| 冰激凌和冰淇淋有什么区别| 生眼屎是什么原因引起的| 三什么九什么成语| 5年存活率是什么意思| 性病有什么症状| 清宫和无痛人流有什么区别| 幼稚细胞是什么意思| 尿检蛋白质弱阳性是什么意思| 为什么想吐却吐不出来| 图谋不轨什么意思| 宝宝缺钙吃什么补得快| 疾控中心是干什么的| c2是什么车型| 白茶是什么茶| 什么是道| 梦见上楼梯是什么意思| 什么情况下做试管婴儿| pwi是什么意思| 31岁属什么生肖| 悸是什么意思| 大小脸去医院挂什么科| 基底是什么意思| 安痛定又叫什么| 什么是电子邮件地址| 儿童感冒流鼻涕吃什么药好得快| 尿是褐色的是什么原因| 前列腺钙化有什么影响| 总警司相当于大陆什么官| 立秋当天吃什么| 百度
 

新时代新涿州--河北频道--人民网

百度   但对于另一些科学家来说,“备份大脑”不过是超人主义者们“绝望的虚假幻想”。

Top researcher Pedro Domingos on useful maxims for Data Mining, Machine Learning as the Master Algorithm, new type of Deep Learning called sum-product networks, Big Data and startups, and great advice to young researchers.



By Gregory Piatetsky, @kdnuggets, Aug 19, 2014.

Pedro DomingosThis is the second part of my interview with Prof. Pedro Domingos, a leading researcher in Machine Learning and Data Mining, winner of ACM SIGKDD 2014 Innovation Award, widely considered the Data Mining/Data Science "Nobel Prize".

Here is the first part: Interview: Pedro Domingos, Winner of KDD 2014 Data Mining/Data Science Innovation Award.

Many of Prof. Domingos award winning research ideas are implemented in software which is freely available, including
 
To learn more about his research, here are some of his most cited papers via Google Scholar and Citeseerx.

Gregory Piatetsky: Q7. You published a very good article "A few useful things to know about Machine Learning" which lists 12 key observations. Are there a few additional ones that you would add for data mining / data science ?

Pedro Domingos: Yes!   
  • Data is either curated or decaying; minding the data is as important as mining it.
  • Every number has a story, and if you don't know the story, you can't trust the number.
  • Model the whole, not just the parts, or you may miss the forest for the trees.
  • Tame complexity via hierarchical decomposition.
  • Your learner's time and space requirements should depend on the size of the model, not the size of the data.
  • The first job you should automate is yours; then you can mine a thousand things in the time it took you to mine one.

 
There's many more, and I'll have more to say about some of these in my award talk at KDD-2014.

GP: Q8. When you were visiting MIT CSAIL Lab in 2013, you were working on a new book. Can you tell us about this book? What other work you did there as a visiting scientist?

PD: It's a popular science book about machine learning and big data, entitled "The Master Algorithm: Machine Learning and the Big Data Revolution."

It's almost done, and will come out in 2015. The goal is to do for data science what "Chaos" did for complexity theory, or "The Selfish Gene" for evolutionary game theory: introduce the essential ideas to a broader audience, in an entertaining and accessible way, and outline the field's rich history, connections to other fields, and implications.

Now that everyone is using machine learning and big data, and they're in the media every day, I think there's a crying need for a book like this. Data science is too important to be left just to us experts! Everyone - citizens, consumers, managers, policymakers - should have a basic understanding of what goes on inside the magic black box that turns data into predictions.

MIT Frank Gehry Building At MIT I worked with Josh Tenenbaum on a joint research project we have. The goal is to be able to go all the way from raw sensor data to a high-level understanding of the situation you're in, with Markov logic as the glue that lets all the pieces come together. Josh is a cognitive scientist, and his role in the project is to bring in ideas from psychology. In fact, one of the funnest parts of my sabbatical was to hang out with computer scientists, psychologists and neuroscientists - there's a lot you can learn from all of them.

GP: Q9. What are the major research directions on which you are working currently?

PD: I'm working on a new type of deep learning, called sum-product networks. SPNs have many layers of hidden variables, and thus the same kind of power as deep architectures like DBMs and DBNs, but with a big difference: in SPNs, the probabilistic inference is always tractable; it takes a single pass through the network, and avoids all the difficulties and unpredictability of approximate methods like Markov chain Monte Carlo and loopy belief propagation. As a result, the learning itself, which in these deep models uses inference as a subroutine, also becomes much easier and more scalable.

Sum-product networks, a new type of deep learning


The "secret sauce" in SPNs is that the structure of the network is isomorphic to the structure of the computation of conditional probabilities, with a sum node where you need to do a sum, and a product node where you need to do a product.


In other deep models, the inference is an exponentially costly loop you have to wrap around the model, and that's where the trouble begins. Interestingly, the sums and products in an SPN also correspond to real concepts in the world, which makes them more interpretable than traditional deep models: sum nodes represent subclasses of a class, and product nodes represent subparts of a part. So you can look at an SPN for recognizing faces, say, and see what type of nose a given node models, for example.

I'm also continuing to work on Markov logic networks, with an emphasis on scaling them up to big data. Our approach is to use tractable subsets of Markov logic, in the same way that SQL is a tractable subset of first-order logic.

One of our current projects is to build something akin to Google's knowledge graph, but much richer, based on data from Freebase, DBpedia, etc. We call it a TPKB - tractable probabilistic knowledge base - and it can answer questions about the entities and relations in Wikipedia, etc. We're planning to make a demo version available on the Web, and then we can learn from users' interactions with it.

GP: Q10. Big Data and Machine Learning are among the hottest tech areas, and many researchers in data mining and machine learning have been involved in start-ups. Have you considering start-ups and why have you not started a company?

Startup PD: That's what my wife keeps asking me. Seriously, I do think there's a startup in my future. There are two reasons I haven't done it yet. First, I want to do a startup that's based on my research, and in the last decade my research has been fairly long-term. This means there's a longer arc until it's ready for deployment, but hopefully when it is the impact is also larger.

Second and related, I want to do a startup that has at least the potential to be world-changing, and many stars have to align for that to happen. I often see colleagues do a startup without giving much thought to all the non-technical issues that are even more important than the technical ones, which is not a recipe for success. In the data science space, it's rare for a startup to be a complete failure, just because the acqui-hire value of a company is so high, but if that's all you wind up with then maybe it wasn't the greatest use of your time.

GP: Q11. What is your opinion on "Big Data" boom - how much is hype and how much is reality? Is there a Machine Learning "boom" going on now? (Note: Gartner latest "Hype Cycle" report has "Big Data" in the trough of disillusionment).

PD: There's a fair amount of hype, but at heart the big data boom is very real. I like the "army of ants" metaphor: it's not that any single big data project will drastically change your bottom line - although it does on occasion - but that when you add up all the places where data analysis can make a difference, it really is transformative. And we're still only scratching the surface of what can be done. The bottleneck really is the lack of data scientists.

Machine learning is booming along with big data, because if data is the fuel and computing is the engine, machine learning is the spark plugs.

To date machine learning has been less of a meme in industry or the public's mind than data mining, data science, analytics or big data, but even that is changing.
I think the term "machine learning" has a longer half-life than "data science" or "big data," and that's good, because there's progress to be made in both the short and the long term.


GP: Q12. What advice would you give to young researchers interested in Machine Learning, Data Mining, Data Science?

Advice PD:
Swing for the fences in everything you do; incremental research is not worth your time.


Learn everything you can, but don't necessarily believe any of it; your job is to make some of those things outdated.

Don't be intimidated by all the math in the textbooks; in this field, the math is a servant of the data, not the other way around.

Listening to the data - doing experiments, analyzing the results, digging deeper, following up on surprises - is the path to success.

If you're not confused and flailing most of the time, the problem you're tackling is probably too easy.

Talk continually with people from not just one company or industry, but many, and try to figure out what problems they have in common. That way you know you'll have a lot of impact if you solve one of them.

Read widely, but with a view to the research problems you care about; the greatest insights often come from putting previously separate things together.

Work with tomorrow's computing power in mind, not today's.

Beware of hacking; a hack feels clever, but it's the opposite of a general solution.

Complexity is your greatest enemy. Once you think you've solved a problem, throw out the solution and come up with a simpler one. Then do it again.

And of course, have fun - no field has more scope for it than this one.

GP: Q13. What do you like to do in your free time, when away from a computer? What book have you read and liked recently?

PD: I like to read books and listen to music. I'm a movie buff, and I enjoy traveling. My tastes in all of these things are pretty eclectic. I'm also a swimmer and long-distance runner. And, most of all, I spend time with my family.

A fascinating book I've read recently is The Scientist in the Crib "The Scientist in the Crib: What Early Learning Tells Us About the Mind," by Alison Gopnik, Andy Meltzoff and Pat Kuhl. Infants and small children go through an amazing series of learning stages, assembling piece by piece the consciousness we adults take for granted. I can't help thinking that the answers to a lot of our questions in machine learning are right there in the baby's mind, if only we can decode them from the often-astonishing experimental observations that Gopnik and Co. summarize in the book.

On the fiction side, the best book I've read recently is probably "The Road," by Cormac McCarthy. It's about a father and son trying to survive in a post-apocalyptic world, and it's a powerful, unforgettable book.

BIO: Pedro Domingos is Professor of Computer Science and Engineering at the University of Washington. His research interests are in machine learning, artificial intelligence and data mining. He received a PhD in Information and Computer Science from the University of California at Irvine, and is the author or co-author of over 200 technical publications.

He is a member of the editorial board of the Machine Learning journal, co-founder of the International Machine Learning Society, and past associate editor of JAIR. He was program co-chair of KDD-2003 and SRL-2009, and has served on numerous program committees. He is a winner of the SIGKDD Innovation Award, the highest honor in the data mining field. He is a AAAI Fellow, and received a Sloan Fellowship, an NSF CAREER Award, a Fulbright Scholarship, an IBM Faculty Award, and best paper awards at several leading conferences.

Related:



狗眼屎多是什么原因 东成西就是什么生肖 胃不消化吃什么药 胃酸过多吃什么食物好 吃叶酸有什么好处
胸口不舒服挂什么科 vs什么意思 阴道痒用什么药好 潘粤明老婆现任叫什么 鼻窦炎用什么药好
肺部不好有什么症状 为什么小腹隐隐作痛 枸杞什么时候吃最好 什么是米其林 swag什么意思
病毒性感染是什么原因 蚊子怕什么味道 莫名心慌是什么原因 什么叫做犯太岁 鼻息肉长什么样子图片
什么叫自慰hcv8jop0ns0r.cn 小孩感冒发烧吃什么药hcv9jop1ns7r.cn 黄柏胶囊主要治什么病youbangsi.com lo娘是什么意思hcv9jop7ns2r.cn 骨感是什么意思helloaicloud.com
梦见鬼是什么意思hcv8jop6ns6r.cn gm墨镜是什么牌子hcv9jop0ns3r.cn 做什么生意hcv9jop0ns3r.cn 脚趾抽筋是什么原因引起的hcv9jop0ns7r.cn ts和cd有什么区别hcv8jop9ns4r.cn
男宝胶囊为什么不建议吃huizhijixie.com 春天有什么植物hcv7jop5ns4r.cn 抑郁症是什么病aiwuzhiyu.com 世界上最难的数学题是什么hcv9jop1ns1r.cn 带状疱疹挂什么科hcv9jop2ns6r.cn
什么是铅中毒hcv9jop0ns9r.cn 人间仙境是什么意思hcv8jop4ns4r.cn 肝火旺吃什么降火最快xianpinbao.com 2012年是什么命hcv7jop9ns9r.cn 归脾丸和健脾丸有什么区别hcv7jop6ns1r.cn
百度