新闻中心
网站首页   学会概况   学会规章   新闻中心   学术交流
社会服务   科学普及  计算机大赛   会员中心   联系方式
一键拨号
一键留言
会员中心
通知公告
青年学者学术报告《On Generalization and Implicit Bias of Gradient Methods in Deep Learning》
2019-10-24

计算机软件新技术国家重点实验室

要:

Deep learning has enjoyed huge empirical success in recent years. Although training a deep neural network is a highly non-convex optimization problem, simple (stochastic) gradient methods are able to produce good solutions that minimize the training error, and more surprisingly,  can generalize well to out-of sample data, even when the number of parameters is significantly larger than the amount of training data. It is known that changing the optimization algorithm, even without changing the model, changes the implicit bias, and also the generalization properties. What is the bias introduced by the optimization algorithms for neural networks? What ensures generalization in neural networks?  In this talk, we attempt to answer the above questions by proving new generalization bounds and investigating the implicit bias of various gradient methods.

报告人简介:

Jian Li is currently an associate professor at Institute for Interdisciplinary Information Sciences (IIIS, previously ITCS), Tsinghua University, headed by Prof. Andrew Yao. He got his BSc degree from Sun Yat-sen (Zhongshan) University, China, MSc degree in computer science from Fudan University, China and PhD degree in the University of Maryland, USA. His major research interests lie in algorithm design and analysis, machine learning, databases and finance. He co-authored several research papers that have been published in major computer science conferences and journals. He received the best paper awards at VLDB 2009 and ESA 2010. He is also a recipient of the "221 Basic Research Plan for Young Faculties" at Tsinghua University, the "new century excellent talents award" by Ministry of Education of China, and the National Science Fund for Excellent Young Scholars.

时间:1029  10:30-11:30

地点:计算机科学技术楼230

上一篇:2019江苏省大数据与人工智能高峰论坛第二轮通知
下一篇:青年学者学术报告《智能手机近超声特性与目标定位和动作追踪》
版权所有:江苏省计算机学会
苏ICP备14049275号-1