Tao WANG Curriculum Vitae 1011 S 2nd Street Lafayette, IN 47905 USA tao.wang.cs@gmail.com http://cecs.anu.edu.au/˜twang mobile: +1 765 337 8206 SKILLS SUMMARY Management and Leadership: Excellent problem analysis and solving Excellent time management Effective organization skills e.g., leading a group of people Experience with conducting individual consultations as well as facilitating workshops Computer Skills: Familiar with Java, C/C++, and Matlab Good at Microsoft Office Word/PowerPoint/Excel Experience with Unix/Linux, Mac OS X, and various flavors of Windows Languages: Fluent in English Chinese (mother tongue) EMPLOYMENT HISTORY Research Fellow 2007–2008 Australian National University, Canberra, Australia System Analyst and Technical Consultant 1999–2000 DELL Computer (China) Co., Ltd., Shanghai, China Software Engineer 1998–1999 Telecommunication Technological Research Institute, Shanghai, China EDUCATION Ph.D. in Computing Science, University of Alberta, Canada 2007 Advisors: Dale Schuurmans and Michael Bowling Committee: Richard S. Sutton, Paul R. Messinger, and Doina Precup Thesis: New Representations and Approximations for Sequential Decision Making Finalist for the Department of Computing Science 2007 Ph.D. Thesis Award M.E. in Automatic Control, Northwestern Polytechnical University, China 1998 Thesis: Text-Independent Speaker Recognition Based on Continuous HMMs B.E. in Automatic Control, Northwestern Polytechnical University, China 1996 With University Honors ADDITIONAL PROFESSIONAL DEVELOPMENT Academic Leadership and Management (Australian National University) 2008 Microsoft Certified System Engineer (MCSE) 2000 Sun Certified Java Programmer (SCJP) 1999 Tao WANG Curriculum Vitae OUTREACH Program Committee Member The Twenty-Third Conference on Artificial Intelligence (AAAI) 2008 International Conference on Machine Learning and Cybernetics (ICMLC) 2007 Reviewer Neural Information Processing Systems (NIPS) 2007 International Conference on Machine Learning (ICML) 2005 and 2007 Journal of Adaptive Behavior 2004 IEEE Transactions on Systems, Man and Cybernetics 2002 and 2003 Councilor 2005–2006 Graduate Students’ Association of the University of Alberta Volunteer Team Leader International Conference on Intelligent Systems for Molecular Biology (ISMB) 2002 Volunteer Neural Information Processing Systems (NIPS) 2006 Grace Hopper Celebration of Women in Computing (GHC) 2006 IEEE International Conference on Robotics and Automation (ICRA) 2001 International Community Outreach Programs at University of Alberta 2001–2003 Ambassador of Univ. of Alberta for the World Track and Field Championships 2001 Media Attention CBC Radio, Canada 2007 Express News, University of Alberta, Canada 2007 CH Television News, Canada 2006 Victoria Times Colonist, Canada 2006 A-Channel Victoria, Canada 2006 AWARDS AND HONORS Best Student Paper Award 2007 At the 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning [?]. Scholarships and Awards SIGAR/AAAI Doctoral Consortium Scholarship 2006 Canberra Machine Learning Summer School Scholarship 2006 ICML 2005 Student Scholarship 2005 J Gordin Kaplan Graduate Student Award 2004 Sea Eagle Comprehensive Award 1997 Excellent Graduate Student Fellowship 1996–1998 Excellent Undergraduate Student Fellowship 1992–1996 Honors Third Prize, Shanghai Science & Technology Promotion 1999 Second Prize, Experimental Skill Competition in Physics 1994 October, 2008 2 Tao WANG Curriculum Vitae PUBLICATIONS Journal Articles [1] Adam Milstein and Tao Wang. Dynamic motion models in Monte Carlo localiza- tion. Integrated Computer-Aided Engineering, 14(3):243–262, 2007. [2] Tao Wang and Naiping Xu. Speaker recognition and its applications. Journal of Microprocessors, (4):50–53, November 1997. Top Peer Refereed Conferences [3] Tao Wang, Daniel Lizotte, Michael Bowling, and Dale Schuurmans. Stable dy- namic programming. In Proceedings of Advances in Neural Information Processing Systems 20 (NIPS), 2008. To appear (8 pages), [Acceptance Rate: 10% (poster spot- light)]. [4] Daniel Lizotte, Tao Wang, Michael Bowling, and Dale Schuurmans. Automatic gait optimization with Gaussian process regression. In Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI), pages 944–949, 2007. [Acceptance Rate: 35%]. [5] Tao Wang, Pascal Poupart, Michael Bowling, and Dale Schuurmans. Compact, convex upper bound iteration for approximate POMDP planning. In Proceedings of the Twenty-First National Conference on Artificial Intelligence (AAAI), pages 1245–1251, 2006. [Acceptance Rate: 22% (oral presentation)]. [6] Adam Milstein and Tao Wang. Localization with dynamic motion models: Deter- mining motion model parameters dynamically in Monte Carlo localization. In Proceed- ings of the Third International Conference on Informatics in Control, Automation and Robotics (ICINCO), pages 120–127, 2006. [Acceptance Rate: 10% (full paper and oral presentation)]. [7] Tao Wang, Daniel Lizotte, Michael Bowling, and Dale Schuurmans. Bayesian sparse sampling for on-line reward optimization. In Proceedings of the Twenty-second International Conference on Machine Learning (ICML), pages 961–968, 2005. [Ac- ceptance Rate: 27% (oral presentation)]. Book Chapters [8] C. Ronald Kube, Chris A. C. Parker, Tao Wang, and Hong Zhang. Biologically Inspired Collective Robotics, chapter 15. Recent Developments in Biologically Inspired Computing. Idea Group, 2004. ISBN: 159140313-8. Other Refereed Publications [9] Tao Wang, Michael Bowling, and Dale Schuurmans. Dual representations for dy- namic programming and reinforcement learning. In Proceedings of the 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, pages 44–51, April 2007. [Acceptance Rate: 61.5% (oral presentation)]. [10] Tao Wang and Hong Zhang. Collective sorting with multiple robots. In Pro- ceedings of the IEEE International Conference on Robotics and Biomimetics (ROBIO), pages 716–720, 2004. October, 2008 3 Tao WANG Curriculum Vitae [11] Tao Wang and Hong Zhang. Multi-robot collective sorting with local sensing. In Proceedings of the IEEE Intelligent Automation Conference (IAC), 2003. [12] Tao Wang, Juhua Shi, and Mario A. Nascimento. Experimental results towards content-based sub-image retrieval. In Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC), pages 230–235, 2002. Refereed Abstracts and Short Papers [13] Tao Wang. Action selection in Bayesian reinforcement learning. In Proceedings of the Twenty-First National Conference on Artificial Intelligence (AAAI), pages 1928– 1929, 2006. [Acceptance Rate: 37% (oral presentation)]. [14] Tao Wang, Michael Bowling, and Dale Schuurmans. Reinforcement learning with dual representations. NIPS Workshop on Towards a New Reinforcement Learning, 2006. [15] Daniel Lizotte, Tao Wang, Michael Bowling, and Dale Schuurmans. Gaussian process regression for optimization. NIPS Workshop on Value of Information in Infer- ence, Learning and Decision-Making, 2005. [16] Linli Xu, Li Cheng, Tao Wang, and Dale Schuurmans. Convex hidden Markov models. NIPS (Neural Information Processing Systems) Workshop on Advances in Structured Learning for Text and Speech Processing, 2005. [17] Tao Wang and Naiping Xu. Speaker recognition based on continuous Gaussian mixture HMMs. In Proceedings of the Second Conference of Graduate Students’ Aca- demic Reports, pages 412–415, June 1996. [18] Tao Wang. A multi-channels data acquisition and dynamic display system. In Proceedings of the First Conference of Graduate Students’ Academic Reports, pages 197–199, November 1994. Presentations and Demonstrations [1] Dual dynamic programming and reinforcement learning. Workshop on Modelling and Mining of Networked Information Spaces, MITACS (Mathematics of Information Technology and Complex Systems), December 2006. Banff, Canada. [2] Action selection for reinforcement learning and approximate POMDP planning. Game Theory and Decision Theory Seminar at University of British Columbia, De- cember 2006. Vancouver, Canada. [3] Quadratic approximation for POMDP planning. Logic Lab Seminar at Simon Fraser University, November 2006. Vancouver, Canada. [4] Bayesian reinforcement learning. Women in Machine Learning Workshop, October 2006. San Diego, USA. [5] Action selection in Bayesian reinforcement learning. AAAI/SIGART Doctoral Con- sortium, July 2006. Boston, USA. [6] Compact, convex upper bound iteration for approximate POMDP planning. Artifi- cial Intelligence Seminar at University of Alberta, June 2006. Edmonton, Canada. October, 2008 4 Tao WANG Curriculum Vitae [7] Bayesian action selection. The Sixteenth Annual Canadian Conference on Intelli- gent Systems, June 2006. Victoria, Canada. [8] Sony AIBO ERS-7 (robotic dog) technology demonstration. The Sixteenth Annual Canadian Conference on Intelligent Systems, June 2006. Victoria, Canada. [9] Bayesian sparse sampling. Banff Informatics Summit, Best Student Poster Award, September 2005. Banff, Canada. [10] Bayesian sparse sampling for on-line reward optimization. Artificial Intelligence Seminar at University of Alberta, July 2005. Edmonton, Canada. OTHER RESEARCH EXPERIENCE Stochastic Optimization, Canberra, Australia April–June 2007 Visiting Researcher. Investigated stochastic optimization algorithms with researchers in the Statistical Machine Learning program at National Information and Communications Technology Australia. Swarm Intelligence Based Robotics, Edmonton, Canada 2002–2004 Graduate Researcher. Designed strategies for controlling robots to achieve a represen- tative task—collective sorting—motivated by brood sorting in ants. The challenge is to design robotic behaviors that only depend on local sensing information in order to cluster objects of different types into piles. Speaker Recognition System Project, Xi’an, China 1996–1998 Project Leader. Designed and developed a speaker recognition system for police to an- alyze wiretaps. I proposed a new feature extraction method and built a speaker recog- nition system based on continuous Hidden Markov Models. The project was sponsored by the National Security Bureau of China and Aptronix Research Institute of Hainan Co., Ltd. User Interface Design for Heart Disease Inspection, Xi’an, China 1995 Undergraduate Researcher. Designed and developed the user interface for a heart dis- ease inspection system. TEACHING Lecturer Signal Processing in Fault-Tolerant Control Winter 1996 Computer Control and Computer Simulation Fall 1995 Lab Instructor User Interfaces and Software Design Fall 2001; Winter, Fall 2002 Java Programming Fall 2000; Winter, Fall 2003 Programming with Data Structures Winter 2001 PERSONAL INFORMATION Citizenship: Canadian Gender: Female October, 2008 5