All publications by year: 2012, 2011, 2010, 2009, 2008, 2005-2007, earlier.

Others: videos, workshop papers, seminars/talks, demos.


SELECTED PAPERS

  • Learning Object Arrangements in 3D Scenes using Human Context,
    Yun Jiang, Marcus Lim, Ashutosh Saxena.
    To appear in International Conference of Machine Learning (ICML), 2012. [PDF] bibtex

    @inproceedings{jiang2012humancontext,
      title={Learning Object Arrangements in 3D Scenes using Human Context},
      author={Yun Jiang and Marcus Lim and Ashutosh Saxena},
      year={2012},
      booktitle={ICML}
    }
    
  • Learning to Place New Objects in a Scene,
    Yun Jiang, Marcus Lim, Changxi Zheng, Ashutosh Saxena.
    In International Journal of Robotics Research (IJRR), 2012. [PDF, ijrr-pdf, more] bibtex

    @article{jiang2012placingobjects,
      title={Learning to Place New Objects in a Scene},
      author={Yun Jiang and Marcus Lim and Changxi Zheng and Ashutosh Saxena},
      year={2012},
      journal={IJRR}
    }
    
    
    (IJRR is ranked 1/17 in robotics by Journal Citation Reports, 2010.)
  • Learning the Right Model: Efficient Max-Margin Learning in Laplacian CRFs,
    Dhruv Batra, Ashutosh Saxena.
    To appear in Computer Vision and Pattern Recognition (CVPR), 2012. [PDF, supplementary material] bibtex

    @inproceedings{laplaciancrfs_cvpr2012,
      title={Learning the Right Model: Efficient Max-Margin Learning in Laplacian CRFs},
      author={Dhruv Batra and Ashutosh Saxena},
      year={2012},
      booktitle={CVPR}
    }
    
  • Semantic Labeling of 3D Point Clouds for Indoor Scenes,
    Hema Koppula, Abhishek Anand, Thorsten Joachims, Ashutosh Saxena.
    In Neural Information Processing Systems (NIPS), 2011. [PDF, More, Code] bibtex

    @inproceedings{koppula2011semantic,
      title={Semantic Labeling of 3D Point Clouds for Indoor Scenes},
      author={Koppula, H.S. and Anand, A. and Joachims, T. and Saxena, A.},
      year={2011},
      booktitle={NIPS}
    }
    
  • θ-MRF: Capturing Spatial and Semantic Structure in the Parameters for Scene Understanding,
    Congcong Li, Ashutosh Saxena, Tsuhan Chen.
    In Neural Information Processing Systems (NIPS), 2011. [PDF] bibtex

    @inproceedings{li2011_thetamrf,
      title={$\theta$-MRF: Capturing Spatial and Semantic Structure in the Parameters for Scene Understanding},
      author={Li, C. and Saxena, A. and Chen, T.},
      year={2011},
      booktitle={NIPS}
    }
    
  • MDPs with Unawareness,
    Joseph Y. Halpern, Nan Rong, Ashutosh Saxena.
    In Uncertainty in Artificial Intelligence (UAI 2010), 2010. [PDF, Extended version] bibtex

    @inproceedings{halpern2010mdps,
      title={MDPs with Unawareness},
      author={Halpern, J.Y. and Rong, N. and Saxena, A.},
      booktitle={UAI},
      year={2010}
    }
    
  • Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models,
    Congcong Li, Adarsh Kowdle, Ashutosh Saxena, Tsuhan Chen.
    In Neural Information Processing Systems (NIPS), 2010. [PDF, More] bibtex

    @inproceedings{li2010feccm,
      title={Towards holistic scene understanding: Feedback enabled cascaded classification models},
      author={Li, C. and Kowdle, A. and Saxena, A. and Chen, T.},
      booktitle={Neural Information Processing Systems (NIPS)},
      year={2010}
    }
    

    Full version to appear in IEEE Trans Pattern Analysis and Machine Intelligence (PAMI), 2011. [PDF-ieee-tpami, IEEE link, More] bibtex

    @article{li2011feccm,
      title={Towards holistic scene understanding: Feedback enabled cascaded classification models},
      author={Li, C. and Kowdle, A. and Saxena, A. and Chen, T.},
      booktitle={Pattern Analysis and Machine Intelligence, IEEE Transactions on},
      year={2011}
    }
    
  • Monocular Depth Perception and Robotic Grasping of Novel Objects,
    Ashutosh Saxena.
    Ph.D. Thesis, Stanford University, June 2009. [PDF] bibtex

    @phdthesis{saxena2009monocular,
      title={Monocular depth perception and robotic grasping of novel objects},
      author={Saxena, A.},
      year={2009},
      school={Stanford University}
    }
    
  • Cascaded Classification Models: Combining Models for Holistic Scene Understanding,
    Geremy Heitz, Stephen Gould, Ashutosh Saxena, Daphne Koller.
    In Neural Information Processing Systems (NIPS), 2008. (full oral) [PDF, More] bibtex

    @inproceedings{heitz2008cascaded,
      title={Cascaded classification models: Combining models for holistic scene understanding},
      author={Heitz, G. and Gould, S. and Saxena, A. and Koller, D.},
      booktitle={Neural Information Processing Systems},
      year={2008}
    }
    
  • Make3D: Learning 3D Scene Structure from a Single Still Image,
    Ashutosh Saxena, Min Sun, Andrew Y. Ng.
    IEEE Transactions of Pattern Analysis and Machine Intelligence (PAMI), vol. 30, no. 5, pp 824-840, 2009. [PDF, Make3d]
    (Original version received best paper award at ICCV 3dRR in 2007.) bibtex

    @article{saxena2009make3d,
      title={Make3d: Learning 3d scene structure from a single still image},
      author={Saxena, A. and Sun, M. and Ng, A.Y.},
      journal={Pattern Analysis and Machine Intelligence, IEEE Transactions on},
      volume={31},
      number={5},
      pages={824--840},
      year={2009},
      publisher={IEEE}
    }
    
  • Robotic Grasping of Novel Objects using Vision,
    Ashutosh Saxena, Justin Driemeyer, Andrew Y. Ng.
    International Journal of Robotics Research (IJRR), vol. 27, no. 2, pp. 157-173, Feb 2008. [PDF, ijrr-PDF, more] bibtex

    @article{saxena2008roboticgrasping,
      title={Robotic grasping of novel objects using vision},
      author={Saxena, A. and Driemeyer, J. and Ng, A.Y.},
      journal={The International Journal of Robotics Research},
      volume={27},
      number={2},
      pages={157},
      year={2008},
      publisher={SAGE Publications}
    }
    
    
    (IJRR is ranked 1/17 in robotics by Journal Citation Reports, 2010.)
  • 3-D Depth Reconstruction from a Single Still Image,
    Ashutosh Saxena, Sung H. Chung, Andrew Y. Ng.
    International Journal of Computer Vision (IJCV), vol. 76, no. 1, pp 53-69, Jan 2008. (Online first: Aug 2007). [PDF, more] bibtex

    @article{saxena20083Ddepth,
      title={3-d depth reconstruction from a single still image},
      author={Saxena, A. and Chung, S.H. and Ng, A.Y.},
      journal={International Journal of Computer Vision},
      volume={76},
      number={1},
      pages={53--69},
      year={2008},
      publisher={Springer}
    }
    
    (IJCV had the highest impact factor (ISI 6.085 in 2006) in all computer sciene journals.)
  • Robotic Grasping of Novel Objects,
    Ashutosh Saxena, Justin Driemeyer, Justin Kearns, Andrew Y. Ng.
    In Neural Information Processing Systems (NIPS) 19, 2006. (spotlight paper) [PDF, more] bibtex

    @inproceedings{saxena2006roboticgrasping,
      title={Robotic grasping of novel objects},
      author={Saxena, A. and Driemeyer, J. and Kearns, J. and Ng, A.Y.},
      booktitle={Neural Information Processing Systems},
      year={2006},
    }
  • High Speed Obstacle Avoidance using Monocular Vision and Reinforcement Learning,
    Jeff Michels, Ashutosh Saxena, Andrew Y. Ng.
    In 22nd Int'l Conf on Machine Learning (ICML), 2005. [PDF, PPT, more, aerial vehicles] bibtex

    @inproceedings{michels2005obstacleavoidance,
      title={High speed obstacle avoidance using monocular vision and reinforcement learning},
      author={Michels, J. and Saxena, A. and Ng, A.Y.},
      booktitle={Proceedings of the 22nd international conference on Machine learning},
      pages={593--600},
      year={2005},
      organization={ACM}
    }
    
  • Learning Depth from Single Monocular Images,
    Ashutosh Saxena, Sung H. Chung, Andrew Y. Ng.
    In Neural Information Processing Systems (NIPS) 18, 2005. [PDF] bibtex

    @inproceedings{saxena2005learningdepth,
      title={Learning depth from single monocular images},
      author={Saxena, A. and Chung, S.H. and Ng, A.},
      booktitle={Neural Information Processing Systems 18},
      year={2005},
    }
    			


PEER-REVIEWED PUBLICATIONS BY YEAR

2012

  • Learning Object Arrangements in 3D Scenes using Human Context,
    Yun Jiang, Marcus Lim, Ashutosh Saxena.
    To appear in International Conference of Machine Learning (ICML), 2012. [PDF] bibtex

    @article{jiang2012humancontext,
      title={Learning Object Arrangements in 3D Scenes using Human Context},
      author={Yun Jiang and Marcus Lim and Ashutosh Saxena},
      year={2012},
      booktitle={ICML}
    }
    
  • Learning to Place New Objects in a Scene,
    Yun Jiang, Marcus Lim, Changxi Zheng, Ashutosh Saxena.
    In International Journal of Robotics Research (IJRR), 2012. [PDF, ijrr-pdf] bibtex

    @article{jiang2012placingobjects,
      title={Learning to Place New Objects in a Scene},
      author={Yun Jiang and Marcus Lim and Changxi Zheng and Ashutosh Saxena},
      year={2012},
      booktitle={IJRR}
    }
    
  • Learning the Right Model: Efficient Max-Margin Learning in Laplacian CRFs,
    Dhruv Batra, Ashutosh Saxena.
    To appear in Computer Vision and Pattern Recognition (CVPR), 2012. [PDF, supplementary material] bibtex

    @article{laplaciancrfs_cvpr2012,
      title={Learning the Right Model: Efficient Max-Margin Learning in Laplacian CRFs},
      author={Dhruv Batra and Ashutosh Saxena},
      year={2012},
      booktitle={CVPR}
    }
    
  • Robotic placing of objects with human and robot context,
    Yun Jiang, Alejandro Perez, Ashutosh Saxena.
    To appear in 13th International Symposium on Experimental Robotics (ISER), 2012. [PDF coming soon] bibtex

    @article{jiang2012placingobjects_context,
      title={Robotic placing of objects with human and robot context},
      author={Yun Jiang and Alejandro Perez and Ashutosh Saxena},
      year={2012},
      booktitle={ISER}
    }
    
  • Learning to Place New Objects,
    Yun Jiang, Changxi Zheng, Marcus Lim, Ashutosh Saxena.
    In International Conference on Robotics and Automation (ICRA), 2012. First appeared in RSS workshop on mobile manipulation, June 2011. [PDF, more] bibtex

    @inproceedings{jiang2011learningtoplace,
      title={Learning to place new objects},
      author={Jiang, Y. and Zheng, C. and Lim, M. and Saxena, A.},
      booktitle={International Conference on Robotics and Automation (ICRA)},
      year={2012}
    }
    

  • Unstructured Human Activity Detection from RGBD Images, Jaeyong Sung, Colin Ponce, Bart Selman, Ashutosh Saxena. To appear in International Conference on Robotics and Automation (ICRA), 2012. [PDF, more]

  • Learning Hardware Agnostic Grasps for a Universal Jamming Gripper, Yun Jiang, John Amend, Hod Lipson, Ashutosh Saxena. To appear in International Conference on Robotics and Automation (ICRA), 2012. [PDF, more]

  • Co-evolutionary Predictors for Kinematic Pose Inference from RGBD Images, Daniel Ly, Ashutosh Saxena, Hod Lipson. To appear in Genetic and Evolutionary Computation Conference (GECCO), 2012. [PDF]

2011

  • Semantic Labeling of 3D Point Clouds for Indoor Scenes,
    Hema Koppula, Abhishek Anand, Thorsten Joachims, Ashutosh Saxena.
    In Neural Information Processing Systems (NIPS), 2011. [PDF, More, Code] bibtex

    @inproceedings{koppula2011semantic,
      title={Semantic Labeling of 3D Point Clouds for Indoor Scenes},
      author={Koppula, H.S. and Anand, A. and Joachims, T. and Saxena, A.},
      year={2011},
      booktitle={NIPS}
    }
    
  • θ-MRF: Capturing Spatial and Semantic Structure in the Parameters for Scene Understanding,
    Congcong Li, Ashutosh Saxena, Tsuhan Chen.
    In Neural Information Processing Systems (NIPS), 2011. [PDF] bibtex

    @inproceedings{li2011_thetamrf,
      title={$\theta$-MRF: Capturing Spatial and Semantic Structure in the Parameters for Scene Understanding},
      author={Li, C. and Saxena, A. and Chen, T.},
      year={2011},
      booktitle={NIPS}
    }
    
  • Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models,
    Congcong Li, Adarsh Kowdle, Ashutosh Saxena, Tsuhan Chen.
    To appear in IEEE Trans Pattern Analysis and Machine Intelligence (PAMI), July 2012. (Online First: 2011) [PDF, More]

    Original version in Neural Information Processing Systems (NIPS), 2010. [PDF, More]
  • Efficient Grasping from RGBD images: Learning using a new Rectangle Representation, Yun Jiang, Stephen Moseson, Ashutosh Saxena. In International Conference on Robotics and Automation (ICRA 2011), 2011. [PDF]

  • Autonomous MAV Flight in Indoor Environments using Single Image Perspective Cues, Cooper Bills, Joyce Chen, Ashutosh Saxena. In International Conference on Robotics and Automation (ICRA 2011), 2011. [PDF, More]

  • Robotic Object Detection: Learning to Improve the Classifiers using Sparse Graphs for Path Planning. Zhaoyin Jia, Ashutosh Saxena, Tsuhan Chen. In 22nd International Joint Conference on Artificial Intelligence (IJCAI), 2011. [PDF]

2010

2009


2000-2004 (During B.Tech. at IIT Kanpur)

  • Non-Linear Dimensionality Reduction by Locally Linear Isomaps,
    Ashutosh Saxena, Abhinav Gupta and Amitabha Mukerjee. In Lecture Notes in Computer Science, Proc 11th Int'l Conf on Neural Information Processing- ICONIP 2004, vol. 3316, 2004. [PDF, Springer]

  • In Use Parameter Estimation of Inertial Sensors by Detecting Multilevel Quasi-Static States,
    Ashutosh Saxena, Gaurav Gupta, Vadim Gerasimov, Sebastian Ourselin. In Lecture Notes in Computer Science, vol. 3684, KES, 2005. [PDF, Springer]

  • Robust Facial Expression Recognition using Spatially Localized Geometric Model,
    Ashutosh Saxena, Ankit Anand and Amitabha Mukerjee. In proc. Int'l Conf Systemics, Cybernetics and Informatics ICSCI, vol. 1, pp 124-129, 2004. [PDF]

  • A Microprocessor based Speech Recognizer for Isolated Hindi Digits,
    Ashutosh Saxena, and Abhishek Singh. In IEEE Annual Convention and Exhibition ACE 2002, India, 2002.
    Also awarded Best Paper in IEEE India Student Paper contest 2002. [PDF, More]

  • Bioinspired Modification of Polystyryl Matrix: Single-step Chemical Evolution to a Moderately Conducting Polymer,
    Ashutosh Saxena, S.G. Srivatsan, Vishal Saxena, Sandeep Verma. Chemistry Letters, vol. 33, no. 6, pp. 740-741, 2004. [PDF]

  • A Novel Electric Shock Protection System based on Contact Currents on Skin Surface,
    Ashutosh Saxena, Supratim Ray, and Rajiv K. Varma. In proc. Twelfth National Power Systems Conference, India, vol. 2, pp 584-587, 2002. [PDF, Extended version: PDF]



SEMINARS / INVITED TALKS / TECHNICAL REPORTS / DEMOS / WORKSHOPS

  1. Learning to Place Objects: Organizing a Room, Gaurab Basu, Yun Jiang, Ashutosh Saxena. Video contribution in ICRA,2012. [youtube video]

  2. 3D Perception for Personal Assistant Robots, Ashutosh Saxena. Talk in R:SS workshop on RGB-D cameras, 2011. [slides]

  3. Inferring 3D Articulated Models for Box Packaging Robot, Paul Heran Yang, Tiffany Low, Matthew Cong, Ashutosh Saxena. In RSS workshop on mobile manipulation, 2011. [PDF, More]

  4. Human Activity Detection from RGBD Images, Jae Y. Sung, Colin Ponce, Bart Selman, Ashutosh Saxena. In AAAI workshop on Pattern, Activity and Intent Recognition (PAIR), 2011. [PDF, More]

  5. Labeling 3D Scenes for Personal Assistant Robots, Hema Koppula, Abhishek Anand, Thorsten Joachims, Ashutosh Saxena. In R:SS workshop on RGB-D cameras, 2011. [PDF, More]

  6. Pose estimation from a single depth image for arbitrary kinematic skeletons, Daniel Ly, Ashutosh Saxena, Hod Lipson. In R:SS workshop on RGB-D cameras, 2011. [PDF]

  7. FeCCM for Scene Understanding: Helping the Robot to Learn Multiple Tasks, Congcong Li, TP Wong, Norris Xu, Ashutosh Saxena. Video contribution in International Conference on Robotics and Automation (ICRA 2011), 2011. [PDF, mp4, youtube, More]

  8. Robotic Grasping and Depth Perception: Learning 3D Models from a Single Image. Ashutosh Saxena. In:
    AFRL workshop, 2010.
    UC Berkeley, 2010.
    Cornell University, 2009.
    University of California, Los Angeles (UCLA), 2009.
    TTI-C, 2009.
    Oxford University (UK), 2008.
    MSR Cambridge (UK), 2008.
    Grasp seminar, University of Pennsylvania (Upenn), 2008.
    PIXL seminar, Princeton University, 2008.
    VASC seminar. Carnegie Mellon University (CMU), 2008.
    GRAIL seminar. University of Washington, 2008.
    Microsoft Research / Live-labs (Redmond), 2008.
    PAML seminar, UIUC, 2008.

  9. Rapid Interactive 3D Reconstruction from a Single Still Image, Ashutosh Saxena, Nuwan Senaratna, Savil Srivastava, Andrew Y. Ng. In SIGGRAPH Late Breaking work (Informal Session), 2008. [1-page PDF, Video]

  10. Monocular 3D Depth Perception for Navigation, Ashutosh Saxena. In ARO/NSF Workshop on Future Directions in Visual Navigation, May 2008.

  11. Learning to Open New Doors,
    Ellen Klingbeil, Ashutosh Saxena, Andrew Y. Ng. In AAAI 17th Annual Robot Workshop and Exhibition, 2008. [PDF]

  12. Building a 3-D Model From a Single Still Image,
    Ashutosh Saxena, Min Sun and Andrew Y. Ng. Demonstration in Neural Information Processing Systems (NIPS), 2007.
    Also presented at NIPS Workshop on The Grammar of Vision: Probabilistic Grammar-Based Models for Visual Scene Understanding and Object Categorization, 2007. [png]
    Also in AAAI IS Demonstration, 2008.

  13. Learning 3-D Object Orientation from Images,
    Ashutosh Saxena, Justin Driemeyer and Andrew Y. Ng. NIPS workshop on Robotic Challenges for Machine Learning, 2007. [abstract, extended full version]

  14. Data Manipulation and Creation Techniques for Learning Tasks,
    NIPS workshop on Principles of Learning Problem Design, 2007. [ppt]

  15. Monocular Vision and its applications,
    HomeBrew Robotics Club, Jan 2007;  Stanford PAIL, Apr 2007;  Bay Area Vision Research Day (BAVRD), Aug 2007;   Stanford DAGS, Oct 2007;   Smith-Kettlewell Colloquium, Oct 2007;   Stanford GRAI, Oct 2007;   Nokia-NRC, Nov 2007;   MIT, Jan 2008;   Google, Jan 2008.

  16. STAIR: The STanford Artificial Intelligence Robot project, Andrew Y. Ng, Stephen Gould, Morgan Quigley, Ashutosh Saxena and Eric Berger.
    Learning Workshop, Snowbird, Apr 2008.

  17. Learning to Grasp Novel Objects using Vision,
    Ashutosh Saxena, Justin Driemeyer, Justin Kearns, Chioma Osondu, Andrew Y. Ng, RSS Workshop on Manipulation for Human Environments, 2006.

  18. STAIR: Robotic Grasping of Novel Objects,
    Stanford-KAIST Robotics Workshop, 2007.

  19. Ultrasonic Sensor Network: Realtime Target Localization with Passive Self-Localization,
    Ashutosh Saxena, and Andrew Ng, Project Report, CS229: Machine Learning, Stanford University, Dec 2004.

  20. A New Embedded Multiresolution Signaling Scheme for CPFSK ,
    Ashutosh Saxena, Ajit K. Chaturvedi, B. Tech. research thesis, IIT Kanpur, India, April 2004.

  21. Adaptive Multirate CDMA for Uplink ensuring Maximum Proportional Fairness,
    Ashutosh Saxena, Ajit K. Chaturvedi, IIT Kanpur tech report, April 2004.

  22. SANKET: Hand Gesture Recognition,
    Ashutosh Saxena, Aditya Awasthi and Vaibhav Vaish, IEEE CSIDC 2003. (More)




  23. PATENTS

    1. Ashutosh Saxena, Jingwei Lu, Nimish Khanolkar, RETRIEVAL AND RANKING OF ITEMS UTILIZING SIMILARITY, US Patent Application.

    2. Ashutosh Saxena, Sung Chung, Min Sun, Andrew Y. Ng, ARRANGEMENT AND METHOD FOR THREE-DIMENSIONAL DEPTH IMAGE CONSTRUCTION, US Patent Application.

    3. Undisclosed, Stanford University, 2008.

    Copyright notice

    All papers may be copyrighted by the journals/conferences, therefore, do not download without checking the journals' or conferences' copyright notices!

    * The final, definitive version of this paper has been published in IJRR, vol, issue, Feb 2008 by Sage Publications Ltd, All rights reserved. (c) SAGE publications Ltd, 2008. It is available online at http://online.sagepub.com