2006 MaximumMarginPlanning
- (Ratliff et al., 2006) ⇒ Nathan D. Ratliff, J. Andrew Bagnell, and Martin A. Zinkevich. (2006). “Maximum Margin Planning.” In: Proceedings of the 23rd International Conference on Machine learning. ISBN:1-59593-383-2 doi:10.1145/1143844.1143936
Subject Headings: Imitation Learning.
Notes
Cited By
- http://scholar.google.com/scholar?q=%222006%22+Maximum+Margin+Planning
- http://dl.acm.org/citation.cfm?id=1143844.1143936&preflayout=flat#citedby
Quotes
Author Keywords
connectionism and neural nets problem solving, control methods, and search theory
Abstract
Imitation learning of sequential, goal-directed behavior by standard supervised techniques is often difficult. We frame learning such behaviors as a maximum margin structured prediction problem over a space of policies. In this approach, we learn mappings from features to cost so an optimal policy in an MDP with these cost mimics the expert's behavior. Further, we demonstrate a simple, provably efficient approach to structured maximum margin learning, based on the subgradient method, that leverages existing fast algorithms for inference. Although the technique is general, it is particularly relevant in problems where A* and dynamic programming approaches make learning policies tractable in problems beyond the limitations of a QP formulation. We demonstrate our approach applied to route planning for outdoor mobile robots, where the behavior a designer wishes a planner to execute is often clear, while specifying cost functions that engender this behavior is a much more difficult task.
References
;
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
2006 MaximumMarginPlanning | Nathan D. Ratliff J. Andrew Bagnell Martin A. Zinkevich | Maximum Margin Planning | 10.1145/1143844.1143936 | 2006 |