Operácie

Reinforcement Learning: Rozdiel medzi revíziami

Z SensorWiki

 
Riadok 2: Riadok 2:
 
* http://www.jair.org/media/301/live-301-1562-jair.pdf  
 
* http://www.jair.org/media/301/live-301-1562-jair.pdf  
 
* http://www.cs.indiana.edu/~gasser/Salsa/rl.html
 
* http://www.cs.indiana.edu/~gasser/Salsa/rl.html
 +
* http://www.fdaw.unimaas.nl/education/3.2is/slides/05.Reinforcement%20Learning.pdf
 +
 +
 
* Mance E. Harmon, Stephanie S. Harmon: ''[http://www.nbu.bg/cogs/events/2000/Readings/Petrov/rltutorial.pdf Reinforcement Learning: A Tutorial.]''
 
* Mance E. Harmon, Stephanie S. Harmon: ''[http://www.nbu.bg/cogs/events/2000/Readings/Petrov/rltutorial.pdf Reinforcement Learning: A Tutorial.]''
  
Riadok 8: Riadok 11:
 
* Example 1: http://people.revoledu.com/kardi/tutorial/ReinforcementLearning/index.html
 
* Example 1: http://people.revoledu.com/kardi/tutorial/ReinforcementLearning/index.html
 
* Example 2: http://lslwww.epfl.ch/~anperez/RL/
 
* Example 2: http://lslwww.epfl.ch/~anperez/RL/
 +
* Simulator: http://www.cs.cmu.edu/~awm/rlsim/
 
* Resources: http://people.revoledu.com/kardi/tutorial/ReinforcementLearning/Resources.html
 
* Resources: http://people.revoledu.com/kardi/tutorial/ReinforcementLearning/Resources.html

Aktuálna revízia z 11:31, 4. jún 2009