Using Reinforcement Learning in the tuning of Central Pattern Generators

Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/28037

Registo completo

Campo DC	Valor	Idioma
dc.contributor.advisor	Analide, César	-
dc.contributor.advisor	Santos, Cristina	-
dc.contributor.author	Duarte, Ana Filipa de Sampaio Calçada	-
dc.date.accessioned	2014-02-18T15:07:46Z	-
dc.date.available	2014-02-18T15:07:46Z	-
dc.date.issued	2012-12-12	-
dc.identifier.uri	https://hdl.handle.net/1822/28037	-
dc.description	Dissertação de mestrado em Engenharia Informática	por
dc.description.abstract	É objetivo deste trabalho aplicar técnicas de Reinforcement Learning em tarefas de aprendizagem e locomoção de robôs. Reinforcement Learning é uma técnica de aprendizagem útil no que diz respeito à locomoção de robôs, devido à ênfase que dá à interação direta entre o agente e o meio ambiente, e ao facto de não exigir supervisão ou modelos completos, ao contrário do que acontece nas abordagens clássicas. O objetivo desta técnica consiste na decisão das ações a tomar, de forma a maximizar uma recompensa cumulativa, tendo em conta o facto de que as decisões podem afetar não só as recompensas imediatas, como também as futuras. Neste trabalho será apresentada a estrutura e funcionamento do Reinforcement Learning e a sua aplicação em Central Pattern Generators, com o objetivo de gerar locomoção adaptativa otimizada. De forma a investigar e identificar os pontos fortes e capacidades do Reinforcement Learning, e para demonstrar de uma forma simples este tipo de algoritmos, foram implementados dois casos de estudo baseados no estado da arte. No que diz respeito ao objetivo principal desta tese, duas soluções diferentes foram abordadas: uma primeira baseada em métodos Natural-Actor Critic, e a segunda, em Cross-Entropy Method. Este último algoritmo provou ser capaz de lidar com a integração das duas abordagens propostas. As soluções de integração foram testadas e validadas com recurso ao simulador Webots e ao modelo do robô DARwIN-OP.	por
dc.description.abstract	In this work, it is intended to apply Reinforcement Learning techniques in tasks involving learning and robot locomotion. Reinforcement Learning is a very useful learning technique with regard to legged robot locomotion, due to its ability to provide direct interaction between the agent and the environment, and the fact of not requiring supervision or complete models, in contrast with other classic approaches. Its aim consists in making decisions about which actions to take so as to maximize a cumulative reward or reinforcement signal, taking into account the fact that the decisions may affect not only the immediate reward, but also the future ones. In this work it will be studied and presented the Reinforcement Learning framework and its application in the tuning of Central Pattern Generators, with the aim of generating optimized robot locomotion. In order to investigate the strengths and abilities of Reinforcement Learning, and to demonstrate in a simple way the learning process of such algorithms, two case studies were implemented based on the state-of-the-art. With regard to the main purpose of the thesis, two different solutions are addressed: a first one based on Natural-Actor Critic methods, and a second, based on the Cross-Entropy Method. This last algorithm was found to be very capable of handling with the integration of the two proposed approaches. The integration solutions were tested and validated resorting to Webots simulation and DARwIN-OP robot model.	por
dc.language.iso	eng	por
dc.rights	openAccess	por
dc.subject	Inteligência Artificial	por
dc.subject	Aprendizagem	por
dc.subject	Reinforcement Learning	por
dc.subject	Central Pattern Generators	por
dc.subject	Locomoção Robótica	por
dc.subject	Otimização	por
dc.subject	Natural Actor-Critic	por
dc.subject	Cross-Entropy Method	por
dc.subject	Artificial Intelligence	por
dc.subject	Machine Learning	por
dc.subject	Robot Locomotion	por
dc.subject	Optimization	por
dc.title	Using Reinforcement Learning in the tuning of Central Pattern Generators	por
dc.type	masterThesis	por
dc.comments	eeum_di_dissertacao_pg17654	por
dc.subject.udc	681.3	-
Aparece nas coleções:	BUM - Dissertações de Mestrado

Ficheiros deste registo:

Ficheiro	Descrição	Tamanho	Formato
eeum_di_dissertacao_pg17654.pdf		3,34 MB	Adobe PDF	Ver/Abrir

Ver registo simples Sugerir correção Estatísticas