Hybrid Reward Architecture

Hybrid Reward Architecture is the name the team at Maluuba gave to the method it used.

The architecture is built around a super agent, which is responsible for making the decisions.

The super agent is assisted by more than 150 ordinary agents, which supply it with input data.

Microsoft’s AI makes history, sets the highest possible score on Ms Pac-Man

The super agent used the data relayed by these agents to decide which direction to move Ms. Pac-Man.
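The division of labour described above can be sketched roughly as follows. This is only an illustration, not Maluuba's implementation: the article does not say how the super agent combines its inputs, so the summing rule, the `choose_direction` function, and the example scores are all assumptions made for the sketch.

```python
from collections import defaultdict

DIRECTIONS = ("up", "down", "left", "right")


def choose_direction(agent_preferences):
    """Pick a move from the scores reported by the ordinary agents.

    Each ordinary agent reports a dict mapping a direction to a score.
    Assumption: the super agent simply sums the scores per direction
    and picks the largest total (the article does not specify the rule).
    """
    totals = defaultdict(float)
    for preferences in agent_preferences:
        for direction, score in preferences.items():
            totals[direction] += score
    # Directions no agent mentioned count as 0.0; ties go to the
    # direction listed first in DIRECTIONS.
    return max(DIRECTIONS, key=lambda d: totals[d])


# Three hypothetical agents: two are drawn to pellets on the left,
# one strongly wants to avoid a ghost on the left.
agent_preferences = [
    {"left": 1.0, "right": 0.2},
    {"left": 0.5},
    {"left": -5.0, "up": 0.1},
]

print(choose_direction(agent_preferences))
```

In this toy case the strong negative score from the ghost-avoiding agent outweighs the two pellet-seeking agents, so the combined decision moves away from the left.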

Source: Neowin


source: www.techworm.net