#1
Which of the following is a type of machine learning in which an agent learns from rewards or penalties?
Reinforcement Learning
Explanation: Reinforcement learning involves learning from rewards or penalties.
#2
What is the main goal of reinforcement learning?
To maximize a reward signal
Explanation: The main goal of reinforcement learning is to maximize the cumulative reward collected over time, not just the immediate reward.
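One common way to make "cumulative reward" precise is the discounted return. The discount factor \gamma and the reward symbol r are notation assumed here for illustration; the quiz itself does not define them.

```latex
G_t = \sum_{k=0}^{\infty} \gamma^{k} \, r_{t+k+1}, \qquad 0 \le \gamma \le 1
```

With \gamma = 1 and a finite episode this reduces to the plain sum of rewards; smaller values of \gamma weight near-term rewards more heavily.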
#3
In reinforcement learning, what does the agent learn from?
Rewards or penalties
Explanation: The agent in reinforcement learning learns from receiving rewards or penalties.
#4
What is positive reinforcement in operant conditioning?
Adding a stimulus to increase a behavior
Explanation: Positive reinforcement involves adding a stimulus to increase the likelihood of a behavior.
#5
What is the difference between reinforcement learning and supervised learning?
Supervised learning uses labeled data, while reinforcement learning does not.
Explanation: Supervised learning relies on labeled data for training, while reinforcement learning learns from reward feedback on its own actions rather than from labeled correct answers.
#6
What is continuous reinforcement?
Reinforcing a behavior every time it occurs
Explanation: Continuous reinforcement involves reinforcing a behavior every time it is exhibited.
#7
Which of the following is an example of a reinforcement learning algorithm?
Q-Learning
Explanation: Q-Learning is a model-free reinforcement learning algorithm that learns action values in order to find an optimal policy in a Markov decision process.
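As a rough illustration, a single tabular Q-Learning update might look like the sketch below. The function name `q_learning_update`, the dictionary-based table, and the default values of `alpha` (learning rate) and `gamma` (discount factor) are assumptions chosen for this example, not part of the quiz.

```python
from collections import defaultdict


def q_learning_update(q, state, action, reward, next_state, actions,
                      alpha=0.1, gamma=0.9):
    """Apply one Q-Learning update to the table `q` and return the new value.

    `q` maps (state, action) pairs to estimated values. `alpha` and `gamma`
    are illustrative defaults, not recommended settings.
    """
    # Value of the best action available in the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    # Target: immediate reward plus discounted best future value.
    td_target = reward + gamma * best_next
    # Move the current estimate a fraction alpha toward the target.
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Unseen (state, action) pairs start at 0.0.
q_table = defaultdict(float)
q_learning_update(q_table, "s0", "right", 1.0, "s1", ["left", "right"])
```

Because the target uses the maximum over next actions rather than the action actually taken next, the update is off-policy.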
#8
Which of the following is an example of negative reinforcement?
Stopping a loud noise when a seatbelt is fastened
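Explanation: Negative reinforcement involves removing an aversive stimulus to increase the likelihood of a behavior. Here, the loud noise stops once the seatbelt is fastened, which makes fastening it more likely.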
#9
In reinforcement learning, what is an 'environment'?
The external system the agent interacts with, which responds to the agent's actions with new states and rewards
Explanation: The environment in reinforcement learning is everything outside the agent. It receives the agent's actions and returns the resulting state (or observation) and reward.
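To make the idea concrete, here is a minimal sketch of a toy environment. The class `CorridorEnv`, its `reset` and `step` methods, and the reward of 1.0 at the goal are all hypothetical choices for illustration; the interface is only loosely modeled on the reset/step convention used by common RL toolkits.

```python
class CorridorEnv:
    """Toy corridor: the agent starts at cell 0 and the goal is the last cell."""

    def __init__(self, length=4):
        self.length = length
        self.position = 0

    def reset(self):
        """Start a new episode and return the initial state."""
        self.position = 0
        return self.position

    def step(self, action):
        """Apply an action (0 = left, 1 = right).

        Returns a (next_state, reward, done) tuple.
        """
        move = 1 if action == 1 else -1
        # Clamp the position so the agent cannot leave the corridor.
        self.position = min(max(self.position + move, 0), self.length - 1)
        done = self.position == self.length - 1
        reward = 1.0 if done else 0.0
        return self.position, reward, done
```

The agent never sees the class internals; it only sees what `reset` and `step` return, which is the sense in which the environment is "outside" the agent.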
#10
What is extinction in operant conditioning?
When a conditioned response is weakened or eliminated
Explanation: Extinction in operant conditioning occurs when a previously reinforced behavior diminishes or disappears because it is no longer reinforced.
#11
In reinforcement learning, what is the 'policy' of an agent?
The strategy that the agent uses to determine its actions
Explanation: The policy of an agent in reinforcement learning is its mapping from states to actions (or to probabilities over actions), which defines how it selects an action in each state.
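One simple example of a policy is epsilon-greedy action selection, sketched below. The function name `epsilon_greedy` and the default `epsilon` are assumptions for illustration; the quiz does not name a specific policy.

```python
import random


def epsilon_greedy(q_values, epsilon=0.1, rng=random):
    """Pick an action index given estimated action values for one state.

    With probability `epsilon` the agent explores (uniform random action);
    otherwise it exploits (the action with the highest estimated value).
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    # Greedy choice; ties go to the lowest index.
    return max(range(len(q_values)), key=lambda a: q_values[a])


# With epsilon=0.0 the policy is purely greedy.
action = epsilon_greedy([0.1, 0.7, 0.3], epsilon=0.0)
```

Setting `epsilon` above zero trades some immediate reward for exploration of actions whose values are still uncertain.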
#12
What is the concept of temporal credit assignment in reinforcement learning?
It refers to the process of assigning credit to actions based on their long-term impact on rewards.
Explanation: Temporal credit assignment is the problem of determining which earlier actions deserve credit for a reward that arrives later, since rewards in reinforcement learning are often delayed.
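One simple way to spread a delayed reward back over earlier steps is to compute the discounted return from each time step onward. The sketch below assumes a hypothetical helper `returns_to_go` and a discount factor `gamma`; it is one illustrative approach to credit assignment, not the only one.

```python
def returns_to_go(rewards, gamma=0.9):
    """Return, for each time step, the discounted sum of rewards from that step on."""
    credits = []
    running = 0.0
    # Walk backwards so each step can reuse the return of the step after it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        credits.append(running)
    credits.reverse()
    return credits


# A single reward at the final step is credited, with discounting,
# to the earlier steps that led up to it.
credits = returns_to_go([0.0, 0.0, 1.0], gamma=0.5)
```

Steps closer to the reward receive more credit, and steps further back receive geometrically less.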