Correlated Q Learning