Logical team Q-learning: An approach towards factored policies in cooperative MARL