Nelson Salazar-Peña; Alejandra Tabares; Andrés González-Mancera
2026-03-22
2026-03-22
2026
10.2139/ssrn.6150849
https://doi.org/10.2139/ssrn.6150849
https://andeanlibrary.org/handle/123456789/84620
Decentralised system; Computer science; Reinforcement learning; Benchmark (surveying); Context (archaeology); Grid; Mathematical optimization; Stability (learning theory); Distributed computing; Markov decision process
Harnessing Implicit Cooperation: A Multi-Agent Reinforcement Learning Approach Towards Decentralized Local Energy Markets
preprint