Corpora are essential to the development of applications based on natural language processing (NLP). In Ecuador, these applications are incompatible with local usage because each region uses words that fall outside standard Spanish. This article presents the development of a corpus compatible with the words of Ecuadorian natural language. We applied an identification algorithm to draw on local literature and populate a new database. The assembled corpus is verified through a quantitative and qualitative comparison with an open-access corpus. The result is the first such corpus in the country, with high scalability and great versatility. {\textcopyright} 2017 IEEE.