Abstract: To address the difficulty of analyzing and integrating multi-source heterogeneous water resources data in Beijing, big data and cloud computing technologies are adopted, based on an analysis of Beijing's water resources, to integrate these data effectively. For the structured and unstructured data involved, the corresponding data extraction, transformation, and storage technologies are studied, together with a technical architecture for water resources data integration. Structured data are extracted with the D2RQ tool, and unstructured data are extracted with the jieba word segmentation tool and the TF-IDF weighting algorithm. Experimental verification demonstrates the feasibility and credibility of this set of technical solutions. Furthermore, the data storage module uses distributed storage based on cloud computing technology to hold the large volume of data produced by fusion. The proposed data resource fusion scheme can help improve both the efficiency of data resource fusion and the ability to apply the resulting data resources.
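To make the unstructured-data step concrete, the following is a minimal sketch of TF-IDF keyword weighting over documents that have already been segmented into words. It is an illustration under stated assumptions, not the implementation used in this work: the function names `tf_idf` and `top_keywords`, the sample documents, and the unsmoothed `log(N / df)` form of the inverse document frequency are all hypothetical choices. In practice the token lists could come from a segmenter such as jieba (for example via `jieba.lcut`), which is omitted here so that the sketch depends only on the standard library.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Compute TF-IDF weights for a list of tokenized documents.

    docs: list of token lists (assumed to be already segmented).
    Returns one {term: weight} dict per document.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        # Term frequency times unsmoothed inverse document frequency.
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


def top_keywords(doc_weights, k=3):
    """Return the k highest-weighted terms, breaking ties alphabetically."""
    ranked = sorted(doc_weights.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


# Hypothetical pre-segmented documents, for illustration only.
docs = [
    ["water", "quality", "monitoring", "reservoir"],
    ["reservoir", "water", "level", "report"],
    ["groundwater", "water", "extraction", "permit"],
]
weights = tf_idf(docs)
```

A term that occurs in every document, such as "water" in this sample, receives a weight of zero, while terms specific to one document rank highest. This is the property that makes the weighting useful for picking out distinguishing keywords from unstructured text.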