英文摘要 |
The project consists of the following four main items of work:
1. Case studies of big data applications and text mining module:
Provided Tornado international cases about big data analytic applications and natural semantic analysis module,And provides a complete API and SDK,And comply with open industry standard interface。In addition, through the system Adapter for different data structures, Provides .NET or JAVA interface and for the Web Service, XML or JSON-format data exchange. Comply with open industry standard interface to achieve the requirements of other systems, Series can be carried out more quickly Systems Integration.
2. Assist the integration of a huge amount of data analysis EPD sharing platform:
Hold four Tornado industrial training, To facilitate the integration and EPA colleagues to establish the ability to interpret data, Incorporated into the processing scope of the case, in order to carry out a single data source or across data source integration retrieval.
3. To assist the EPA text mining model training, And the establishment professional corpus of EPD, And the establishment of a professional of Corpus EPA Environmental Protection Agency summary information through open data, And plus EPA public hearing, EPD news material further extracted jargon, Establish a black list and white list.
4. Provide technology transfer, For EPD Application Integration ELAND provides customization website, Through customization website, ELAND provides a part of retrieval and semantic analysis, And holds four employee training. For EPA colleagues to facilitate the indexing information and the ability to use information retrieval, And can use semantic analysis
|