SWSTAT - 소프트웨어통계포털

2016-006

지능정보기술 동향 조사

안성원AI정책연구실 실장
김태호 역대 연구원
송지환SW기반정책·인재연구실 책임연구원
추형석 역대연구원

2017.06.23

11976

- 과제명
- 지능정보기술 동향 조사 – 자연어 처리를 중심으로
- 연구 목적
- 최근 자연어처리 분야 연구 동향 및 SW/DB 조사·분석
- 연구 필요성
- 최근 여러 연구기관에서 다양한 자연어처리 세부분야의 연구를 진행
- 지능정보기술 동향 조사 요약
- 1. 자연어처리
- - 자연어처리의 기본 개념과 하위 분류 조사
- 2. 학술대회 및 경진대회
- - 학술대회, 경진대회별 역사와 범위, 특징 조사
- 3. 실제 사례
- - 언어처리 기술이 적용된 실제 사례 조사
- - 사례별 역사와 특징 나열
- 4. 언어처리 기술수준
- - 공개된 SW의 성능과 지원 모듈 조사
- 5. 지식 데이터 베이스 현황조사
- - 공개된 국내외 지식 데이터베이스 현황 조사
- - 각 지식데이터베이스의 문서량과 종류 조사
- 연구 성과
- - 학술대회와 경진대회의 특징과 역사 조사 성과물
- - 언어처리 기술이 적용된 실제사례 조사 성과물
- - 각 기관별 언어처리 기술 수준 조사 성과물
- - 공개된 데이터베이스 조사 및 정리 성과물
- 기대효과
- 국내•외 학회 또는 경진대회의 특징 및 분야를 참고하면 앞으로의 동향을 읽는 데에 도움이 될 것이다. 또한 언어처리 기술이 적용된 실제 사례를 참고해 공개된 데이터베이스를 활용하면 새로운 연구를 진행하는 데 좋은 기반이 될 수 있을 것으로 예상된다.
SUMMARY
- Title
- Survey on the trends of Intelligence-information technology - Focused on Natural language processing
- Purpose
- Recent Research Trends in Natural Language Processing and Research of SW/DB
- Need
- In recent years, various research institutes have been conducting research on various fields of natural language processing.
- Summary of Intelligent Information Technology Trend
- 1. NLP (Natural Language Processing)
- Basic concept and sub-classification of NLP
- 2. Related academic conferences and contests
- History, scope and characteristics of domestic and international academic conferences and contests
- 3. Real case study
- survey on NLP　Real Case
- case History and List features
- 4. NLP technology level
- Survey of NLP open source’s performance and spec
- 5. Survey on Knowledge Database for NLP
- Survey on domestic and international open Knowledge Database for NLP
- Survey on characteristic of each Knowledge Database
- Research Results
- - Survey on domestic and international academic conferences and contests
- - Real case study on NLP technology
- - Research on NLP technology level for each application
- - Categorization on open knowledge databases for NLP
- Expectations
- his report will help understand trends for NLP with its details for academic conferences and content. Additionally, it will contribute to the future works by using the real case study and the free knowledge database for NLP.

- 제1장 서 론
- 제2장 자연어처리 기술
- 제1절 자연어처리 개요
- 제2절 형태소 분석 및 형태소 품사 부착
- 제3절 함의(Entailment)
- 제4절 질의 응답(Question Answering System)
- 제5절 정보 추출(Information Extraction)
- 제6절 구문 분석
- 제7절 상호 참조
- 제8절 개체명
- 제9절 의미역
- 제10절 감성 분석
- 제3장 기술 동향
- 제1절 학술 대회
- 1. ICON
- 2. ACL
- 3. 한글 및 한국어 정보처리 학술대회
- 제2절 경진 대회
- 1. NLP tools contest
- 2. EVALITA
- 3. TAC
- 4. 국어 정보 처리 시스템 경진대회
- 제4장 언어지능 기술이 적용된 실제 사례 조사
- 제1절 Virtual Assistance
- 제2절 Social Network analytics tools
- 1. Keyhole
- 2. Signal Media
- 3. Brandwatch
- 제5장 언어처리 기술 수준
- 제1절 공개된 SW의 종류와 성능
- 1. CoreNLP
- 2. Natural Language Toolkit
- 3. KoNLPy
- 4. Google SyntaxNet
- 5. Apache OpenNLP
- 6. Apache Lucene
- 7. UIMA
- 8. MALLET
- 9. ESPRESSO
- 10. 꼬꼬마
- 11. 코모란
- 12. 동아대학교 띄어쓰기 모델
- 제2절 연구소, 대학, 기업의 연구 수준
- 1. Stanford
- 2. Berkeley group
- 3. MIT
- 4. Sheffield
- 5. Univ. of Washington
- 6. MS Human language technologies
- 7. IBM NLP group
- 8. 다이퀘스트
- 9. ETRI(한국전자통신연구원)
- 제6장 지식데이터베이스 현황 조사
- 제1절 한국어 지식 데이터베이스
- 1. 서울대학교
- 2. 언어정보연구원
- 3. 국어 국립원
- 4. ETRI
- 5. 한국과학기술정보연구원과 충남대학교
- 6. 울산대학교
- 7. 고려대학교
- 8. 펜실베니아 대학교
- 9. LDC
- 10. kAIST
- 11. 언어자원은행
- 제2절 해외 지식 데이터 베이스
- 1. 영국 코퍼스
- 2. 미국 코퍼스
- 3. 체코 코퍼스
- 4. 헝가리 코퍼스
- 5. 러시아 코퍼스
- 6. 이탈리아 코퍼스
- 7. 그리스 코퍼스
- 8. 독일 코퍼스
- 9. 중국 코퍼스
- 제3절 모니터 코퍼스
- 1. The Bank of English(BoE) 코퍼스
- 2. Global English Monitor 코퍼스
- 제4절 Brown 계열 코퍼스
- 제5절 공시적 코퍼스
- 1. The International Corpus of English(ICE)
- 2. The Longman/Lancaster Corpus
- 3. The Longman Written American Corpus
- 4. The CREA corpus
- 5. The LIVAC corpus
- 제6절 통시적 코퍼스
- 1. The Helsinki corpus
- 2. The ARCHER corpus
- 3. The Corpus of Early English Correspondence
- 4. The Zurich English Newspaper Corpus(ZEN)
- 5. 영어대화 코퍼스
- 제7절 음성 코퍼스
- 1. The London-Lund Corpus
- 2. 10대 언어의 Bergen corpus
- 3. The Spoken corpus of the Survey of English Dialect
- 4. The Intonational Variation in English Corpus
- 5. The HongKong Corpus of Conversational English(HKCCE)
- 제8절 학업 및 전문영어 코퍼스
- 1. The Michigan Corpus of Academic Spoken English
- 2. The British Academic Spoken English Corpus(BASEC)
- 3. The Academic Corpus
- 4. The Corpus of Professional English(CPE)
- 제9절 구문 분석 코퍼스
- 1. The Lancaster-Leeds Treebank
- 2. The Lancaster Parsed Corpus
- 3. The SUSANNE corpus
- 4. The Penn Treebank
- 제10절 Developmental and learner corpus
- 1. The Child Language Data Exchange System
- 2. The Louvain Corpus of Native English Essays
- 3. The Polytechnic of Wales corpus
- 4. The Cambridge Learner Corpus
- 5. The HKUST Corpus of Learner English
- 6. The Japanese EFL Learner corpus
- 7. The Standard Speaking Test Corpus
- 제11절 다국어 코퍼스
- 1. The English-Norwegian parallel Corpus
- 2. The English-Swedish Parallel Corpus
- 3. The Oslo Multilingual Corpus
- 4. The IJS-ELAN Slovene-English Parallel Corpus
- 5. The CLUVI parallel corpus
- 6. The PAROLE corpus
- 7. The EMILLE corpus
- 8. Hong Kong parallel Text
- 제12절 영어 이외의 단일 언어 코퍼스
- 1. The COSMAS corpus
- 2. The CETEMPúblico Corpus
- 3. The INL corpus
- 4. The CEG corpus
- 5. The Scottish Corpus of Texts and Speech
- 6. Academia Sinica Balanced Corpus
- 7. Spoken Chinese Corpus of Situated Discourse
- 제7장 결론 및 시사점
- 참고문헌
CONTENTS
- Chapter 1. Introduction
- Chapter 2. Natural language processing technology
- Part 1. Natural language processing overview
- Part 2. Morphological analysis and morphological parts
- Part 3. Entailment
- Part 4. Question Answering System
- Part 5. Information Extraction
- Part 6. Syntatic Analysis
- Part 7. Co-reference
- Part 8. Named Entity
- Part 9. Semantic Role
- Part 10. Emotional Analysis
- Chapter 3. Technology trends
- Part 1. Conference
- Part 2. Contest
- Chapter 4. Practical case study with language intelligence technology
- Part 1. Virtual Assistance
- Part 2. Social Network analytics tools
- Chapter 5. Language processing technology level
- Part 1. Types and performance of open SW
- Part 2. Research level of research institutes, universities and companies
- Chapter 6. Knowledge database status survey
- Part 1. Korean knowledge database
- Part 2. Global knowledge database
- Part 3. Monitor corpus
- Part 4. Brown Corpus
- Part 5. Disclosed Corpus
- Part 6. Transitive Corpus
- Part 7. Voice Corpus
- Part 8. Academic and professional English corpus
- Part 9. Parsing Corpus
- Part 10. Developmental and learner corpus
- Part 11. Multilingual Corpus
- Part 12. Non-English single language corpus[]
- Chapter 7. Conclusion and implications
- Reference