首页情报学
0


敞开来源的情报

Open Source Intelligence
课程网址: http://videolectures.net/mmdss07_best_osi/  
主讲教师: Clive Best
开课单位: 联合研究中心
开课时间: 2007-12-03
课程语种: 英语
中文简介:
开源智能可以定义为从公开来源获取,提取和分析信息。这三个过程中的每一个都是正在进行的研究的主题,从而产生专门技术。今天,最大的开源信息来源是互联网。大多数报纸和新闻机构都有网站,其中包含有关世界事件的展开活动,观点和观点的实时更新。大多数政府监督新闻报道,以感受公众舆论的脉搏,以及对新出现的危机的早期预警和当前意识。互联网上发布的知识,数据和意见的显着增长需要先进的软件工具,使分析师能够应对信息溢出。恶意使用互联网也迅速增长,特别是在线欺诈,非法内容,虚拟跟踪和各种诈骗。这些都给安全和执法机构带来了重大挑战。极端主义和恐怖主义团体使用互联网的情况出现了惊人的增长。恐怖主义链接网站的数量已从1998年的约15个增加到今天的约4500个。这些网站使用光滑的多媒体来宣传宣传,其主要目的是1)激发和激起嵌入式社区的反叛2)在“敌人”中灌输恐惧。并打击心理战。通过公告板,聊天室和电子邮件在恐怖组织之间进行匿名通信也很普遍。联合研究中心通过其为欧盟委员会进行的媒体监测(EMM)工作,在互联网内容监测方面积累了丰富的经验。 EMM是委员会每日新闻监测服务的核心,也已被欧洲理事会情况中心采用其ODIN系统。 JRC的一个新研究课题是Web挖掘和开源智能。这将EMM技术应用于更广泛的互联网,而不仅仅是新闻网站。这适用于先进的多语言搜索技术,以识别潜在的网络资源以及所有文本内容的提取和下载。然后是自动变化检测,地点,名称和关系的识别,以及对结果大量文本的进一步分析。这些工具可帮助分析人员处理大量文档,并使结构化数据更易于分析。本演讲将回顾4个主题:•互联网趋势和Web 2.0用户快速崛起产生的内容和牛市;信息检索:多语言新闻报道的实时内容监控。网络抓取& RSS feed生成,Web挖掘和内容监控•信息提取:主题过滤,主题聚类,多语言命名实体提取,地理编码和地理定位文本,事件提取,意见挖掘。 &公牛;信息分析:社交网络推导,地理空间索引和分析,事件跟踪数据库,统计趋势分析,威胁监测和评估。
课程简介: Open Source Intelligence can be defined as the retrieval, extraction and analysis of information from publicly available sources. Each of these three processes is the subject of ongoing research resulting in specialised techniques. Today the largest source of open source information is the Internet. Most newspapers and news agencies have web sites with live updates on unfolding events, opinions and perspectives on world events are published. Most governments monitor news reports to feel the pulse of public opinion, and for early warning and current awareness of emerging crises. The phenomenal growth in knowledge, data and opinions published on the Internet requires advanced software tools which allow analysts to cope with the overflow of information. Malicious use of the Internet has also grown rapidly particularly on-line fraud, illegal content, virtual stalking, and various scams. These are all creating major challenges to security and law enforcement agencies. The alarming increase in the use of the Internet by extremist and Terrorist groups has emerged. The number of terrorist linked websites has grown from about 15 in 1998 to some 4500 today. These sites use slick multimedia to distil propaganda whose main purpose is to 1) enthuse and stir up rebellion in embedded communities 2) instill fear in the “enemy” and fight psychological warfare. Anonymous communication between terrorist cells via bulletin boards, chat rooms and email is also prevalent. The Joint Research Centre has developed significant experience in Internet content monitoring through its work on media monitoring (EMM) for the European Commission. EMM forms the core of the Commissions daily press monitoring service, and has also been adopted by the European Council Situation Centre for their ODIN system. A new research topic at the JRC is Web mining and open source intelligence. This applies EMM technology to the wider Internet and not just to news sites. This applies advanced multi-lingual search techniques to identify potential web resources and the extraction and download of all the textual content. This is then followed by automatic change detection, the recognition of places, names and relationships, and further analysis of the resultant large bodies of text. These tools help analysts to process large amounts of documents and derive structured data easier to analyse. This talk will review 4 main topics: • Internet trends and the rapid rise of Web 2.0 user generated content • Information retrieval: Live content monitoring of multilingual news reports. Web scraping & RSS feed generation, Web Mining and content monitoring • Information Extraction: Topic filtering, Topic Clustering, multilingual named entity extraction, geocoding and geolocating text, event extraction, opinion mining. • Information Analysis: Social Network derivation, geospatial indexing and analysis, incident tracking databases, statistical trend analysis, threat monitoring and assessment.
关 键 词: 公开源情报; 检索; 源代码
课程来源: 视频讲座网
最后编审: 2020-09-28:yumf
阅读次数: 84