Zoom
Trash
Related: DATA SCIENTIST
大数据领域33个预测,开启未知的2016 转自: 瀚思安信(微信ID: HanSight) 英文原文: 导读:2016年大数据领域会发生什么情况?考虑到如今在深层神经网络和规范性分析方面取得的进展,你可能觉得这个问题很好回答。而实际上,来自业界的大数据预测大不相同,本文精选出了最值得关注的33个预测,为您开启未知的2016! 数据平民崛起 甲骨文公司预测一种新型用户:数据平民(Data Civilian)会崛起。 “大数据”会消亡 Nucleus Research公司公开发表了不同意见,预测我们所知道的大数据会消亡。 风险投资公司更关注大数据给出的结论 据Opera Solutions公司的高级副总裁Keri Smith声称,由于风险投资公司往数据初创公司纷纷投入资金,是时候开始提出尖锐的问题了。 机器学习和人的洞察力组合渗透新行业 Spare5公司的首席执行官Matt Bencke表示,我们在2016年会看到数据绝地武士(Data Jedis)的兴起。 数据科学再银行界大放光彩 数据科学咨询公司Profusion的首席执行官Mike Weston预测,数据科学在银行界会大放光彩。 人工智能和认知计算让个性化医疗成为现实 先进的人工智能引起机器人成为统治者,这种场景吓坏了Elon Musk。 首席数据官将成为信息技术领域的“新宠儿” Blazent公司首席技术官办公室负责人Michael Ludwig认为,首席数据官(CDO)会成为信息技术领域的“新宠儿”,永远让办公室政治更显错综复杂。 首席洞察官成为大数据整理过程的关键领导者 但不是每个人都这么认为,其中包括PROS公司的首席远见官Craig Zawada。 云服务被充分利用 但是颇有势力的CIO能重新发号施令吗? 流分析逐渐成熟 DataTorrent公司的首席执行官兼联合创始人Phu Hoang预测,流分析(streaming analytics)会开始成熟起来,并在大数据阵营中证明其价值。 实时分析异常火爆 实时分析在明年会很火爆,这个我们懂。 大数据让娱乐更加“娱乐” 喜欢鼓乐? 物联网影响半导体行业 物联网会如何影响半导体行业? 机器学习、大数据自动化和人工智能大放异彩 Infosys公司高级副总裁兼平台、大数据和分析部门主管Abdul Razack表示,机器学习、大数据自动化和人工智能在2015年大出风头,明年会出更大的风头。 合并兴起的关键年 外包大行其道
Search Engines & Discovery Tools This is a page of the DIRECTORY OF LEARNING & PERFORMANCE TOOLSIf you would like to submit a tool for this Directory, please use this form. Top Tools 2014 Free Tool 43 marks : Start your search here. Ambiently : Semantically connects webpages of similar or related meanings together. Bing : Microsoft’s search engine. Boolify : Makes it easier for students to understand their web search by illustrating the logic of their search, and by showing them how each change to their search instantly changes their results. ButtonALL : All your favourite search engines and reference websites aggregated on ONE page. BuzzzyA search engine for Google Buzz. DeeperWeb : Search engine tool for Google users that allows navigating through search results by employing Tag Cloud techniques. dogpile : Meta search engine. Factbites : Where results make sense. Find that File : comprehensive file search for the Internet. Google Search : Powerful search engine. Gravee : The social search and recommendation engine.
IBM's Watson to help sequence cancer DNA IBM is teaming up with the New York Genome Center to help fight brain cancer. The company said Wednesday that its Watson cloud computing system will be used in partnership with a New York-based genetic research center mainly to help sequence DNA for the treatment of glioblastoma, the most common type of brain cancer in U.S. adults. New York Genome Center, a consortium of academic, medical and industry officials, will use Watson to sequence the DNA of cancer tumors at much faster rate than would be possible if done by a human being. The DNA information would then be combined with clinical information and fed to Watson to help determine the best way to treat a particular patient. What makes Watson unique is that it isn't programed like most computers. Instead of relying on the information that's put into it, Watson learns by "reading" vast amounts of information and combining it with the results of previous work to find answers to problems. Dr. Armonk, N.Y.
人人都可成為資料科學大師!一整年的網路自學清單就在這了 本文由微信公眾號「大數據文摘」授權轉載,選文:孫強,翻譯:趙娟、王珏。大數據文摘微信 ID:BigDataDiagest。原文標題為〈New Year Resolutions for a Data Scientist〉,作者/ MANISH SARASWAT,以下為作者第一人稱描述。 新年並非僅僅是更換日曆或是清晨起床後揉開雙眼。新年是充滿喜悅的一個嶄新開始。 如果你正在閱讀這篇文章,我確信資料科學會讓你興奮! 註:這些通用的學習計畫是為有抱負的 / 有經驗的資料科學家準備的。 我已經將這些學習計畫根據資料科學家的三個水準階段進行了分類。 初級水準 誰是初學者? 1. 我曾看到有學生同時學習 R 和 Python。 學習課程:在Codecademy完成 Python 的學習。 2. 統計學是關於假設和運算的學科。 學習課程:在 Udacity 上完成 Inferential和Descriptive統計學習。 3. 大型開放式網路課程(簡稱 MOOC)可以自由訪問和學習。 學習課程:在Coursera完成資料科學專業(R)的學習。 4. 你需要知道這個行業正在發生哪些變化。 中級水準 誰是中等水準的資料科學家? 1. 機器學習是資料科學與技術的未來。 學習課程:在Andrew Ng完成機器學習的課程。 2. 一旦你對機器學習充滿自信,那麼轉攻下一個模型吧。 學習課程:閱讀 Kaggle 的 Ensembling 指南。 3. 本年,你將開啟自己的大數據之旅。 學習課程:Spark 4. 還有什麼比知識分享更棒! 5. 是時候檢驗你的學習效果了。 行動指南:加入Kaggle 和Data Hack。 高級水準 我無需定義這類人群。 1. 今年,你要為立志成為資料科學家的人樹立榜樣。 學習課程:完成Tutorial 的深度學習。 2. 我相信知識是用於分享而不是用於存起來放的。 行動計畫:在Discuss上分享你的知識。 3. 強化學習是(Reinforcement Learning)機器學習中最強大的,然而少有人開發的一個分支。 學習課程:完成 Andrew Moore 的Tutorial。 4. 今年,你必須保持住在 Kaggle 上的「大師」地位,準確的講,確保自己在 Kaggle 排名進入前 50。 行動計畫:加入Kaggle 追蹤你的進程。 結束語 我理解,這些學習計畫對你具有挑戰性,但值得一試。 這篇文章已經為你制定新年計畫掃除了障礙。
The Evolution of Web Search:From Real-Time Discovery to Collaborative Web Search - HeyStaks Certainly the world of the Web has changed dramatically since 2000, and search engine technology has evolved through a variety of phases. For example, in the pre-Google dawn (Search 1.0), search engines were guided primarily by the words in a page, their location and how they matched the query terms. Google’s great innovation was to demonstrate how search quality could be greatly enhanced by harnessing a new relevance signal: the links between pages. Google’s link analysis technology (PageRank) interpreted links to a page as votes and PageRank was a clever way of counting such votes to effectively compute an authority score for each page, which could then be used during result ranking.As an aside, back in the late 1990’s one of Google’s fellow innovators was a company called Direct Hit, which also argued for the need for new relevance signals. Figure 1. Figure 2. The Rise of the Social Web If search has remained stable the Web has not. Taming the Real-Time Web
Tackling the Challenges of Big Data | 应对大数据挑战 This is an Archived Course EdX keeps courses open for enrollment after they end to allow learners to explore content and continue learning. All features and materials may not be all available. Check back often to see when new course start dates are announced. *This course will be run primarily in English, but will have assessments, video transcripts, and limited course content (documents, instructions) in Simplified Chinese. This Digital Programs course will survey state-of-the-art topics in Big Data, looking at data collection (smartphones, sensors, the Web), data storage and processing (scalable relational databases, Hadoop, Spark, etc...), extracting structured data from unstructured data, systems issues (exploiting multicore, security), analytics (machine learning, data compression, efficient algorithms), visualization, and a range of applications. Each module will introduce broad concepts as well as provide the most recent developments in research. Participants may include: 学员可能包括: 常见问题
Yandex.Company — Yandex Blog Yandex has always been there to give answers. But then there comes a moment when giving answers becomes ‘not enough’. A large portion of the five billion searches that come from almost 100 million Yandex users each month are looking for a solution rather than an answer – people want to pay a bill, check-in for a flight, book a show or a visit to the dentist. Using all our knowledge and expertise, all these years we have been developing products and technologies to figure out what it is exactly that our visitors are looking for and offer this to them in our search results. We have been thinking about offering web users a shortcut precisely to what they need, and we came up with a new concept of a goal-oriented search platform. Yandex's new search platform embodies our new approach to search and website engagement, and features an attention-based search results page, where each search result is a standalone block of interactive information – an Island.
Digitizing the delivery of government services An agile development approach was critical in a European agency’s launch of a new online system for registering businesses. Here’s how the agency moved from paper to pixels. Government agencies around the world are under internal and external pressure to become more efficient by incorporating digital technologies and processes into their day-to-day operations. For a lot of public-sector organizations, however, the digital transformation has been bumpy. A few, however, have been able to change mind-sets internally, shed outdated approaches to developing new processes and systems, and build new ones. The Danish Business Authority is one of those organizations. Initial efforts to build the digital-registration system stalled under the agency’s traditional waterfall approach to systems design and development. By 2014, the system was nearly complete, and by 2015, the number of registrations requiring agency support for completion had dropped from 70 percent to 30 percent. Agility delivers