Special Report: The Connected Business

２０１７年２月２７日

ディープラーニングとは何か

ディープラーニングの可能性とアメリカでの実際の試みがどうであるのか

　おはようございます。

　今回から少しずつではありますが、「ディープ・ラーニング」についての情報や知識をお知らせします。

　「ディープ・ラーニング」とはAI（アーティフィシャル・インテリジェンス、人工知能）の第三次革命といわれ、それまでほとんど人力の足しになりえなかった人工知能を、生活やビジネスの実践に活用が可能となった技術の進展です。

　ビッグデータが今後非常に重要になるという記事を先日掲載しましたが、構造非構造を合わせた種々雑多なデータは、人間の力では分析しきれず、結局はある限定された分野を人力とカン、経験で突破していかなくてはなりませんでした。

もう２０年以上前の９０年代からIT革命は始まりましたが、その実は全く人間の労力や労働は減らず、むしろ処理しなければならない情報がそれまでと比べ物にならないほど大量になり、労働を増加させてきたのではないでしょうか。

　そこに２０１２年、ディープ・ラーニングという新しいAI革命がおこります。これによって、PCデバイスが人間が何も与えないでも自ら学習していくことが可能になり、本当の意味でのオートメーションが可能になりつつあります。

　今日ご紹介する「ザ・エコノミスト」の記事は、現在のAIが持つ可能性と問題点を簡潔に網羅しています。特に素人が次々とAIを盛り込んでいけるAPIが中心になること、そして、データベースの重要性、AIやビッグデータに関するスキルを持っている人材が大企業であっても大幅に不足していることなど、どの業界にいても知っておくべき基礎的な知識です。

　すべての人が新たなインフラのスタートラインに立っていることを、この記事で少しでも理解していただけると幸いです。

大げさにもてはやされているAIがそれにこたえることを示す年

大手クラウド企業とスタートアップ企業は、AI技術が約束する見通しから利益を確保を目指す

AI’s year for showing it can live up to the hype

Big cloud companies and start-ups aim to cash in on the technology’s promise

https://www.ft.com/content/b0f63ae4-b8ac-11e6-961e-a1acd97f622d

February 1, 2017

by: Richard Waters

Artificial intelligence was the technology industry’s hottest topic in 2016. This year AI will start to

show if it can live up to the hype. Until now, the potential for training computers to identify patterns

using large bodies of data has been restricted to services such as Google Photos, which can recognise

faces, and Amazon’s Alexa, a digital assistant that responds to voice commands. But the platforms

needed to make these algorithms more widely available to other companies have been taking shape

in recent months, turning them into ingredients for digital services of all kinds.

　人工知能は、２０１６年のテクノロジー業界で最もホットな話題でした。今年はAIが誇大宣伝にこたえることができるかどうかを示し始める年です。これまでは、コンピューターを訓練して、大量のデータを使用しながらパターンを特定するという潜在能力は、顔を認識できるGoogleフォトや音声コマンドに対応するデジタルアシスタントのAlexaなどのサービスに限定されていました。しかし、これらのアルゴリズムを他の企業に広く普及させるために必要なプラットフォームは、ここ数カ月の間に形をなしており、アルゴリズムをあらゆる種類のデジタルサービスの要素に変えています。

Part of the effort has involved packing smart algorithms developed for internal purposes into

services that other companies can tap into through APIs — software that lets people use and develop

automated tools over the internet — including natural language understanding, text-to-speech

translation, foreign language translation, and image and video recognition.

　この取り組みには、社内向けに開発されたスマートなアルゴリズムを、自然言語の理解、テキスト音声変換、外国語の翻訳、画像やビデオの認識など、他の企業がAPI―自動化されたツールをインターネット上で使用および開発できるソフトウェア―を通じて利用できるサービスへの組み込みがあります。

When Amazon Web Services unveiled a range of “on-demand” AI services like this for companies in

December, it capped a year of similar moves by other big tech companies. Google had set out early

in 2016 to make AI a distinguishing feature of its belated attempt to catch up with AWS in cloud

computing.

　Amazon Web Servicesが１２月に企業向けのこのような「オンデマンド」AIサービスを発表したとき、それは他の大手IT企業が同様の動きをした１年間の締めくくりとなりました。Googleは、２０１６年の初めに、クラウドコンピューティングで遅ればせながらAWSに追いつこうというものでしたが、AIを特に人目をひく目玉にすることに着手しました。

Google has said, for instance, that by using the computer vision technology developed for its

in-house services, companies in industries from agriculture to manufacturing would be able to

automate the inspection and analysis of facilities without using humans.

　たとえば、社内サービス用に開発されたコンピュータビジョン技術を使用することで、企業は農業から製造業に至る産業で、人間を介さずに施設の検査と分析を自動化できるとGoogleは述べています。

Microsoft put AI at the centre of its cloud services offering in 2016. Despite the large volumes of

data needed to train computer models and a scarcity of deep learning engineers, this is moving

beyond the realm of giant tech companies. Start-ups including Clarifai and Sentient Technologies

have built computer vision models with potentially wide applications and hope to compete with the

biggest providers.

　マイクロソフトは、２０１６年にAIをクラウドサービスの中心に据えました。コンピュータモデルの訓練に必要な大量のデータとディープラーニングエンジニアの不足にもかかわらず、これは巨大なテクノロジー企業の領域を超えつつあります。 ClarifaiやSentient Technologiesをはじめとするスタートアップ企業は、潜在的に幅広いアプリケーションを備えたコンピュータビジョンモデルを構築しており、最大手のプロバイダと肩を並べられるようになることを望んでいます。

Abby Larson and her husband Tait, who with 25 staff run a wedding blog site called StyleMePretty

that links users with selected vendors, are typical of the entrepreneurs drawn to the potential of such

services.

　Abby Larsonと夫のTait氏は、２５人のスタッフとともに、えり抜きのベンダーとユーザーを結ぶStyleMePrettyと呼ばれる結婚式ブログサイトを運営していますが、彼らはそのようなサービスの可能性に引き付けられた起業家の典型です。

Their company trawls some 12,000 wedding photos a week, says Mr Larson, categorising them for

use on the site. Tapping into Clarifai’s neural network makes it possible to automate such tasks by

machine-tagging pictures according to the season, whether there are flowers present, of if shoes

appear in them. The technology is not good enough to identify individual dresses, but at a cost of

around $500 a month it is a low-cost way of handling much greater volumes of content.

　同社は１週間に約１２、０００件の結婚式写真を探し回っている、とラーソン氏は話している。クラリファイのニューラルネットワークを利用することで、季節に応じて写真を機械的にタグ付けすることで、花があるかどうか、靴が見えているかどうかなどのタスクを自動化することができます。この技術は個々のドレスを識別するのには十分ではありませんが、月額約５００ドルを払えば、低コストではるかに多くの量のコンテンツを処理が可能な方法です。

“A human is going to be more efficient, especially in our domain — but it’s going to be an order of

magnitude more expensive,” he says. “This allows us to scale up.”

　「人間は、特に私たちのドメインではより効率的になるでしょう。しかし、それはもっと高価な注文規模になるでしょう」と彼は言います。「これにより、私たちはスケールアップすることができます。」

Conversational computing will change how people interact with phones, computers, and businesses.

These types of services charge depending on how often calls are made to the API. Though price lists

are not transparent, the leaders in the field say that costs are falling: Google, for instance, said last

year it was cutting prices for its image recognition service for its heaviest users by 80 per cent.

　会話型コンピューティングは、人々の電話やコンピュータ、ビジネスとのかかわり方を変えるでしょう。これらのタイプのサービスは、APIへの呼び出し頻度に応じて課金されます。価格表はユーザーに明らかになっていませんが、この分野のトップ企業によれば、負担は低くなってきているそうです。例えば、Googleは昨年、最も利用頻度の高いユーザーへの画像認識サービスの価格を８０％引き下げていたと述べています。

One question is whether companies have suitable data sets to mine for such purposes. Just about the

only other limiting factor, says Al Hilwa, an analyst at market researcher IDC, will be the

imagination of customers: can they think of productive uses for data collected for other purposes

once they have subjected them to new types of analysis?

　一つの疑問は、企業がそのような目的のために利用するのに適したデータセットを持っているかどうかです。市場調査会社であるIDCのアナリスト、アル・ヒルワ（Al Hilwa）氏は、他にAI開発を制限する要因があるとすれば、顧客の想像力だけだろうと述べています。すなわち、顧客は、別の目的のために収集されたデータでひとたび新しいタイプの分析を行ってしまうと、それらを生産的な活用に用いることを考えられるのでしょうかということです。

The big cloud computing companies have also released tools for developers to train algorithms using

their own data. These “machine learning in the cloud” services promise to turn machine learning into

a general-purpose technology with wide application.

　大手クラウドコンピューティング企業は、開発者が独自のデータを使用してアルゴリズムをトレーニングするためのツールもリリースしました。これらの「クラウドでの機械学習」サービスは、機械学習を幅広いアプリケーションで汎用技術に変えることを約束しています。

This will be where customers stand to see the biggest benefits from the new breakthroughs in AI,

says Mr Hilwa. But learning how to train and validate the algorithms will take time. And, he adds, a

skills shortage among the data scientists could put a brake on progress.

　ここがまさにAIの新たなブレークスルーの最大のメリットだと顧客が考えている地点でしょうとヒルワ氏は語ります。しかし、アルゴリズムを訓練して検証する方法を学ぶには時間がかかります。そして、彼はデータをあつかう科学者のスキル不足が進歩にブレーキをかける可能性があると付け加えています。

One of the biggest promises of the AI wave has been the advent of conversational computing — the

prospect of using language to control and interact with computers. Chatbots, automated text-based

messaging services designed to respond to human queries — often simple, narrowly defined tasks —

have been one early manifestation. Another has been the introduction of more complex digital

assistants that respond to voice commands, such as Alexa, Apple’s Siri and Microsoft’s Cortana.

　AIの波が約束する最大のものの1つは、会話型コンピューティングの出現であり、言語を使用してコンピュータを制御し、対話するという展望です。チャットボット、人間の質問に応答するように設計された自動テキストベースのメッセージングサービス（単純で定義の狭いタスクの場合が多い）は、早期に実現されたものです。もう1つは、Alexa、AppleのSiri、MicrosoftのCortanaなどの音声コマンドに対応するより複雑なデジタルアシスタントの導入です。

But companies experimenting with the technology have had mixed experiences. Chatbots were a

tech disappointment in 2016, a reminder of the risks of overhyping technologies. But the benefits

that could come from talking with computers make this a field ripe for innovation in 2017.

　しかし、この技術を試している企業には、複雑な経験があります。チャットボットは２０１６年、技術的には失望させるものであり、過剰な技術のリスクを思い出させるものでした。しかし、コンピュータとの会話で得られるメリットのおかげで、この技術は２０１７年のイノベーションにとって機の熟した領域となります。

Facebook’s launch last year of a chatbot platform — to which external companies could add their

own voice assistants — demonstrated both the shortcomings of the technology and the potential

demand. Banks, power companies and others developed automated assistants to handle customer

service or simple transactions, but consumers often found it hard to get answers to questions and

early reviews were damning. By the end of the year, however, more than 30,000 chatbots were using

the platform, and Facebook claimed the early teething problems were already being overcome.

　外部企業が自社の音声アシスタントを追加できるチャットボットプラットフォームを、昨年Facebookが発売しましたが、これによってチャットボットの技術の欠陥と潜在的な需要の両方が存在することが明らかになりました。銀行、電力会社などは、顧客サービスやかんたんな取引を処理するための自動化されたアシスタントを開発しましたが、消費者は質問への回答を得るのが難しいことが多く、始めのころは手厳しい報告がもたらされました。しかし年末までに３０，０００を超えるチャットボットがプラットフォームを使用していましたし、Facebookは初期に起こった問題はすでに克服されつつあると主張しています。

Companies that have been working with the technology say there are signs it will change how people

interact with phones and computers, and the businesses whose digital services they use.

　この技術を利用している会社によれば、人々が電話やコンピュータとどのようにやりとりするかや、彼らがデジタルサービスを利用している企業を変える兆しが見えるとのことです。

Adam Goldstein, chief executive of Hipmunk, a travel-planning service, was an early user of the bot

platform developed by Microsoft’s Skype messaging service. “We were surprised about the different

types of question people ask a bot,” says Mr Goldstein.

　旅行プランニングサービスのHipmunkのCEO、Adam Goldsteinは、MicrosoftのSkypeメッセージングサービスによって開発されたボットプラットフォームの初期ユーザーでした。ゴートスタイン氏は、「人々がボットに尋ねるさまざまな種類の質問に私たちは驚きました」と語ります。

When customers use a standard search form, he says, they tend to focus on the specific: on a travel

site, they look for flights between certain cities on particular dates. But with a bot they ask more

general research questions about where they should go or when.

　顧客が標準的な検索フォームを使用する場合、特定の都市に焦点を当てる傾向があります。顧客は、特定の都市間のフライトを検索旅行サイトでします。しかし、ボットでは、いつどこに行くべきか、より一般的な検索質問をしています。

Two weaknesses have held back the promise of conversational computing. One involves the

difficulty of training general-purpose systems that can work in everyday situations — a problem

Microsoft demonstrated in 2016, when a chatbot it released on Twitter was misled into producing

racist responses.

　会話型コンピューティング成功の見込みは、２つの弱点のために後退してしまいました。１つは、日常的な状況で機能する汎用システムを訓練することの難しさ、すなわち、２０１６年にMicrosoftが明らかにした問題でした。MicrosoftがTwitterで発表したチャットボットが判断を誤らされ、人種差別的な反応を引き起こしてしまったのです。

The other is the ability of chatbots to understand context, a problem for computers once they expand

beyond a narrow field of knowledge. Software companies have responded by limiting the

technology’s uses for now. Increasingly more new business software will include some level of

conversational AI, though it will be limited, says Tom Austin, an analyst at Gartner, a research

company.

　もう一つは、チャットボットが文脈を理解する能力がそれで、コンピュータが狭い知識の分野を超えてしまうと発生する問題です。ソフトウェア企業は現在、その技術の使用を制限することで対応してきました。リサーチ会社ガートナーのアナリスト、トム・オースティン氏によると、制限はされますが、ますます多くの新しいビジネス・ソフトウェアにはある程度会話型AIが含まれているといいます。

How much this is used and how useful it will be is open to question. “We want to be able to walk

into the office and say, ‘What the hell happened to Harry’s order?,’” says Mr Austin. “The

technology won’t be able to tell you in 2017, but it will start to ask you questions.” The answers it

receives, he adds, will help it to find its way to the right response.

　会話型コンピューティングがどのくらい使用され、どれだけ有用なのかは疑問を免れません。「私たちはオフィスに入って「ハリーの指示はいったいどうなったの？」と言うことができるようにしたいと思っています」とオースティン氏は言います。「この技術がいかなるものであるかは２０１７年にはわかりません。しかしあなたに質問を投げかけはじめ、あなたから受け取った答えは、正しい回答への道を見つけるのに役立つでしょう。」

But the lag in the uptake of new enterprise technology means it will probably be 2020 before people

begin to use conversational AI in large numbers, he says.

　しかし、新しい企業テクノロジーを導入するのが遅れれば、２０２０年までに多くの人が会話型AIを使用し始めるようになることはまずありえないでしょう、と彼は述べています。

　編集部からのコメントです。

　いかがでしたでしょうか。後半は、AIがもたらす約束で最も重要なものとして、会話型コンピューティングについて説明されています。記事ではチャットボットで例証されていますが、これは以前にご紹介したアマゾン・エコーというオーディオデバイスとして発売されたものを指します。Alexaのことです。

　音声認証のディープ・ラーニングは、画像認証の次の上の段階の高度なものですが、それを特にAmazonが克服しつつあります。この音声認証こそが、家庭から医療、福祉、など生活の様々な実感の伴った局面で大活躍する試みです。医療現場には、スマートフォンの問診からすぐにでも始められる試みといえます。

　この中には以前にご紹介した「フィンテック」（金融IT技術革命）におけるP2PやPSSと同じ発想があります。企業内のために開発したアルゴリズムを、オープンなAPIによって、素人でも自社のサービスに組み込むことができ、いわゆる「使った分だけ」課金される仕組みというものです。

　こうして、AI技術の導入は業界や個人、企業、グループ、他業種にまたがった「エコシステム」と協働消費を可能にしていくことがその本質なのです。

　そして、こうしたAI技術も、データをあつかう科学者のスキル不足が進歩にブレーキをかける可能性があるという一言が、非常に重くのしかかってきます。これは科学者や技術者、エンジニアの問題ではありません。そのアルゴリズムやプログラミングをするスキルのことではなく、大量のデータを新たな方式で分析したのはいいが、それをどうすればいいのかわからないという「スキルの不足」のことです。

　つまるところ企業の「ポリシー・メイキング」のことを表します。自社自らがビジョンを立てて政策を立て、商業ベースに乗せていくということをできる人材が圧倒的に少ないということです。

　日々の作業や労働、タスクの処理、果ては長時間労働や休日出勤に自らの「使命」を感じ、「やり切った」感覚は、私たち「労働者」の当然の感覚であり、プライドでもあります。

　しかし、現在そうした「労働者」（レイバーズ）であることは、AIの登場により、人間にとって全く必要のない要素となりつつあります。そうした「労働」は機械に任せなくてはならないということです。そしてそれが可能な時代を作らなければならないということです。

　そこで私たちはそうした岐路に立たされて、迷っている状態なのだと思います。逆を返してみれば、私たちは新たなパラダイムのスタート地点に立っているということです。

　当社ホームページの「ITの最新情報」は、これまでハイパーループやAmazon Goといった具体的事例を主に取り扱ってきましたが、これからAIと医療分野でのはたらしい試みと問題点を取り上げていきます。

　どうぞご期待ください。