Friday, October 30, 2020

Training and serving H2O models using Amazon SageMaker

Model training and serving steps are two essential pieces of a successful end-to-end machine learning (ML) pipeline. These two steps often require different software and hardware setups to provide the best mix for a production environment. Model training is optimized for low cost, a feasible total run duration, scientific flexibility, and model interpretability, whereas model serving is optimized for low cost, high throughput, and low latency.

Therefore, a widespread approach is to train a model with a popular data science language like Python or R, export the model artifact in a format such as Model Object, Optimized (MOJO), Predictive Model Markup Language (PMML), or Open Neural Network Exchange (ONNX), and serve the model on a microservice (e.g., a Spring Boot application) based on Open Java Development Kit (OpenJDK).

This post demonstrates how to implement this approach end-to-end using Amazon SageMaker for the popular open-source ML framework H2O. Amazon SageMaker is a fully managed service that provides every developer and data scientist the ability to build, train, and deploy ML models quickly. Amazon SageMaker is a versatile ML service, which allows you to use ML frameworks and programming languages of your choice. H2O was founded by H2O.ai, an AWS Partner Network (APN) Advanced Partner. You can choose from a wide range of options to train and deploy H2O models on the AWS Cloud, and H2O provides some design pattern examples to productionize H2O ML pipelines.

The H2O framework supports three types of model artifacts, as summarized in the following table.

| Dimension | Binary Models | Plain Old Java Object (POJO) | Model Object, Optimized (MOJO) |
| Definition | The H2O binary model is intended for non-production ML experimentation with the features supported by a specific H2O version. | A POJO is an ordinary Java object, not bound by any special restriction. It’s a way to export a model built in H2O and implement it in a Java application. | A MOJO is also a Java object, but the model tree is stored outside the object, and generic tree-walker code navigates the model. This allows model artifacts to be much smaller. |
| Use case | Intended for interactive ML experimentation. | Suitable for production usage. | Suitable for production usage. |
| Deployment Restrictions | The model hosting image should run an H2O cluster and the same h2o version as the binary model. | 1 GB maximum model artifact file size restriction for H2O. | No size restriction for H2O. |
| Inference Performance | High latency (up to a few seconds); not recommended for production. | Only slightly faster than MOJOs for binomial and regression models. Latency is typically in single-digit milliseconds. | Significant inference efficiency gains over POJOs for multinomial and large models. Latency is typically in single-digit milliseconds. |

During my trials, I explored some of the design patterns that Amazon SageMaker manages end to end, summarized in the following table.

| ID | Design Pattern | Advantages | Disadvantages |
| A | Train and deploy the model with the Amazon SageMaker Marketplace algorithm offered by H2O.ai. | No effort is required to create any custom container and Amazon SageMaker algorithm resource. | An older version of the h2o Python library is available. All other disadvantages in option B also apply to this option. |
| B | Train using a custom container with the h2o Python library. Export the model artifact as H2O binary model format. Serve the model using a custom container running a Flask application and running inference by h2o Python library. | It’s possible to use any version of the h2o Python library. | H2O binary model inference latency is significantly higher than MOJO artifacts. It’s prone to failures due to h2o Python library version incompatibility. |
| C | Train using a custom container with the h2o Python library. Export the model in MOJO format. Serve the model using a custom container running a Flask application and running inference by pyH2oMojo. | Because MOJO model format is supported, the model inference latency is lower than option B and it’s possible to use any version of the h2o Python library. | Using pyH2oMojo has a higher latency and it’s prone to failures due to weak support for continuously evolving H2O versions. |
| D | Train using a custom container with the h2o Python library. Export the model in MOJO format. Serve the model using a custom container based on Amazon Corretto running a Spring Boot application and h2o-genmodel Java library. | It’s possible to use any version of h2o Python library and h2o-genmodel libraries. It offers the lowest model inference latency. | The majority of data scientists prefer using only scripting languages. |

It’s possible to add a few more options to the preceding list, especially if you want to run distributed training with Sparkling Water. After testing all these alternatives, I have concluded that design pattern D is the most suitable option for a wide range of use cases to productionize H2O. Design pattern D consists of a custom model training container with the h2o Python library and a custom model inference container with a Spring Boot application and the h2o-genmodel Java library. The subsequent sections of this post show how to build an ML workflow based on this design pattern.

Problem and dataset

You can use the Titanic Passenger Survival dataset, which is publicly available thanks to Kaggle and encyclopedia-titanica, to build a predictive model that answers what kind of people are more likely to survive a catastrophic shipwreck. It uses 11 independent variables such as age, gender, and passenger class to predict the binary classification target variable Survived. For this post, we split the original training dataset 80%/20% to create the train.csv and validation.csv input files. The datasets are located under the /examples directory of the parent repository. To train a model with the Gradient Boosting Machines (GBM) algorithm in the H2O framework, this dataset requires feature preprocessing operations such as imputing null values for the Age feature and string indexing for the Sex and Embarked features.
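The two preprocessing operations named above can be sketched in plain Python. This is a minimal illustration using only the standard library, not the code from the parent repository; the function names, the use of the median as the imputation value, and the sorted-order index assignment are all assumptions made for the example.

```python
import statistics


def impute_age(rows):
    """Fill missing Age values with the median of the known ages (assumed strategy)."""
    known = [float(r["Age"]) for r in rows if r.get("Age") not in (None, "")]
    median_age = statistics.median(known)
    return [
        {**r, "Age": float(r["Age"]) if r.get("Age") not in (None, "") else median_age}
        for r in rows
    ]


def string_index(rows, column):
    """Replace each distinct string in `column` with an integer index.

    Indices follow the sorted order of the distinct values, so the mapping is
    deterministic. The mapping is returned so it can be reused at inference time.
    """
    mapping = {value: i for i, value in enumerate(sorted({r[column] for r in rows}))}
    return [{**r, column: mapping[r[column]]} for r in rows], mapping
```

Calling impute_age() first and then string_index() once for Sex and once for Embarked yields rows with numeric values only, together with the two mappings that the serving side would need to apply to incoming requests.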

Overview of solution

The solution in this post offers an ML training and deployment process orchestrated by AWS Step Functions and implemented with Amazon SageMaker. The following diagram illustrates the workflow.

This workflow is developed using a JSON-based language called Amazon States Language (ASL). The Step Functions API provides service integrations to Amazon SageMaker, child workflows, and other services.
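To make the shape of such a definition concrete, the following sketch builds a two-state ASL document as a Python dictionary and serializes it to JSON. It is not the state machine shipped with this post: the state names are borrowed from the console steps described later, the input paths such as $.tuningJobName are hypothetical, and the tuning job parameters are heavily abbreviated. The two Resource ARNs are the Step Functions service integrations for SageMaker hyperparameter tuning jobs and for child workflow executions.

```python
import json


def build_definition(child_state_machine_arn):
    """Return an abbreviated ASL definition: tune a model, then run a child workflow."""
    return {
        "Comment": "Illustrative sketch only, not the workflow from the post",
        "StartAt": "Model Tuning Job",
        "States": {
            "Model Tuning Job": {
                "Type": "Task",
                # .sync makes Step Functions wait until the tuning job finishes
                "Resource": "arn:aws:states:::sagemaker:createHyperParameterTuningJob.sync",
                "Parameters": {
                    # Hypothetical input path; the tuning job configuration and
                    # training job definition are omitted for brevity
                    "HyperParameterTuningJobName.$": "$.tuningJobName",
                },
                "ResultPath": "$.tuningResult",
                "Next": "Deploy Auto-scaling Model Endpoint",
            },
            "Deploy Auto-scaling Model Endpoint": {
                "Type": "Task",
                # Child workflow integration; modeling this step as a child
                # workflow is an assumption of the sketch
                "Resource": "arn:aws:states:::states:startExecution.sync",
                "Parameters": {
                    "StateMachineArn": child_state_machine_arn,
                    "Input.$": "$",
                },
                "End": True,
            },
        },
    }


def to_asl_json(definition):
    """Serialize the definition to the JSON text that Step Functions accepts."""
    return json.dumps(definition, indent=2)
```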

Two Amazon Elastic Container Registry (Amazon ECR) images contain the code mentioned in design pattern D:

  • h2o-gbm-trainer – H2O model training Docker image running a Python application
  • h2o-gbm-predictor – H2O model inference Docker image running a Spring Boot application

The creation of a manifest.json file in an Amazon Simple Storage Service (Amazon S3) bucket initiates an event notification, which starts the pipeline. This file can be generated by a prior data preparation job, which creates the training and validation datasets during a periodic production run. Uploading this file triggers an AWS Lambda function, which collects the ML workflow run duration configurations from the manifest.json file and AWS Systems Manager Parameter Store and starts the ML workflow.
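A Lambda function of this kind could look roughly like the following sketch. It is an assumption-laden illustration rather than the function deployed by this post: the environment variable names, the Parameter Store path, and the rule that manifest values override Parameter Store values are all hypothetical. The boto3 calls used (get_object, get_parameters_by_path, start_execution) are standard; pagination of the Parameter Store response is ignored for brevity.

```python
import json
import os
from urllib.parse import unquote_plus

# Hypothetical configuration names, chosen for this sketch only
STATE_MACHINE_ARN = os.environ.get("STATE_MACHINE_ARN", "")
PARAMETER_PATH = os.environ.get("PARAMETER_PATH", "/ml-workflow/")


def manifest_location(event):
    """Extract (bucket, key) of the uploaded object from an S3 event notification."""
    record = event["Records"][0]["s3"]
    # Object keys arrive URL-encoded in S3 event notifications
    return record["bucket"]["name"], unquote_plus(record["object"]["key"])


def build_workflow_input(manifest, parameters):
    """Merge both configuration sources; manifest values win (assumed precedence)."""
    return {**parameters, **manifest}


def handler(event, context, s3=None, ssm=None, sfn=None):
    """Read the manifest, collect defaults, and start the Step Functions workflow."""
    if s3 is None or ssm is None or sfn is None:
        import boto3  # imported lazily so the helpers work without the AWS SDK

        s3 = s3 or boto3.client("s3")
        ssm = ssm or boto3.client("ssm")
        sfn = sfn or boto3.client("stepfunctions")

    bucket, key = manifest_location(event)
    manifest = json.loads(s3.get_object(Bucket=bucket, Key=key)["Body"].read())
    response = ssm.get_parameters_by_path(Path=PARAMETER_PATH)
    parameters = {p["Name"].rsplit("/", 1)[-1]: p["Value"] for p in response["Parameters"]}
    execution = sfn.start_execution(
        stateMachineArn=STATE_MACHINE_ARN,
        input=json.dumps(build_workflow_input(manifest, parameters)),
    )
    return execution["executionArn"]
```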

Prerequisites

Make sure that you complete all the prerequisites before starting deployment. Deploying and running this workflow involves two types of dependencies:

Deploying the ML workflow infrastructure

The infrastructure required for this post is created with an AWS CloudFormation template compliant with the AWS Serverless Application Model (AWS SAM), which simplifies how to define functions, state machines, and APIs for serverless applications. I estimate that a test run costs less than $1 in the eu-central-1 Region. For installation instructions, see Installation.

The deployment takes approximately 2 minutes. When it’s complete, the status switches to CREATE_COMPLETE for all stacks.

The nested stacks create three serverless applications:

Creating a model training Docker image

Amazon SageMaker launches this Docker image on Amazon SageMaker training instances at runtime. It’s a slightly modified version of the open-sourced Docker image repository by our partner H2O.ai, which extends the Amazon Linux 2 Docker image. Only the training code and its required dependencies are preserved; the H2O version is upgraded and functionality to export MOJO model artifacts is added.

Navigate to the h2o-gbm-trainer repository in your command line. Optionally, you can test it on your local PC. Build and deploy the model training Docker image to Amazon ECR using the installation command.
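The core of such a training application can be sketched with the h2o Python library as follows. This is not the code in the h2o-gbm-trainer repository: the channel names (training, validation), the three hyperparameters, and their default values are assumptions made for the example. The /opt/ml directory layout is the one Amazon SageMaker provides to training containers, and download_mojo() is the h2o call that writes the MOJO artifact.

```python
import json
import os

PREFIX = "/opt/ml"  # directory layout that SageMaker mounts into training containers


def load_hyperparameters(path):
    """SageMaker passes every hyperparameter as a string, so cast them explicitly."""
    with open(path) as f:
        raw = json.load(f)
    return {
        "ntrees": int(raw.get("ntrees", 50)),  # default values are assumptions
        "max_depth": int(raw.get("max_depth", 5)),
        "learn_rate": float(raw.get("learn_rate", 0.1)),
    }


def train(prefix=PREFIX):
    """Train a GBM on the Titanic channels and export it as a MOJO artifact."""
    import h2o  # imported lazily; requires the h2o package and a Java runtime
    from h2o.estimators import H2OGradientBoostingEstimator

    h2o.init()
    # Channel directory names below are assumed, not taken from the repository
    train_frame = h2o.import_file(os.path.join(prefix, "input/data/training/train.csv"))
    valid_frame = h2o.import_file(os.path.join(prefix, "input/data/validation/validation.csv"))
    for frame in (train_frame, valid_frame):
        # Treat the target as categorical so H2O fits a binary classifier
        frame["Survived"] = frame["Survived"].asfactor()

    features = [c for c in train_frame.columns if c != "Survived"]
    params = load_hyperparameters(os.path.join(prefix, "input/config/hyperparameters.json"))
    model = H2OGradientBoostingEstimator(**params)
    model.train(x=features, y="Survived", training_frame=train_frame, validation_frame=valid_frame)

    # SageMaker packages everything under /opt/ml/model into model.tar.gz
    return model.download_mojo(path=os.path.join(prefix, "model"))
```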

Creating a model inference Docker image

Amazon SageMaker launches this Docker image on Amazon SageMaker model endpoint instances at runtime. The image extends the Amazon Corretto Docker image (amazoncorretto:8) to provide the Amazon Linux 2 dependencies and Java settings required to launch a Spring Boot application.

Depending on an open-source distribution of OpenJDK has several drawbacks, such as backward incompatibility between minor releases, delays in bug fixing, security vulnerabilities like backports, and suboptimal performance for a production service. Therefore, I used Amazon Corretto, which is a no-cost, multiplatform, secure, production-ready downstream distribution of the OpenJDK. In addition, Corretto offers performance improvements (AWS Online Tech Talk) with respect to OpenJDK (openjdk:8-alpine), which are observable during the Spring Boot application launch and model inference latency. The Spring Boot framework is preferred to build the model hosting application for the following reasons:

  • It’s easy to build a standalone production-grade microservice
  • It requires minimal Spring configuration and easy deployment
  • It’s easy to build RESTful web services
  • It scales the system resource utilization according to the intensity of the model invocations

The following image is the class diagram of the Spring Boot application created for the H2O GBM model predictor.

The SagemakerController class is the entry point of this Spring Boot Java application, launched by the SagemakerLauncher class in the model inference Docker image. In its init() method, SagemakerController initializes the service by loading the H2O MOJO model artifact from Amazon S3, together with the H2O settings that impute missing model scoring input features, and by loading a predictor object.

The SagemakerController class also provides the /ping and /invocations REST API interfaces required by Amazon SageMaker, which are called by asynchronous and concurrent HTTPS requests to Amazon SageMaker model endpoint instances at runtime. Amazon SageMaker reserves the /ping path for health checks during the model endpoint deployment. The /invocations path is mapped to the invoke() method, which forwards the incoming model invocation requests to the predict() method of the predictor object asynchronously. This predict() method uses Amazon SageMaker instance resources dedicated to the model inference Docker image efficiently thanks to its non-blocking asynchronous and concurrent calls.
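The container contract itself is independent of the language used. The following sketch shows the two routes with Python’s standard library purely to illustrate that contract; it is not the Spring Boot implementation described above, it handles requests synchronously, and predict() is a hypothetical stand-in that does not load any MOJO. Amazon SageMaker sends GET /ping and POST /invocations to port 8080 of the container.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def predict(features):
    """Hypothetical stand-in for the MOJO predictor; always answers class 0."""
    return {"prediction": "0"}


def route(method, path, body=b""):
    """Map a request to (status, payload) following the SageMaker container contract."""
    if method == "GET" and path == "/ping":
        return 200, b""  # an HTTP 200 tells SageMaker the container is healthy
    if method == "POST" and path == "/invocations":
        return 200, json.dumps(predict(json.loads(body))).encode()
    return 404, b""


class Handler(BaseHTTPRequestHandler):
    def _respond(self, method):
        length = int(self.headers.get("Content-Length") or 0)
        status, payload = route(method, self.path, self.rfile.read(length))
        self.send_response(status)
        self.send_header("Content-Type", "application/jsonlines")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)

    def do_GET(self):
        self._respond("GET")

    def do_POST(self):
        self._respond("POST")


def serve(port=8080):
    """Start the blocking server; SageMaker expects the container to listen on 8080."""
    HTTPServer(("0.0.0.0", port), Handler).serve_forever()
```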

Navigate to the h2o-gbm-predictor repository in your command line. Optionally, you can test it on your local PC. Build and deploy the model inference Docker image to Amazon ECR using the installation command.

Creating a custom Amazon SageMaker algorithm resource

After publishing the model training and inference Docker images on Amazon ECR, it’s time to create an Amazon SageMaker algorithm resource called h2o-gbm-algorithm. As displayed in the following diagram, an Amazon SageMaker algorithm resource contains training and inference Docker image URIs, Amazon SageMaker instance types, input channels, supported hyperparameters, and algorithm evaluation metrics.
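The contents listed above map onto the request of the Amazon SageMaker CreateAlgorithm API. The sketch below assembles such a request as a Python dictionary. The field names follow that API, but every value (instance types, the single ntrees hyperparameter, channel names, the metric name and its regular expression) is a placeholder chosen for illustration and not the configuration in the h2o-gbm-algorithm-resource repository.

```python
def algorithm_request(training_image, inference_image):
    """Build an illustrative CreateAlgorithm request; all values are placeholders."""
    csv_channel = {
        "IsRequired": True,
        "SupportedContentTypes": ["text/csv"],
        "SupportedInputModes": ["File"],
    }
    return {
        "AlgorithmName": "h2o-gbm-algorithm",
        "TrainingSpecification": {
            "TrainingImage": training_image,
            "SupportedTrainingInstanceTypes": ["ml.m5.xlarge"],
            "SupportsDistributedTraining": False,
            "SupportedHyperParameters": [
                {
                    "Name": "ntrees",
                    "Type": "Integer",
                    # Range bounds are passed as strings in this API
                    "Range": {"IntegerParameterRangeSpecification": {"MinValue": "1", "MaxValue": "1000"}},
                    "IsTunable": True,
                    "IsRequired": False,
                    "DefaultValue": "50",
                }
            ],
            # Hypothetical metric; the regex must match a line in the training logs
            "MetricDefinitions": [{"Name": "validation:auc", "Regex": "validation_auc: ([0-9\\.]+)"}],
            "SupportedTuningJobObjectiveMetrics": [{"Type": "Maximize", "MetricName": "validation:auc"}],
            "TrainingChannels": [
                {"Name": "training", **csv_channel},
                {"Name": "validation", **csv_channel},
            ],
        },
        "InferenceSpecification": {
            "Containers": [{"Image": inference_image}],
            "SupportedRealtimeInferenceInstanceTypes": ["ml.m5.large"],
            "SupportedTransformInstanceTypes": ["ml.m5.large"],
            "SupportedContentTypes": ["application/jsonlines"],
            "SupportedResponseMIMETypes": ["application/jsonlines"],
        },
    }
```

With boto3 installed and credentials configured, the dictionary could be passed as boto3.client("sagemaker").create_algorithm(**algorithm_request(trainer_uri, predictor_uri)), where both URIs point to the images published on Amazon ECR.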

Navigate to the h2o-gbm-algorithm-resource repository in your command line. Then run the installation command to create your algorithm resource.

After a few seconds, an algorithm resource is created.

Because all the required infrastructure components are now deployed, it’s time to run the ML pipeline to train and deploy H2O models.

Running the ML workflow

To start running your workflow, complete the following steps:

  1. Upload the train.csv and validation.csv files to their dedicated directories in the <s3bucket> bucket (replace <s3bucket> with the S3 bucket name in the manifest.json file):
aws s3 cp examples/train.csv s3://<s3bucket>/titanic/training/
aws s3 cp examples/validation.csv s3://<s3bucket>/titanic/validation/
  2. Upload the manifest.json file to the s3://<s3bucket>/manifests directory located in the same S3 bucket specified during the ML workflow deployment:
aws s3 cp examples/manifest.json s3://<s3bucket>/manifests

As soon as the manifest.json file is uploaded to Amazon S3, Step Functions puts the ML workflow in a Running state.

Training the H2O model using Amazon SageMaker

To train your H2O model, complete the following steps:

  1. On the Step Functions console, navigate to ModelTuningWithEndpointDeploymentStateMachine to find it in Running state and observe the Model Tuning Job step.

  2. On the Amazon SageMaker console, under Training, choose Hyperparameter tuning jobs.
  3. Drill down to the tuning job in progress.

After 4 minutes, all training jobs and the model tuning job change to Completed status.

The following screenshot shows the performance and configuration details of the best training job.

  4. Navigate to the Amazon SageMaker model link to display the model definition in detail.

The following screenshot shows the detailed settings associated with the created Amazon SageMaker model resource.

Deploying the MOJO model to an auto-scaling Amazon SageMaker model endpoint

To deploy your MOJO model, complete the following steps:

  1. On the Step Functions console, navigate to ModelTuningWithEndpointDeploymentStateMachine to find it in Running state.
  2. Observe the ongoing Deploy Auto-scaling Model Endpoint step.

The following screenshot shows the Amazon SageMaker model endpoint during the deployment.

Auto-scaling model endpoint deployment takes approximately 5–6 minutes. When the endpoint is deployed, the Step Functions workflow successfully concludes.

  3. Navigate to the model endpoint that is in InService status; it’s now ready to accept incoming requests.

  4. Drill down to the model endpoint details and observe the endpoint runtime settings.

This model endpoint can scale from one to four instances, which are all behind Amazon SageMaker Runtime.
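Scaling limits like these are registered through Application Auto Scaling. The following sketch builds the two requests involved for a SageMaker endpoint variant. The one-to-four capacity range comes from the text above; the variant name AllTraffic, the policy name, the target-tracking policy type, and the target of 100 invocations per instance are assumptions, because the post does not state which scaling policy the workflow applies.

```python
def scaling_requests(endpoint_name, variant_name="AllTraffic", min_instances=1, max_instances=4):
    """Build Application Auto Scaling requests for a SageMaker endpoint variant."""
    resource = {
        "ServiceNamespace": "sagemaker",
        "ResourceId": f"endpoint/{endpoint_name}/variant/{variant_name}",
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
    }
    target = {**resource, "MinCapacity": min_instances, "MaxCapacity": max_instances}
    policy = {
        **resource,
        "PolicyName": f"{endpoint_name}-invocations-scaling",  # hypothetical name
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingScalingPolicyConfiguration": {
            "TargetValue": 100.0,  # assumed invocations per instance per minute
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
            },
        },
    }
    return target, policy
```

With boto3, the first dictionary would be passed to register_scalable_target(**target) and the second to put_scaling_policy(**policy) on an application-autoscaling client.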

Testing the Amazon SageMaker model endpoint

For Windows users, enter the following code to invoke the model endpoint:

aws sagemaker-runtime invoke-endpoint --endpoint-name survival-endpoint ^
--content-type application/jsonlines ^
--accept application/jsonlines ^
--body "{\"Pclass\":\"3\",\"Sex\":\"male\",\"Age\":\"22\",\"SibSp\":\"1\",\"Parch\":\"0\",\"Fare\":\"7.25\",\"Embarked\":\"S\"}"  response.json && type response.json

For Linux and macOS users, enter the following code to invoke the model endpoint:

aws sagemaker-runtime invoke-endpoint --endpoint-name survival-endpoint \
--content-type application/jsonlines \
--accept application/jsonlines \
--body "{\"Pclass\":\"3\",\"Sex\":\"male\",\"Age\":\"22\",\"SibSp\":\"1\",\"Parch\":\"0\",\"Fare\":\"7.25\",\"Embarked\":\"S\"}"  response.json --cli-binary-format raw-in-base64-out && cat response.json

As displayed in the following model endpoint response, this unfortunate third-class male passenger didn’t survive (prediction is 0) according to the trained model:

{"calibratedClassProbabilities":"null","classProbabilities":"[0.686304913500942, 0.313695086499058]","prediction":"0","predictionIndex":0}
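The same invocation can be made from Python. The sketch below is an illustration, not part of the original walkthrough: it assumes boto3 is installed and credentials are configured, and the helper names are hypothetical. Note that in the response shown above, classProbabilities is itself a JSON-encoded string, so it needs a second decoding step.

```python
import json


def parse_prediction(payload):
    """Decode the endpoint response into (predicted class, class probabilities)."""
    result = json.loads(payload)
    # classProbabilities arrives as a string such as "[0.68, 0.31]"
    probabilities = json.loads(result["classProbabilities"])
    return int(result["prediction"]), probabilities


def invoke(passenger, endpoint_name="survival-endpoint", client=None):
    """Send one passenger record to the endpoint and return the parsed prediction."""
    if client is None:
        import boto3  # imported lazily so parse_prediction works without the AWS SDK

        client = boto3.client("sagemaker-runtime")
    response = client.invoke_endpoint(
        EndpointName=endpoint_name,
        ContentType="application/jsonlines",
        Accept="application/jsonlines",
        Body=json.dumps(passenger),
    )
    return parse_prediction(response["Body"].read())
```

For the passenger used in the CLI examples, invoke({"Pclass": "3", "Sex": "male", "Age": "22", "SibSp": "1", "Parch": "0", "Fare": "7.25", "Embarked": "S"}) would return the predicted class and the two class probabilities.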

The invocation round-trip latency might be higher in the first call, but it decreases in the subsequent calls. This latency measurement from your PC to the Amazon SageMaker model endpoint also involves the network overhead of the local PC to AWS Cloud connection. To have an objective evaluation of model invocation performance, a load test based on real-life traffic expectations is essential.

Cleaning up

To stop incurring costs to your AWS account, delete the resources created in this post. For instructions, see Cleanup.

Conclusion

In this post, I explained how to use Amazon SageMaker to train and serve models for the H2O framework in a production-scale design pattern. This approach uses custom containers running a model training application built with a data science scripting language and a separate model hosting application built with a low-level language like Java, and has proven to be very robust and repeatable. You could also adapt this design pattern and its artifacts to other ML use cases.

 


About the Author

As a Machine Learning Prototyping Architect, Anil Sener builds prototypes on Machine Learning, Big Data Analytics, and Data Streaming, which accelerate the production journey on AWS for top EMEA customers. He has two master’s degrees in MIS and Data Science.

 

 



from AWS Machine Learning Blog https://ift.tt/2Jn1GEd
via A.I .Kung Fu

Waymo releases public road testing data on its self-driving operations in Arizona between Jan. 2019 and Sept. 2020, says its cars drove 6.1M+ miles in 2019 (Andrew J. Hawkins/The Verge)

Andrew J. Hawkins / The Verge:
Waymo releases public road testing data on its self-driving operations in Arizona between Jan. 2019 and Sept. 2020, says its cars drove 6.1M+ miles in 2019  —  In its first report on its autonomous vehicle operations in Phoenix, Arizona, Waymo said that it was involved in 18 crashes …



from Techmeme https://ift.tt/3mEQQYt
via A.I .Kung Fu

Daimler's truck division invests an undisclosed sum in lidar startup Luminar, as part of a partnership that seeks to develop autonomous trucks (Kirsten Korosec/TechCrunch)

Kirsten Korosec / TechCrunch:
Daimler's truck division invests an undisclosed sum in lidar startup Luminar, as part of a partnership that seeks to develop autonomous trucks  —  Daimler's trucks division has invested in lidar developer Luminar as part of a broader partnership to produce autonomous trucks capable …



from Techmeme https://ift.tt/35OX4hF
via A.I .Kung Fu

Twitter lifts restrictions from the NY Post's Twitter account, updating its practice of "not retroactively overturning prior enforcement" (Robert McMillan/Wall Street Journal)

Robert McMillan / Wall Street Journal:
Twitter lifts restrictions from the NY Post's Twitter account, updating its practice of “not retroactively overturning prior enforcement”  —  Social-media company reverses policy that previously required newspaper to delete old tweets before being able to tweet again



from Techmeme https://ift.tt/2HTbJ3g
via A.I .Kung Fu

Thursday, October 29, 2020

Biden campaign says Facebook blocked thousands of its ads approved prior to Facebook's ban; Facebook blames advertisers' misunderstanding and a technical glitch (Elena Schneider/Politico)

Elena Schneider / Politico:
Biden campaign says Facebook blocked thousands of its ads approved prior to Facebook's ban; Facebook blames advertisers' misunderstanding and a technical glitch  —  Biden's campaign said ads that had previously been approved by the tech giant have been erroneously removed.



from Techmeme https://ift.tt/3oEVKGH
via A.I .Kung Fu

Broken People novelist Sam Lansky wonders, What if you could fix all your faults in 3 days? - CNET

His new book, Broken People, answers that question, complete with a shaman who performs "open-soul surgery."

from CNET News https://ift.tt/3e8wEew
via A.I .Kung Fu

Kanye West gives Kim Kardashian a birthday hologram of her late father - CNET

The image of Robert Kardashian calls Kanye West the "most, most, most, most, most genius man in the whole world."

from CNET News https://ift.tt/37RXeHN
via A.I .Kung Fu

2022 Genesis GV70 SUV looks absolutely incredible - Roadshow

Genesis' new compact crossover looks amazing.

from CNET News https://ift.tt/2HE4Gf3
via A.I .Kung Fu

The Queen's Gambit: That ending explained and all your questions answered - CNET

Is the Netflix show based on a true story? Let's go through all those key details and more.

from CNET News https://ift.tt/2JhwcPO
via A.I .Kung Fu

Best smart plugs of 2020: Options for Google Assistant, Amazon Alexa or Siri - CNET

Bringing devices online is easier than ever, and these smart plugs connect everyday items with a voice assistant.

from CNET News https://ift.tt/36tNseg
via A.I .Kung Fu

NASA secures sample of asteroid Bennu to send home to Earth - CNET

The Osiris-Rex spacecraft grabbed so much space gravel its container started overflowing and the sample had to be put away early.

from CNET News https://ift.tt/3mwzPjd
via A.I .Kung Fu

Netflix is working on a Vince McMahon documentary with WWE - CNET

It'll be a multi-part docuseries on WWE's famously eccentric chairman.

from CNET News https://ift.tt/3oGqsiE
via A.I .Kung Fu

Apple, Amazon, Alphabet, Facebook rake in profits. Washington is watching - CNET

We've seen this story before.

from CNET News https://ift.tt/2TCMAMI
via A.I .Kung Fu

Kia is making a rad, production-based light military vehicle too, y'know - Roadshow

It looks like a little HMMWV and it's based on the Borrego SUV.

from CNET News https://ift.tt/3e7g1zQ
via A.I .Kung Fu

Chevy built a Bolt-powered classic K5 Blazer EV for SEMA and it's glorious - Roadshow

The kit can be used to bolt the Bolt's drivetrain into all kinds of classic vehicles.

from CNET News https://ift.tt/3ekYjsZ
via A.I .Kung Fu

Chevrolet built an electric Bolt-powered K5 Blazer for SEMA - Roadshow

It even keeps the Blazer's transfer case and axles.

from CNET News https://ift.tt/3eaU17h
via A.I .Kung Fu

Despite virtual school, kids will still get snow days in one district - CNET

Truly, snow days are one of the best things about American education.

from CNET News https://ift.tt/35KxkD3
via A.I .Kung Fu

Truth Seekers review: Simon Pegg and Nick Frost are surprisingly serious for Halloween - CNET

The Shaun of the Dead pair return to seek the truth behind haunted dwellings in the UK. But the Amazon show's horror and comedy only sometimes work.

from CNET News https://ift.tt/37WKxeG
via A.I .Kung Fu

Shopify Q3: revenue of $767.4M, up 96% YoY, vs $658M est., gross merchandise volume of $30.9B, up 109% YoY, ended Q3 with $6.12B in cash and equivalents (Larry Dignan/ZDNet)

Larry Dignan / ZDNet:
Shopify Q3: revenue of $767.4M, up 96% YoY, vs $658M est., gross merchandise volume of $30.9B, up 109% YoY, ended Q3 with $6.12B in cash and equivalents  —  The company, which has been among the winners during the COVID-19 pandemic, reported third quarter revenue of $767.4 million, up 96% from a year ago,



from Techmeme https://ift.tt/3efL6kX
via A.I .Kung Fu

Get three months of cell phone service for up to half off from Tello - CNET

You can get 4GB of data with unlimited talk and text for $15 per month with no contract.

from CNET News https://ift.tt/2HQVK5m
via A.I .Kung Fu

Halloween Google Doodle game resurrects Momo the black cat - CNET

Our feline friend has found a new foe to fight -- underwater ghouls!

from CNET News https://ift.tt/3oDkLCf
via A.I .Kung Fu

The Haunting of Bly Manor ending explained, and all your questions answered - CNET

Dani's beautifully haunting love story has a few complicated knots. Here are some answers to everything that happened in the Netflix horror.

from CNET News https://ift.tt/3k7MJU5
via A.I .Kung Fu

Getting too many texts? Blame the election

Every election campaign uses more sophisticated tech, but the text message still cuts through.

from BBC News - Technology https://ift.tt/37SvCSN
via A.I .Kung Fu

Facebook Q3: ad revenue was $21.2B, up 22% YoY from $17.4B despite July's ad boycott; Facebook expects higher ad growth in Q4 from holiday season (Ted Johnson/Deadline)

Ted Johnson / Deadline:
Facebook Q3: ad revenue was $21.2B, up 22% YoY from $17.4B despite July's ad boycott; Facebook expects higher ad growth in Q4 from holiday season  —  UPDATE, 4:15 PM PT: Facebook CEO Mark Zuckerberg gave a glimpse of what changes he envisions for Section 230, the law that gives tech platforms legal immunity …



from Techmeme https://ift.tt/3mMPA5R
via A.I .Kung Fu

Facebook's "Other" revenue, including sales of Oculus and Portal, was $249M in Q3, down 7% YoY, but says strong Oculus Quest 2 sales could boost Q4 revenue (Salvador Rodriguez/CNBC)

Salvador Rodriguez / CNBC:
Facebook's “Other” revenue, including sales of Oculus and Portal, was $249M in Q3, down 7% YoY, but says strong Oculus Quest 2 sales could boost Q4 revenue  —  - Facebook stock remained relatively flat on Thursday after the company reported a decrease in users in the U.S. and Canada …



from Techmeme https://ift.tt/35PU7xv
via A.I .Kung Fu

Corsair announces it has acquired the EpocCam app, which turns iOS devices into high definition Mac and PC webcams; EpocCam creator Sam Grohn to join Elgato (Brian Heater/TechCrunch)

Brian Heater / TechCrunch:
Corsair announces it has acquired the EpocCam app, which turns iOS devices into high definition Mac and PC webcams; EpocCam creator Sam Grohn to join Elgato  —  Corsair Gaming today announced that it has acquired EpocCam, the software developer behind the iOS software of the same name.



from Techmeme https://ift.tt/2Gf2CJE
via A.I .Kung Fu

Apple reports an all-time high of 585M paid subscriptions for its services in Q4, growing by 135M YoY, on track to reach 600M by the end of 2020 (Mike Peterson/AppleInsider)

Mike Peterson / AppleInsider:
Apple reports an all-time high of 585M paid subscriptions for its services in Q4, growing by 135M YoY, on track to reach 600M by the end of 2020  —  Apple's number of paid subscriptions for its various Services hit a new high of 585 million, and the company expects to hit 600 million by the end of the year.



from Techmeme https://ift.tt/32c3UwT
via A.I .Kung Fu

Wednesday, October 28, 2020

SoftBank Group, NVIDIA CEOs on What’s Next for AI

Good news: AI will soon be everywhere. Better news: it will be put to work by everyone.

Sharing a vision of AI enabling humankind, NVIDIA CEO Jensen Huang Wednesday joined Masayoshi Son, Chairman and CEO of SoftBank Group Corp. as a guest for his keynote at the annual SoftBank World conference.

“For the first time, we’re going to democratize software programming,” Huang said. “You don’t have to program the computer; you just have to teach the computer.”

Son is a legendary entrepreneur, investor and philanthropist who pioneered the development of the PC industry, the internet and mobile computing in Japan.

A Technological Jewel

The online conversation comes six weeks after NVIDIA agreed to acquire Arm from SoftBank in a transaction valued at $40 billion. Huang described Arm as “one of the technology world’s great jewels” in his conversation with Son.

“The reason why combining Arm and NVIDIA makes so much sense is because we can then bring NVIDIA’s AI to the most popular edge CPU in the world,” Huang said while seated beside the fireplace of his Silicon Valley home.

Arm has long provided its intellectual property to many chipset vendors, who deploy it on many different applications, in many different systems-on-a-chip, or SoCs, Son explained.

Huang said the combined company would “absolutely” continue this.

An Ecosystem Like No Other

“Of course the CPU is fantastic, energy-efficient and it’s improving all the time, thanks to incredible computer scientists building the best CPU in the world,” Huang said. “But the true value of Arm is in the ecosystem of Arm — the 500 companies that use Arm today.”

That ecosystem is growing fast. Son said it won’t be long until a trillion Arm-based SoCs have been shipped. Making NVIDIA AI available to those trillion chipsets “will be an amazing combination,” Son said.

“Our dream is to bring NVIDIA’s AI to Arm’s ecosystem, and the only way to bring it to the Arm ecosystem is through all of the existing customers, licensees and partners,” Huang said. “We would like to offer the licensees more, even more.”

Arm, Son said, provides toolsets to enable companies to create SoCs for very different applications, from game machines and home appliances to robots that fly or run or swim. These devices will, in turn, communicate with cloud AI “so each of them become smarter.”

“That’s the reason why combining Arm and NVIDIA makes so much sense because we can then bring NVIDIA AI to the most popular edge CPU in the world,” Huang said.

‘Intelligence at Scale’

That will allow even more companies to participate in the AI boom.

“AI is a new kind of computer science; the software is different, the chips are different, the methodology is different,” Huang said.

It’s a huge shift, Son agreed.

First, Son said, computers enabled advancements in calculation; next, came the ability to store massive amounts of data; and “now, finally, computers are the ears and the eyes, so they can recognize voice and speech.”

“It’s intelligence at scale,” Huang responded. “That’s the reason why this age of AI is such an important time.”

Extending Human Capabilities

Son and Huang spoke about how enterprises worldwide — from AstraZeneca and GlaxoSmithKline in drug discovery, to American Express in banking, to Walmart in retail, to Microsoft in software, to Kubota in agriculture — are now adopting NVIDIA AI tools.

Huang cited a new generation of systems, called recommender systems, that are already helping humans sort through the vast array of choices available online in everything from what clothes they wear to what music they listen to.

Huang and Son describe such systems — and AI more broadly — as a way to extend human capabilities.

“Humans will always be in the loop,” Huang said.

“We have a heart, a desire to be nice to other humans,” Son said. “We will utilize AI as a tool, for our happiness, for our joy — humans will choose which recommendations to take.”

‘Perpetually Learning Machines’

Such intelligent systems are being woven into the world around us, through smart, connected systems, or “edge AI,” Son said, which will work hand in hand with powerful cloud AI systems able to aggregate input from devices in the real world.

The result will be a “learning loop,” or “perpetually learning machines,” Huang said.

“The cloud side will aggregate information from edge AI, it will become smarter and smarter,” Son said.

Democratizing AI

One result: computing will finally be democratized, Huang said. Only a small number of people want to pursue a career as a computer programmer, but “everyone can teach,” Huang said.

“You [will] just ask the computer, ‘This is what I want to do, can you give me a solution?,’” Son responded. “Then the computer will give us the solution and the tools to make it happen.”

Such tools will amplify Japan’s strengths in precision engineering and manufacturing.

“This is the time of AI for Japan,” Huang said.

Huang described how, in tools such as NVIDIA Omniverse, a digital factory can be continually optimized.

“This robotic factory will be filled with robots that will build robots in virtual reality,” Huang said. “The whole thing will be simulated … and when you come in in the morning the whole thing will be optimized more than it was when you went to bed.”

Once it’s ready, a physical twin of the digital factory can be built and continually optimized with lessons learned in the virtual one.

“It’s the concept of the metaverse,” Son said, referring to the shared online world imagined in Neal Stephenson’s 1992 cyberpunk classic, “Snow Crash.”

“… and it’s right in front of us now,” Huang added.

Connecting Humans with One Another

In addition to extending human capabilities with AI, it will help humans better connect with one another.

Video conferencing will soon be the vast majority of the world’s internet traffic, Huang said. Using AI to reconstruct a speaker’s facial expressions can “reduce bandwidth” by a factor of 10.

It can also unleash new capabilities, such as the ability for a speaker to make direct eye contact with 20 different people watching simultaneously, or real-time language translation.

“So you can speak to me in the future in Japanese and I can speak to you in English, and you will hear Japanese and I will hear English,” Huang said.

Enabling Big Dreams

Melding human judgment and AI, adaptive, autonomous machines and tightly connected teams of people will give entrepreneurs, philanthropists and others with “big wishes and big dreams” the ability to tackle ever more ambitious challenges, Huang said.

Son said AI is playing a role in the development of technologies that can detect heart attacks before they happen, speed the discovery of new treatments for cancer, and eliminate car accidents, among others.

“It is a big help,” Son said. “So we should be having a big smile, and big excitement, welcoming this revolution in AI.”

The post SoftBank Group, NVIDIA CEOs on What’s Next for AI appeared first on The Official NVIDIA Blog.



from The Official NVIDIA Blog https://ift.tt/37MBj4N
via A.I .Kung Fu

Sidekick Health raises $20 million Series A for its gamified digital care service that focuses on chronic disease management (Natasha Lomas/TechCrunch)

Natasha Lomas / TechCrunch:
Nordic digital therapeutics company Sidekick Health has closed a $20 million Series A led by pan-European VC Wellington Partners and healthcare-focused VC Asabys Partners.



from Techmeme https://ift.tt/2J94dSc
via A.I .Kung Fu

‘Perception Hacks’ and Other Potential Threats to the Election

In the final days of voting, election officials and cybersecurity experts are keeping a close eye on a range of possible ways foreign governments and other hackers could interfere.

from NYT > Technology https://ift.tt/2HIjyJa
via A.I .Kung Fu

TikTok and parent ByteDance have filed a suit against Triller in response to Triller's suit filed in July that alleges ByteDance infringed on one of its patents (Bloomberg)

Bloomberg:
The lawsuit seeks a ruling that TikTok doesn't infringe its rival's patent. Both video-streaming companies are in takeover negotiations.



from Techmeme https://ift.tt/2HM4D01
via A.I .Kung Fu

Halo Infinite loses director after being delayed to 2021 - CNET

Halo Infinite's development is looking increasingly beleaguered.

from CNET News https://ift.tt/2G6EIjh
via A.I .Kung Fu

The best phone to buy for 2020 - CNET

Hand-picked by CNET editors, find the phone you need right now.

from CNET News https://ift.tt/2TxTenn
via A.I .Kung Fu

FCA CEO Mike Manley confirms plans for electric Ram truck - Roadshow

Let's hope they don't let the same team that came up with "Stellantis" name this thing.

from CNET News https://ift.tt/3mCR5U5
via A.I .Kung Fu

The Haunting of Bly Manor ending explained, and all your questions answered - CNET

Dani's beautifully haunting love story has a few complicated knots. Here are some answers to everything that happened in the Netflix horror.

from CNET News https://ift.tt/3k7MJU5
via A.I .Kung Fu

The 31 best movies to see on Netflix - CNET

Not sure what to watch tonight? Here are some of the best movies Netflix has to offer.

from CNET News https://ift.tt/2G7h5XU
via A.I .Kung Fu

Happiness during coronavirus: 'Less about pleasure and more about wisdom' - CNET

Happiness during the stress and uncertainty of COVID-19 can seem elusive. Here's how scientists, writers, comedians, a yoga teacher and others are finding glimmers of it.

from CNET News https://ift.tt/35MvKAu
via A.I .Kung Fu

31 best movies to see on Disney Plus - CNET

Looking for entertainment other than Marvel and Star Wars? Let's round up the best gems on Disney Plus.

from CNET News https://ift.tt/2HNSVSA
via A.I .Kung Fu

31 of the best TV shows to binge-watch on Hulu - CNET

Searching for a great show to watch tonight? Here are some of the best Hulu has to offer.

from CNET News https://ift.tt/3e7N1bq
via A.I .Kung Fu

The 11 best TV shows to binge on Disney Plus - CNET

Searching for more great shows like The Mandalorian? Let's round up Disney's best gems.

from CNET News https://ift.tt/37OSl24
via A.I .Kung Fu

Daylight Saving Time officially apologizes for making 2020 longer - CNET

Or at least one of its ministers does. Even one extra hour is too long for this hellish year.

from CNET News https://ift.tt/3kFMTlH
via A.I .Kung Fu

The 52 best TV shows to watch on Netflix - CNET

Looking for a great show to binge? Here are some of the best Netflix has to offer.

from CNET News https://ift.tt/2HAxN2R
via A.I .Kung Fu

Home Depot Black Friday deals start Nov. 6: See everything on sale - CNET

Save $650 on an LG smart refrigerator, $100 on a Ryobi 6-piece cordless tool combo, $69 on a KitchenAid stand mixer and much more.

from CNET News https://ift.tt/2HNoHPF
via A.I .Kung Fu

Verizon's Yahoo Mobile is launching its first smartphone for $50 - CNET

The phone is available as part of Yahoo Mobile's unlimited phone service.

from CNET News https://ift.tt/35Dakpy
via A.I .Kung Fu

Animal Crossing: 'My sister lives on in a video game'

Tending to her sister's virtual town in Animal Crossing helps Meredith Myers keep her memory alive and cope with her grief.

from BBC News - Technology https://ift.tt/2HIGJDb
via A.I .Kung Fu

Disinformation Moves From Social Networks to Texts



from NYT > Technology https://ift.tt/3jC2lOo
via A.I .Kung Fu

Facebook Removes Trump and Biden Ads, Saying They Could Mislead Voters



from NYT > Technology https://ift.tt/2HNUBvm
via A.I .Kung Fu

Cybersecurity experts say FBI is investigating Ryuk ransomware attacks on more than two dozen US hospitals, and officials warned hospitals to back up systems (Christopher Bing/Reuters)

Christopher Bing / Reuters:
WASHINGTON (Reuters) - The FBI is investigating the recent targeting with ransomware of more than two dozen hospitals across …



from Techmeme https://ift.tt/3jCtTmV
via A.I .Kung Fu

Amazon files motion to dismiss suit alleging false advertising of Prime Video purchases, says its terms indicate users pay for "limited license", not ownership (Ashley Cullins/Hollywood Reporter)

Ashley Cullins / Hollywood Reporter:
The streamer says its terms of use are clear: What viewers are paying for is a limited license.



from Techmeme https://ift.tt/3jD0xVt
via A.I .Kung Fu

BT signs 5G deal with Ericsson to help ditch Huawei

The move means its EE mobile network will be powered by both Ericsson and Nokia's products.

from BBC News - Technology https://ift.tt/37MrvHP
via A.I .Kung Fu

Mercedes targets evolution rather than revolution in self-driving car push

An employee of Daimler demonstrates steering by the Drive Pilot Level 3 autonomous driving system in a new Mercedes-Benz S-Class limousine at the company's test center near Immendingen, Germany, October 14, 2020.
Mercedes-Benz has developed a self-driving system similar to Tesla's, but it won't allow the public to take it on urban roads yet.

from VentureBeat https://ift.tt/31RnNsR
via A.I .Kung Fu

Fluffy animals gambling ad was irresponsible, says watchdog

The advertising watchdog bans an advert for Gala Spins as it was likely to "appeal to under-18s".

from BBC News - Technology https://ift.tt/37O8cxU
via A.I .Kung Fu

Aston Martin: Mercedes to take 20% stake in luxury brand

The German company will also provide the struggling British luxury marque with electric car technology.

from BBC News - Technology https://ift.tt/2TuVwng
via A.I .Kung Fu

Audible adds 100K free podcasts to its catalog, available through the app for all listeners with or without a subscription (Ashley Carman/The Verge)

Ashley Carman / The Verge:
Audible is turning into more of a podcast app. The company announced today that its catalog now contains 100,000 free podcasts …



from Techmeme https://ift.tt/34AZHEp
via A.I .Kung Fu