Passive Income

Knowing what you want to achieve, and what passive income is, is important

Passive income is not just income earned outside of your job. The real meaning of passive income is money that you earn with little to no effort. 

Some may think that money earned in the stock market or from properties is passive income. Yet, this may not be the case. Stock market investors can sometimes be so caught up in trading and looking at stock prices that investing becomes a huge part of their lives and can even be considered another job. Property investing can also turn out to be tedious if you manage your properties yourself.

I know of friends who want to earn passive income but instead end up spending so much time on their investments. Don’t get me wrong. I love spending time learning and investing but this isn’t really “passive”.

Here’s how you can earn real passive income.

Stop trading the stock market

First, stop short-term trading. My definition of trading is buying stocks in the very short-term based on price action and charts. This is not investing and can become a part-time or full-time job as it requires a lot of time and effort. The more trades you need to make, the more effort is required.

We should aim to invest in a way that reduces the number of trades and amount of work that we need to do. 

One way to do this is by investing long-term in set-and-forget investments. Investing in stocks that have the potential to grow earnings (and thus the share price) reliably over the long-term is one good strategy that lowers the time spent on investing.

You can also invest in passive index ETFs that track the performance of broad market indexes. Stock indexes have historically increased in value over a sufficiently long period of time and provide a good way to gain exposure to some of the biggest and most profitable companies.

You can also invest in dividend stocks that reliably pay a dividend. Investors from Singapore enjoy tax-free dividends when they invest in Singapore-listed dividend-paying stocks.

Outsource your investing

Another way to reduce time and effort spent on investing is to outsource your investing to an expert.

One way to do this is by employing financial experts who can advise you on stocks to buy or funds to purchase. You can also use robo advisors which can help you allocate your portfolio into a variety of investments.

While you will need to pay a fee for these services, having someone to invest on your behalf or advise you frees you from the hassle of doing everything yourself and saves you a ton of time.

Invest in other passive assets

You can also invest in other assets beside the stock market.

Assets such as long-term fixed deposits, government bonds, or even professionally-managed real estate may be a good way to grow your wealth without doing much work.

If you can find long-term investments that require little effort on your part but can provide a stable passive return, this is a potentially good asset to invest in and reduce your investment effort.

Know your goals

What do you really want to achieve? Do you want to grow your wealth as quickly as possible?

Then by all means go ahead and dig through annual reports, scour the market for undervalued stocks, sell weekly put options, or even manage your own AirBnB property for higher rental yields. There is nothing wrong with this and is my preferred style of investing.

But this is not passive income.

If you really want passive income, invest in long-term assets, find a professional or robo advisor to manage your wealth or build passive income by dollar cost averaging into funds or long-term assets.

While this may not always give you the best fee-adjusted returns, this is a true passive income strategy and frees up your time for other things in life.

Ultimately, when it comes to investing there is no one-size-fit-all strategy and knowing what you want to achieve can help determine how you should approach investing your spare capital (and time).


Disclaimer: The Good Investors is the personal investing blog of two simple guys who are passionate about educating Singaporeans about stock market investing. By using this Site, you specifically agree that none of the information provided constitutes financial, investment, or other professional advice. It is only intended to provide education. Speak with a professional before making important decisions about your money, your professional life, or even your personal life. I do not have a vested interest in any companies mentioned. Holdings are subject to change at any time.

Why Do Employees Love Stock-Based Compensation?

Stock-based compensation has its risks, but it can still be an attractive proposition for an employee

In the past, stock-based compensation was more common with fledging startups that had to find ways to preserve the little cash they had.

But today, stock-based compensation is used by almost every major company in the world. Even big firms with lots of cash continue to use stock-based compensation. One reason is because employees want to get paid in stock.

How it works

To understand why this is, we need to look at the mechanics of how stock-based compensation works.

In a typical compensation package, an employee may be offered an annual contract with, say, 33% of the compensation coming in the form of restricted stock units and the rest in cash. This means that an employee who is on a $100,000 annual package will get $33,000 worth of shares per year.

But here’s the catch. This $33,000 worth of shares is based on the share price at the time of signing the employment contract. If the share price rises, the amount that the employee receives each year will be more than $33,000.

For instance, many Nvidia employees who were hired before the massive run up in its stock price the last few years are now receiving shares every quarter that are worth so much than when they joined the company.

Here’s how the math works. Let’s say you were hired by Nvidia five years ago. Back then, its shares were trading at a split-adjusted price of US$6.10 each. You were given a US$200,000 annual package for five years that consists of US$134,000 in cash and US$66,000 in stock . Using the stock price of US$6.10, the US$66,000 in stock-based compensation means you will receive 10,819 shares each year. The number of shares that you receive each year is fixed, even if the stock price goes up or down. Fast forward to today, and that 10,819 shares that you receive each year is now worth US$1.1 million.

Typically, stock grants only last for a few years before they expire and new grants will be made at the current stock price. This is why some employees may want to leave the company after the stock price has run up a lot and they have collected all their shares from the initial grant.

Stock-based compensation lets an employee enjoy the potential upside from a company’s stock without having to put down their own capital to buy shares. For instance, you, the Nvidia employee who was hired five years ago, essentially “bought” US$330,000 (US$66,000 multiplied by 5) worth of Nvidia shares five years ago. That’s a huge bet for most people, but stock-based compensation allows an employee to enjoy the returns of this bet without actually having to buy shares.

Potential downsides

However, there are potential downsides for an employee who takes a pay package that has a significant component in stock-based compensation.

For public-listed companies, employees can sell the shares when they vest. But for private companies, the shares are illiquid and employees may not have an easy way to convert the shares to cash. In addition, employees who work for a small startup and get shares in the startup have a high risk that the startup fails and the company’s shares ends up worthless.

I know of friends who are stuck with shares in companies they previously worked for. These companies may be struggling or have no clear path to an IPO or to be acquired, leading my friends to be stuck with shares of the companies without any real means to cash out.

The risk for employees of public-listed companies who receive stock-based compensation is that the share price falls. This is the case for many US-listed technology companies after 2021, with many of their stock prices still down by 50% or more from their 2021 peak.

Imagine if you joined Okta in April 2021 and received a 4-year pay package of US$200,000 consisting of US$134,000 in cash and US$66,000 in shares. Back then the shares were trading at around US$244 each, so you would receive 270 shares per year. Today, 270 Okta shares are worth US$27,049 – a materially smaller sum compared to the grant value of US$66,000. You would have been better off taking US$200,000 in all-cash compensation.

Bottom line

All things considered, despite some drawbacks, stock-based compensation is still an attractive proposition for an employee as it allows them to make a huge “bet” at the grant date stock price of a company without laying out any capital at all.

If the stock surges like Nvidia’s, then the employee could be set for life. However, if the stock fails, employees still get the cash portion of the annual pay package.


Disclaimer: The Good Investors is the personal investing blog of two simple guys who are passionate about educating Singaporeans about stock market investing. By using this Site, you specifically agree that none of the information provided constitutes financial, investment, or other professional advice. It is only intended to provide education. Speak with a professional before making important decisions about your money, your professional life, or even your personal life. I have a vested interest in Okta. Holdings are subject to change at any time.

What The USA’s Largest Bank Thinks About The State Of The Country’s Economy In Q1 2025

Insights from JPMorgan Chase’s management on the health of American consumers and businesses in the first quarter of 2025.

JPMorgan Chase (NYSE: JPM) is currently the largest bank in the USA by total assets. Because of this status, JPMorgan is naturally able to feel the pulse of the country’s economy. The bank’s latest earnings conference call – for the first quarter of 2025 – was held last week and contained useful insights on the state of American consumers and businesses. The bottom-line is this: the US economy is facing turbulence, with a multitude of problems, but consumers and businesses still remain financially healthy

What’s shown between the two horizontal lines below are quotes from JPMorgan’s management team that I picked up from the call.


1. The US economy is facing turbulence, with problems including tariffs, trade wars, inflation, and high asset prices

The economy is facing considerable turbulence (including geopolitics), with the potential positives of tax reform and deregulation and the potential negatives of tariffs and “trade wars,” ongoing sticky inflation, high fiscal deficits and still rather high asset prices and volatility. As always, we hope for the best but prepare the Firm for a wide range of scenarios.

2. Net charge-offs for the whole bank (effectively bad loans that JPMorgan can’t recover) rose from US$1.9 billion a year ago; management increased the probability weightings for downside scenarios in its CECL (current expected credit losses) framework for credit allowances in 2025 Q1 because of higher risks and uncertainties from the environment seen in the last few weeks; the increase in allowance is not driven by deterioration in credit performance; Consumer & Community Banking’s net charge-offs surged significantly from US$0.72 billion a year ago 

Credit costs were $3.3 billion with net charge offs of $2.3 billion and a net reserve bill of $973 million…

…With this quarter’s reserve bill, firm’s total allowance for credit losses is $27.6 billion. Let’s take a second to add a little bit of context to our thinking surrounding this number in light of the unique environment of the last several weeks. Our first quarter allowance is anchored on the relatively benign central case economic outlook, which was in effect at the end of the quarter. But in light of the significantly elevated risks and uncertainties at the time, we increased the probability weightings associated with the downside scenarios in our CECL framework. As a result, the weighted average unemployment rate embedded in our allowance is 5.8%, up from 5.5% last quarter, driving the $973 million increase in the allowance. So with that in mind, the consumer build of $441 million was driven by changes in the weighted average macroeconomic outlook. The wholesale build of $549 million was predominantly driven by credit quality changes on certain exposures and that lending activity, as well as changes in the outlook…

…The increase in the allowance is not to any meaningful degree driven by deterioration in the actual credit performance in the portfolio which remains largely in line with expectations…

…Credit costs were $2.6 billion, reflecting net charge-offs of $2.2 billion, up $275 million year on year, pred ominantly driven by the seasoning of recent vintages and card with delinquencies and losses in line with expectations.

3. Management is seeing recent downtrends in consumer and small business sentiment, but consumers and small businesses remain financially healthy; management is seeing consumers front-load spending ahead of tariffs; management is seeing small businesses face more challenges than large businesses because of tariffs-related uncertainty; management is seeing a drop in travel-spending among consumers, but it’s not indicative of broader patterns; management is seeing relatively weaker spending from lower-income consumers, but they are not in distress 

Consumers and small businesses remain financially healthy despite the recent down trends in consumer and small business sentiment. Based on our data, spend, cash buffers, payment to income ratios, and credit utilization are all in line with our expectations…

…On the consumer side, the thing to check is the spending. And to be honest, the main thing that we see there would appear to be a certain amount of front-loading of spending ahead of people expecting price increases from tariffs…

…In terms of our corporate clients, obviously, they’ve been reacting to the changes in tariff policy… Across the size of the clients, I think smaller clients, small business, and smaller corporates are probably a little bit more challenged. I think the larger corporates have a bit more experience dealing with these things and more resources to manage…

…We obviously saw the airlines discuss what they are seeing as headwinds for them, specifically in airline travel. And we’re seeing that too through the card spend. It’s not obvious to us that that’s necessarily an indicator for broader patterns…

…When we look at our card data and also our cash buffers and people checking accounts, of course, it is true that it is relatively weaker in the lower income segment. But when you take a step back and you ask, are we seeing signs of distress in the lower income segment? The answer is no. So sure, the margin cash buffers are lower, and you see some rotation of spend and spending is a little bit weaker than it was in the peak spending moments. But actually, some of the increases in spending that we’re seeing in April are actually coming from the lower income segment. So no evidence of distress, I would say.

4. JPMorgan’s credit card outstanding loans was up double-digits year-on-year

Card outstandings were up 10% due to strong account acquisition.

5. Auto originations were up year-on-year

In auto, originations were $10.7 billion, up 20%, driven by higher lease volume.

6. JPMorgan’s investment banking fees had good growth in 2025 Q1, with growth in debt underwriting fees but a decline in equity underwriting fees, signalling higher appetite for refinancing activity from companies; management is seeing companies adopting a wait-and-see attitude when it comes to capital markets activities because of tariffs-related uncertainty in the current environment

IB fees were up 12% year on year, and we ranked number one with wallet share of 9%. In advisory, fees were up 16%, benefiting from the closing of deals announced in 2024. Debt underwriting fees were up 16%, primarily driven by elevated refinancing activity, particularly in leveraged finance. And equity underwriting fees were down 9% year on year, reflecting challenging market conditions. In light of market conditions, we are adopting a cautious stance on the investment banking outlook. While client engagement and dialogue is quite elevated, both the conversion of the existing pipeline and origination of new activity will require a reduction in the current levels of uncertainty…

…In terms of our corporate clients, obviously, they’ve been reacting to the changes in tariff policy. And at the margin, that shifts their focus away from more strategic priorities with obvious implications for the investment banking pipeline outlook towards more short-term work, optimizing supply chains, and trying to figure out how they’re going to respond to the current environment. So as a result, I think we would characterize what we’re hearing from our corporate clients as a little bit of a wait-and-see attitude.

7. Management expects credit card net charge-offs for 2025 to be in line with previous guidance because of the mechanical way credit card charge-offs work, and not because management thinks credit card net charge-offs will really be healthy as the year progresses

On credit, we expect the card net charge-off rate to be in line with our previous guidance of approximately 3.6%…

…[Question] No change to the full year credit card net charge-off forecast. How do we square that with the rising recession risk?

[Answer] We should have not given you that forecast. We don’t know what the number is going to be. I would say that’s a short-term number. And based on what’s happening today is there’s a wide range of potential outcomes… There are some mechanical elements to the way card charge-off works. That means that it’s pretty baked, pretty far out of time a couple of quarters… It just doesn’t necessarily tell you that much about what might actually happen through the end of the year, even if unemployment were to increase significantly, it probably wouldn’t flow through the charge-offs until later.

8. Management is now incorporating 3 interest rate cuts for 2025, up from the previous expectation of 1 cut

If you remember last quarter we said that we had one cut in the curve. I think, latest curve has something like three cuts.

9. JPMorgan’s economists think there’s a 50% chance of a recession

What I would say is our excellent economists, Michael Feroli, I call him this morning, specifically to ask him how they’re looking at their forecast today. And they think it’s about 50-50 for a recession. So I’ll just refer to that.

10. Management thinks inflation in the US will be sticky

We have sticky inflation. We had that before. I personally have told you I don’t think that’s going to go away, and that relates to that.

11. Management thinks the US dollar will remain the reserve currency globally

Obviously, the US dollar still is the reserve currency, and that isn’t going to change though some people may feel slightly differently about it.

12. Management thinks that the current situation is different from past cycles

[Question] You’ve been through many cycles. And I think we’re all interested in understanding how you think this next cycle is likely to progress. And I’m wondering, is there anything that you’ve seen in the past that looks like this or that you would suggest if any slowdown coming forward, is it more likely to be similar to what kind of prior cycle you’ve seen?

[Answer] This is different, okay? This is different. This is the global economy. And please read my chairman’s letter. The most important thing to me is the Western world stays together economically when we get through all this and militarily to keep the world safe and free for democracy. That is the most important thing… We obviously have to follow the law of the land, but it’s a significant change we’ve never seen in our lives.


Disclaimer: The Good Investors is the personal investing blog of two simple guys who are passionate about educating Singaporeans about stock market investing. By using this Site, you specifically agree that none of the information provided constitutes financial, investment, or other professional advice. It is only intended to provide education. Speak with a professional before making important decisions about your money, your professional life, or even your personal life. I don’t have a vested interest in any company mentioned. Holdings are subject to change at any time.

What The USA’s Largest Bank Thinks About The State Of The Country’s Economy In Q4 2024

Insights from JPMorgan Chase’s management on the health of American consumers and businesses in the fourth quarter of 2024.

JPMorgan Chase (NYSE: JPM) is currently the largest bank in the USA by total assets. Because of this status, JPMorgan is naturally able to feel the pulse of the country’s economy. The bank’s latest earnings conference call – for the fourth quarter of 2024 – was held earlier this week and contained useful insights on the state of American consumers and businesses. The bottom-line is this: the US economy remains resilient, but two significant risks remain, namely, persistent inflation and dangerous geopolitical conditions 

What’s shown between the two horizontal lines below are quotes from JPMorgan’s management team that I picked up from the call.


1. The US economy remains resilient, with low unemployment and healthy consumer spending; businesses are now more optimistic about the economy

The U.S. economy has been resilient. Unemployment remains relatively low, and consumer spending stayed healthy, including during the holiday season. Businesses are more optimistic about the economy, and they are encouraged by expectations for a more pro-growth agenda and improved collaboration between government and business.

2. Management sees two significant risks, namely, persistent inflation, and the most dangerous geopolitical conditions since World War II; management thinks a high level of optimism is embedded in asset prices; management is focused on being prepared for a wide range of scenarios

Two significant risks remain. Ongoing and future spending requirements will likely be inflationary, and therefore, inflation may persist for some time. Additionally, geopolitical conditions remain the most dangerous and complicated since World War II…

…We think it’s important to acknowledge the tension in the risks and uncertainties in the environment and the degree of optimism embedded in asset prices and expectations. In that context, we remain upbeat about the strength of the franchise, but we are focused on being prepared for a wide range of scenarios.

3. Net charge-offs for the whole bank (effectively bad loans that JPMorgan can’t recover) rose from US$2.2 billion a year ago; Consumer & Community Banking’s net charge-offs rose by US$0.4 billion from a year ago

Credit costs were $2.6 billion, reflecting net charge-offs of $2.4 billion and a net reserve of $267 million…

…In terms of credit performance this quarter, credit costs were $2.6 billion, reflecting net charge-offs of $2.1 billion, up $428 million year-on-year driven by card. The net reserve build was $557 million predominantly driven by higher card revolving balances.

4. JPMorgan’s credit card outstanding loans was up double-digits; management expects card loans to grow in 2025, but at a slower pace than in 2024

Card outstandings were up 11% due to strong account acquisition and revolvers…

… We expect healthy card loan growth again this year but below the 12% pace we saw in 2024 as tailwinds from revolver normalization are largely behind us. 

5. Auto originations were up

In auto, originations were $10.6 billion, up 7%, reflecting higher lease volume on robust new vehicle inventory. 

6. JPMorgan’s investment banking fees had strong growth in 2024 Q4, with strong growth in debt underwriting and equity underwriting fees, signalling higher appetite for capital-markets activity from companies; management is optimistic about companies’ enthusiasm towards capital markets activities

IB fees were up 49% year-on-year, and we ranked #1 with wallet share of 9.3% for 2024. Advisory fees were up 41%, benefiting from large deals and share growth in a number of key sectors. Underwriting fees were up meaningfully with debt up 56% and equity up 54% primarily driven by favorable market conditions. In terms of the outlook for the overall Investment Banking wallet, in light of the positive momentum, we remain optimistic about our pipeline. 

7. Management is seeing companies paydown bank loans and is not seeing loan growth, but the lack of loan growth is not necessarily a negative thing, as it involves companies having wide access to capital markets

Global Corporate and Investment Banking loans were down 2% quarter-on-quarter driven by paydowns and lower short-term financing, primarily offset by originations. In Commercial Banking, middle market loans were also down 2% driven by paydowns, predominantly offset by new originations. And commercial real estate loans were flat as new originations were offset by paydowns…

…I think given the significant improvement in business sentiment and the general optimism out there, you might have expected to see some big open loan growth. We are not really seeing that. I don’t particularly think that’s a negative. I think it’s probably explained by a combination of wide open capital markets and so many of the larger corporates accessing the capital markets and healthy balance sheets in small businesses and maybe some residual caution. And maybe there are some pockets in some industries where some aspects of the policy uncertainty that we might be facing are making them a little bit more cautious than they otherwise would be about what they’re executing in the near term. But we’ll see what the new year brings. The current optimism starts getting tested with reality one way or the other.

8. Management is incorporating interest rate cuts in 2025

We expect 2025 NII ex Markets to be approximately $90 billion. Going through the drivers, as usual, the outlook assumes that rates follow the forward curve. It’s worth noting that the NII decrease is driven by both the cut expected in 2025 and the impact of the 100 basis points of cuts in the back half of 2024. 

9. Management expects credit card net charge-offs in 2025 of 3.6%, up from 3.34% in 2024

On credit, we expect the 2025 card net charge-off rate to be in line with our previous guidance of approximately 3.6%.

10. Management has extra capital for JPMorgan as they think there’s a good chance the bank can deploy the capital at better prices in the future, but they’re not increasing the size of the extra capital

The way we’re thinking about it right now is that we feel very comfortable with the notion that it makes sense for us to have a nice store of extra capital in light of the current environment. We believe there is a good chance that there will be a moment where we get to deploy it at better levels essentially in whatever way than the current opportunities would suggest. And so that feels like a correct kind of strategic and financial decision for us. Having said that, having studied it quite extensively over the last 6 months and have all these debates you would expect, we’ve concluded that we do have enough. We have not [indiscernible]. And given that, we would like to not have the excess grow from here.

11. The mortgage market for housing looks poor given the high interest rates there

You know well the state of the mortgage market given rates. 

12. Management thinks that the biggest sources of risk to the credit market are unemployment and stagflation

Just the biggest driver of credit has been and always will be unemployment, both on the consumer side and it feeds into the corporate side. It feeds into mortgages, subprime, credit card. So really it’s your forecast of unemployment. You have to make your own, which will determine that over time. And so the second thing you said vulnerabilities. It’s unemployment, but the worst case would be stagflation. High rates with higher unemployment will drive higher credit losses literally across the board. I’m not — we’re not predicting that, but you just ask for the vulnerabilities. That’s the vulnerabilities.


Disclaimer: The Good Investors is the personal investing blog of two simple guys who are passionate about educating Singaporeans about stock market investing. By using this Site, you specifically agree that none of the information provided constitutes financial, investment, or other professional advice. It is only intended to provide education. Speak with a professional before making important decisions about your money, your professional life, or even your personal life. I don’t have a vested interest in any company mentioned. Holdings are subject to change at any time.

6 Things I’m Certain Will Happen In The Financial Markets In 2025

There are so many things that can happen, but here are six things that I’m certain will happen in the financial markets in 2025.

There are no guarantees in the world of investing… or are there? Here are six things I’m certain will happen in 2025.

1. There will be something huge to worry about in the financial markets.

Peter Lynch is the legendary manager of the Fidelity Magellan Fund who earned a 29% annual return during his 13-year tenure from 1977 to 1990. He once said:

“There is always something to worry about. Avoid weekend thinking and ignore the latest dire predictions of the newscasters. Sell a stock because the company’s fundamentals deteriorate, not because the sky is falling.”

Imagine a year in which all the following happened: (1) The US enters a recession; (2) the US goes to war in the Middle East; and (3) the price of oil doubles in three months. Scary? Well, there’s no need to imagine: They all happened in 1990. And what about the S&P 500? It has increased by more than 1,600% from the start of 1990 to today, even without counting dividends.

There will always be things to worry about. But that doesn’t mean we shouldn’t invest.

2. Individual stocks will be volatile.

From 1997 to today, the maximum peak-to-trough decline in each year for Amazon.com’s stock price ranged from 12.6% to 83.0%. In other words, Amazon’s stock price had suffered a double-digit fall every single year for 27 years. Meanwhile, the same Amazon stock price had climbed by an astonishing 233,924% (from US$0.098 to more than US$229) over the same period. 

If you’re investing in individual stocks, be prepared for a wild ride. Volatility is a feature of the stock market – it’s not a sign that things are broken. 

3. US-China relations will either remain status quo, intensify, or blow over.

“Seriously!?” I can hear your thoughts. But I’m stating the obvious for a good reason: We should not let our views on geopolitical events dictate our investment actions. Don’t just take my words for it. Warren Buffett himself said so. In his 1994 Berkshire Hathaway shareholders’ letter, Buffett wrote (emphases are mine):

“We will continue to ignore political and economic forecasts, which are an expensive distraction for many investors and businessmen. 

Thirty years ago, no one could have foreseen the huge expansion of the Vietnam War, wage and price controls, two oil shocks, the resignation of a president, the dissolution of the Soviet Union, a one-day drop in the Dow of 508 points, or treasury bill yields fluctuating between 2.8% and 17.4%.

But, surprise – none of these blockbuster events made the slightest dent in Ben Graham’s investment principles. Nor did they render unsound the negotiated purchases of fine businesses at sensible prices. 

Imagine the cost to us, then, if we had let a fear of unknowns cause us to defer or alter the deployment of capital. Indeed, we have usually made our best purchases when apprehensions about some macro event were at a peak. Fear is the foe of the faddist, but the friend of the fundamentalist.

A different set of major shocks is sure to occur in the next 30 years. We will neither try to predict these nor to profit from them. If we can identify businesses similar to those we have purchased in the past, external surprises will have little effect on our long-term results.”

From 1994 to the third quarter of 2024, Berkshire Hathaway’s book value per share, a proxy for the company’s intrinsic value – albeit a flawed measure – grew by 13.5% annually. Buffett’s disciplined focus on long-term business fundamentals – while ignoring the distractions of political and economic forecasts – has worked out just fine.

4. Interest rates will move in one of three ways: Sideways, up, or down.

“Again, Captain Obvious!?” Please bear with me. There is a good reason why I’m stating the obvious again.

Much ado has been made about what central banks have been doing, and would do, with their respective economies’ benchmark interest rates. This is because of the theoretical link between interest rates and stock prices.

Stocks and other asset classes (bonds, cash, real estate etc.) are constantly competing for capital. In theory, when interest rates are high, the valuation of stocks should be low, since the alternative to stocks – bonds – are providing a good return. On the other hand, when interest rates are low, the valuation of stocks should be high, since the alternative – again, bonds – are providing a poor return. 

But what does reality say? Here’re important historical data on the actual relationship between interest rates and stocks in the US:  

  • Rising interest rates have been met with rising valuations. According to data from economist and Nobel Laureate Robert Shiller, the US 10-year Treasury yield was 2.3% at the start of 1950. By September 1981, it had risen to 15.3%, the highest rate recorded in Shiller’s dataset. In that same period, the S&P 500’s price-to-earnings (P/E) ratio moved from 7 to 8. In other words, the P/E ratio for the S&P 500 increased slightly despite the huge jump in interest rates. It’s worth noting too that the S&P 500’s P/E ratio of 7 at the start of 1950 was not a result of earnings that were temporarily inflated. Yes, there’s cherry picking with the dates. For example, if I had chosen January 1946 as the starting point, when the US 10-year Treasury yield was 2.2% and the P/E ratio for the S&P 500 was 19, then it would be a case of valuations falling alongside rising interest rates. But this goes to show that while interest rates have a role to play in the movement of stocks, it is far from the only thing that matters.
  • Stocks have climbed in rising interest rate environments. In a September 2022 piece, Ben Carlson, Director of Institutional Asset Management at Ritholtz Wealth Management, showed that the S&P 500 climbed by 21% annually from 1954 to 1964 even when the yield on 3-month Treasury bills (a good proxy for the Fed Funds rate, which is the key interest rate set by the Federal Reserve) surged from around 1.2% to 4.4% in the same period. In the 1960s, the yield on the 3-month Treasury bill doubled from just over 4% to 8%, but US stocks still rose by 7.7% per year. And then in the 1970s, rates climbed from 8% to 12% and the S&P 500 still produced an annual return of nearly 6%.
  • Stocks have done poorly in both high and low interest rate environments, and have also done well in both high and low interest rate environments. Carlson published an article in February 2023 that looked at how the US stock market performed in different interest rate regimes. It turns out there’s no clear link between the two. In the 1950s, the 3-month Treasury bill (which is effectively a risk-free investment, since it’s a US government bond with one of the shortest maturities around) had a low average yield of 2.0%; US stocks returned 19.5% annually back then, a phenomenal gain. In the 2000s, US stocks fell by 1.0% per year when the average yield on the 3-month Treasury bill was 2.7%. Meanwhile, a blockbuster 17.3% annualised return in US stocks in the 1980s was accompanied by a high average yield of 8.8% for the 3-month Treasury bill. In the 1970s, the 3-month Treasury bill yielded a high average of 6.3% while US stocks returned just 5.9% per year. 
  • A cut in interest rates by the Federal Reserve is not guaranteed to be a good or bad event for stocks. Josh Brown, CEO of Ritholtz Wealth Management, shared fantastic data in an August 2024 article on how US stocks have performed in the past when the Federal Reserve lowered interest rates. His data, in the form of a chart, goes back to 1957 and I reproduced them in tabular format in Table 1; it shows how US stocks did in the next 12 months following a rate cut, as well as whether a recession occurred in the same window. I also split the data in Table 1 according to whether a recession had occurred shortly after a rate cut, since eight of the 21 past rate-cut cycles from the Federal Reserve since 1957 took place without an impending recession. Table 2 shows the same data as Table 1 but for rate cuts with a recession; Table 3 is for rate cuts without a recession. What the data show is that US stocks have historically done well, on average, in the 12 months following a rate-cut. The overall record, seen in Table 1, is an average 12-month forward return of 9%. When a recession happened shortly after a rate-cut, the average 12-month forward return is 8%; when a recession did not happen shortly after a rate-cut, the average 12-month forward return is 12%. A recession is not necessarily bad for stocks. As Table 2 shows, US stocks have historically delivered an average return of 8% over the next 12 months after rate cuts that came with impending recessions. It’s not a guarantee that stocks will produce good returns in the 12 months after a rate cut even if a recession does not occur, as can be seen from the August 1976 episode in Table 3.
Table 1; Source: Josh Brown
Table 2; Source: Josh Brown
Table 3; Source: Josh Brown

It turns out that the actual relationship between interest rates and stocks is not as clear-cut as theory suggests. There’s an important lesson here, in that one-factor analysis in finance – “if A happens, then B will occur” – should be largely avoided because clear-cut relationships are rarely seen.  

I also think that time that’s spent watching central banks’ decisions regarding interest rates will be better spent studying business fundamentals. The quality of a company’s business and the growth opportunities it has matter far more to its stock price over the long run than interest rates. 

Sears is a case in point. In the 1980s, the US-based company was the dominant retailer in the country. Morgan Housel wrote in a blog post, Common Plots of Economic History :

“Sears was the largest retailer in the world, housed in the tallest building in the world, employing one of the largest workforces.

“No one has to tell you you’ve come to the right place. The look of merchandising authority is complete and unmistakable,” The New York Times wrote of Sears in 1983.

Sears was so good at retailing that in the 1970s and ‘80s it ventured into other areas, like finance. It owned Allstate Insurance, Discover credit card, the Dean Witter brokerage for your stocks and Coldwell Banker brokerage for your house.”

US long-term interest rates fell dramatically from around 15% in the early-to-mid 1980s to around 3% in 2018. But Sears filed for bankruptcy in October 2018, leaving its shareholders with an empty bag. In his blog post mentioned earlier, Housel also wrote:

“Growing income inequality pushed consumers to either bargain or luxury goods, leaving Sears in the shrinking middle. Competition from Wal-Mart and Target – younger and hungrier – took off.

By the late 2000s Sears was a shell of its former self. “YES, WE ARE OPEN” a sign outside my local Sears read – a reminder to customers who had all but written it off.” 

If you’re investing for the long run, there are far more important things to watch than interest rates.

5. There will be investors who are itching to make wholesale changes to their investment portfolios for 2025.

Ofer Azar is a behavioural economist. He once studied more than 300 penalty kicks in professional football (or soccer) games. The goalkeepers who jumped left made a save 14.2% of the time while those who jumped right had a 12.6% success rate. Those who stayed in the centre of the goal saved a penalty 33.3% of the time.

Interestingly, only 6% of the keepers whom Azar studied chose to stay put in the centre. Azar concluded that the keepers’ moves highlight the action bias in us, where we think doing something is better than doing nothing. 

The bias can manifest in investing too, where we develop the urge to do something to our portfolios, especially during periods of volatility. We should guard against the action bias. This is because doing nothing to our portfolios is often better than doing something. I have two great examples. 

First is a paper published by finance professors Brad Barber and Terry Odean in 2000. They analysed the trading records of more than 66,000 US households over a five-year period from 1991 to 1996. They found out that the most frequent traders generated the lowest returns – and the difference is stark. The average household earned 16.4% per year for the timeframe under study but the active traders only made 11.4% per year.

Second, finance professor Jeremy Siegel discovered something fascinating in the mid-2000s. In an interview with Wharton, Siegel said:

“If you bought the original S&P 500 stocks, and held them until today—simple buy and hold, reinvesting dividends—you outpaced the S&P 500 index itself, which adds about 20 new stocks every year and has added almost 1,000 new stocks since its inception in 1957.”

Doing nothing beats doing something. 

6. There are nearly 8.2 billion individuals in the world today, and the vast majority of us will wake up every morning wanting to improve the world and our own lot in life.

This motivation is ultimately what fuels the global economy and financial markets. There are miscreants who appear occasionally to mess things up, but we should have faith in the collective positivity of humankind. We should have faith in us. The idiots’ mess will be temporary.

Mother Nature threw us a major global health threat in 2020 with COVID-19. But we – mankind – managed to build a vaccine against the disease in record time; Moderna even managed to design its vaccine in just two days. This is a great example of the ingenuity of humanity at work.

To me, investing in stocks is the same as having the long-term view that we humans are always striving, collectively, to improve the world.      

A final word

This article is a little cheeky, because it describes incredibly obvious things, such as “interest rates will move in one of three ways: sideways, up, or down.” But I wrote it in the way I did for a good reason. A lot of seemingly important things in finance – things with outcomes that financial market participants obsess over and try to predict – actually turn out to be mostly inconsequential for long-term investors. Keep this in mind when you read the next “X Things That Will Happen To Stocks in 2025” article.


Disclaimer: The Good Investors is the personal investing blog of two simple guys who are passionate about educating Singaporeans about stock market investing. By using this Site, you specifically agree that none of the information provided constitutes financial, investment, or other professional advice. It is only intended to provide education. Speak with a professional before making important decisions about your money, your professional life, or even your personal life. I currently have a vested interest in Amazon. Holdings are subject to change at any time. 

Applying The Lessons Learnt From 2021

Don’t make the mistakes of the past.

With stock markets making new highs this year, it is a good time to look back at the lessons learnt from the collapse of some tech stocks in 2021 and 2022.

Back then, stock markets also reached all-time highs but many tech names collapsed as valuations compressed and growth stalled.

With this in mind, here are some of the key things to be mindful of today as we navigate the stock market.

Don’t celebrate too soon

Investing is a long game. Just because our stocks have risen does not mean we have won the game. The true test of a business’s strength is its enduring ability to keep on growing. Stock prices may also reflect unwarranted short term optimism which does not materialise. 

Make sure to continuously assess the fundamentals of your portfolio companies even (especially) if its stock price has skyrocketed.

Don’t chase stocks

It may be tempting to buy stocks that have risen greatly in a short period of time. Afterall, none of us want to be left out of a massive rally. 

But this fear of missing out can work against us as stocks don’t keep going up forever. Remember that valuations matter and we need to assess if a stock has gone up too much over a short period of time.

In 2021, many stocks rose to unsustainable valuations, only to come crashing down to earth in the next two years. While some have recovered, many still linger up to 90% off their all-time highs.

Sell if valuations don’t make sense

Buy-and-hold is a great strategy when markets are working smoothly and you’ve bought into great growing companies at reasonable valuations. 

But when stock markets are not working well and stock prices rise too high due to unwarranted exuberance, it may be important to look at your sell strategies.

Back in 2021, the stocks of many companies skyrocketed. It was not uncommon to see stocks rise by up to 1,000% in a short period of time.

While some of these companies are undoubtedly growing fast and are resilient, the valuations reached a point where forward returns would likely be depressed. Unsurprisingly, many of these companies’ stocks plunged and have yet to recover.

Growth trends may not continue

It may be tempting to look at a company’s recent revenue and profit growth and assume that it can continue growing at that rate for a long period of time. The reality is that future growth trends may not always mirror the past. This is especially true for companies that have been growing at unsustainably high rates. More often than not, growth will fall back to more normal rates.

The poster boy of the COVID collapse is probably Zoom Communications. The company saw explosive growth, only for its growth rates to flat-line once the pandemic ended.

Besides Zoom, there are numerous other companies that also saw growth decelerate meaningfully as we exited the pandemic era.

These companies unsurprisingly have seen their stock prices collapse.

Look for recurring revenue

Many companies can experience significant upmarkets due to upgrade cycles or loose monetary policy which encourage unsustainable consumer and business spending. However, remember that many companies do experience significant swings in revenue because of the cyclical nature of their end-demand. This may be more true for hardware companies or those that sell big ticket items.

Companies such as Enphase, which sells solar power products such as microinverters, have seen their revenues crater as distributors struggle to clear inventory because of weak end-customer demand.

Bottom line

Although it is nice to see stock prices rise significantly in the past two years, it is important that we remember the key tenets of value investing. The above mistakes are some that many of us have made before.

This time around, let’s try to ensure that we maintain a portfolio of stocks that have good valuations and whose business can continue to thrive in good times and in bad.


Disclaimer: The Good Investors is the personal investing blog of two simple guys who are passionate about educating Singaporeans about stock market investing. By using this Site, you specifically agree that none of the information provided constitutes financial, investment, or other professional advice. It is only intended to provide education. Speak with a professional before making important decisions about your money, your professional life, or even your personal life. I have a vested interest in Zoom Communications. Holdings are subject to change at any time.

The Latest Thoughts From American Technology Companies On AI (2024 Q3)

A collection of quotes on artificial intelligence, or AI, from the management teams of US-listed technology companies in the 2024 Q3 earnings season.

The way I see it, artificial intelligence (or AI), really leapt into the zeitgeist in late-2022 or early-2023 with the public introduction of DALL-E2 and ChatGPT. Both are provided by OpenAI and are software products that use AI to generate art and writing, respectively (and often at astounding quality). Since then, developments in AI have progressed at a breathtaking pace.

With the latest earnings season for the US stock market – for the third quarter of 2024 – coming to its tail-end, I thought it would be useful to collate some of the interesting commentary I’ve come across in earnings conference calls, from the leaders of technology companies that I follow or have a vested interest in, on the topic of AI and how the technology could impact their industry and the business world writ large. This is an ongoing series. For the older commentary:

With that, here are the latest commentary, in no particular order:

Airbnb (NASDAQ: ABNB)

Airbnb’s management recently introduced a personalised welcome tour of the Airbnb app for first-time users; management sees this personalisation feature as the beginning of a more personalised Airbnb

We also introduced 50 upgrades for guests that make Airbnb a more intuitive and personalized app. And some of the features include a personalized welcome tour of the app for first-time guests, suggest a destination when guests tap the search bar, we’ll recommend locations on their search and booking history, and personalized listing highlights. So when a guest views a listing, we will highlight the details that are relevant to their search, and there are dozens of new features just like these. This is quite literally the beginning of a more personalized Airbnb.

Airbnb’s management is seeing great progress on AI-powered customer service; management sees 3 phases to the deployment of AI-powered customer service, where Phase 1 is Airbnb using AI to answer basic questions from customers, Phase 2 is the AI answering questions from customers in a personalised way, and Phase 3 is the AI taking personalised actions on behalf of customers; management thinks that Airbnb has hired some of the best AI talent to develop AI-powered customer service

We are seeing some really great progress on AI-powered customer service. The way we think about customer service, powered by AI is in 3 phases…

…Phase 1 is just answer basic general questions. We’re rolling out a pilot that can answer basic general questions. Phase 2 is personalization, be able to personalize the questions. Phase 3 is to take action…

…So this is where we think customer service can go enabled by AI, and we’ve hired some of the best people in the world to work on this.

Airbnb is currently in Phase 1 of deploying AI-powered customer service; management thinks that the vast majority of customer chats that are received by Airbnb will be handled directly by AI agents in the future

Phase 1 is the phase we’re in right now. If you were to most — first of all, most of our customer context, we get over 10 million contacts a year. Most of the contacts that we anticipate getting in the coming years aren’t going to be phone calls. They’re going to be chatting through the app. I really personally don’t like calling customer service and having to dial them. I want to be able to chat, and chat AI can intercept. And so we think in the future, the vast majority of our chats are going to be intercepted and handled directly by the AI agent.

An example of the 3rd phase of Airbnb’s AI-powered customer service that management has in mind: An AI agent can help customers to cancel bookings and even make rebookings 

So I’ll give you an example. Let me just give you 1 example. Let’s say I were to contact customer service and I say, “how do I cancel reservation?” In Phase 1, what we’re doing now, the AI agent will answer probably even better than the average customer service agent, how to cancel a reservation. So we’ll take you to how to cancel a reservation step by step. Phase 2 personalization, they’ll say, “hey, Brian, I see you have a reservation coming up in Los Angeles next week. Here’s how you cancel that reservation.” And Phase III is taking action. It would say, “hey, Brian, I see you have a reservation come to Los Angeles. Would you like me to cancel it for you? Just tell me, yes, and I’ll do it for you. I can even handle rebooking.”

Alphabet (NASDAQ: GOOG)

Alphabet’s management thinks Alphabet is positioned to lead in AI because of the company’s full-stack approach of a robust AI infrastructure, world-class research team, and broad user-reach

We are uniquely positioned to lead in the era of AI because of our differentiated full stack approach to AI innovation, and we are now seeing this operate at scale. There’s 3 components: first, a robust AI infrastructure that includes data centers, chips and a global fiber network; second, world-class research teams who are advancing our work with deep technical AI research and who are also building the models that power our efforts. And third, a broad global reach through products and platforms that touch billions of people and customers around the world, creating a virtuous cycle.

Alphabet signed the world’s first corporate agreement for energy from multiple small modular nuclear reactors; the reactors will deliver 500 megawatts of carbon-free power 24/7

We are also making bold clean energy investments, including the world’s first corporate agreement to purchase nuclear energy from multiple small modular reactors, which will enable up to 500 megawatts of new 24/7 carbon-free power.

Since Alphabet began testing AI overviews 18 months ago, the company has reduced the cost to deliver queries by 90% while doubling the size of its Gemini foundation AI model; AI overview has led to users coming to Search more often; AI overview was recently rolled out to 100 new countries and territories and will reach more than 1 billion users on a monthly basis; there’s strong engagement in AI overview, leading to higher overall search usage and user satisfaction, and users are asking longer questions and exploring more websites; the growth driven by AI overviews is increasing over time; the integration of advertising with AI overviews  is performing well; Alphabet is now showing search and shopping ads within AI overviews for mobile users in the USA; management finds that users find ads within AI overviews to be helpful; management expects Search to evolve significant in 2025, driven by advances in AI; management is seeing the monetisation rate on ads within AI overviews to be the same as the broader Search; reminder from management that Google already introduced an answer-machine 10 years ago and management is aware of changing trends in user behaviours in Search

Since we first began testing AI overviews, we have lowered machine cost per query significantly. In 18 months, we reduced cost by more than 90% for these queries through hardware, engineering and technical breakthroughs while doubling the size of our custom Gemini model…

…In Search, recent advancements, including AI overviews, Circle to Search and new features in :ens are transforming the user experience, expanding what people can search for and how they search for it. This leads to users coming to search more often for more of their information needs driving additional search queries. Just this week, AI overview started rolling out to more than 100 new countries and territories. It will now reach more than 1 billion users on a monthly basis. We are seeing strong engagement, which is increasing overall search usage and user satisfaction. People are asking longer and more complex questions and exploring a wide range of websites. What’s particularly exciting is that this growth actually increases over time as people learn that Google can answer more of their questions.

The integration of ads within AI overviews is also performing well, helping people connect with businesses as they search…

…AI overviews, where we have now started showing search and shopping ads within the overview for mobile users in the U.S. As you remember, we’ve already been running ads above and below AI overviews. We’re now seeing that people find ads directly within AI overview is helpful because they can quickly connect with relevant businesses, products and services to take the next step at the exact moment they need…

… So I expect Search to continue to evolve significantly in 2025, both in the search product and in Gemini…

…We recently launched ads within AI overviews on mobile in the U.S. And this really builds on our previous rollout of ads above and below the AI overviews. So overall, for AI overviews, we see monetization at approximately the same rate, which gives us a really strong base on which we can innovate even more…

…[Question] Why doesn’t it make sense to have 2 completely different search experiences one an agent like answers engine; and then two, a link-based more traditional search engine? 

[Answer] In this moment, people are using a lot of buzz words like answer engines and all that stuff. I mean Google started answering questions about 10 years ago in our search product with featured snippets. So look, I think, ultimately, you are serving users. User expectations are constantly evolving. And and we work hard to stay a step ahead, anticipate and stay a step ahead.

Alphabet uses and offers customers both its own TPUs (tensor processing units) and Nvidia GPUs; Alphabet is now on the 6th generation of its TPUs, known as Trillium; LG AI Research reduced its inference processing time by 50% and operating costs by 72% using Google Cloud’s TPUs and GPUs; Alphabet will be one of the first companies to provide Nvidia’s GB 200s at scale; management thinks TPUs have very attractive pricing for its capability

We use and offer our customers a range of AI accelerator options, including multiple classes of NVIDIA GPUs and our own custom-built GPUs. We are now on the sixth generation of TPUs known as Trillium and continue to drive efficiencies and better performance with them…

…Using a combination of our TPUs and GPUs, LG AI research reduced inference processing time for its multimodal model by more than 50% and operating costs by 72%…

…We have a wonderful partnership with NVIDIA. We are excited for the GB 200s and will be one of the first to provide it at scale…

…On your first part of the question on the TPUs. If you look at the flash pricing, we’ve been able to deliver externally, I think and how much more attractive it is compared to other models of that capability.

Usage of Alphabet’s Gemini foundation AI model is in a period of dramatic growth by any measure; improvements to Gemini will soon come; all 7 of Alphabet’s products that have more than 2 billion monthly users each use Gemini models; Gemini is now available on GitHub Copilot; Gemini API calls were up 14x in a 6-month period; Snap saw a 2.5 times increase in engagement with its MyAI chatbot after choosing Gemini to power the chatbot’s user experiences; Gemini’s integration with Android is improving Android; the latest Samsung Galaxy devices’ Android operating system has Gemini Live for users to converse with the Gemini model; Alphabet’s latest Pixel 9 devices have Gemini Nano within; development of the third generation of the Gemini model is progressing well; see Point 23 for how Gemini is helping advertisers

By any measure, token volume, API calls, consumer usage business adoption, usage of the Gemini models is in a period of dramatic growth, and our teams are actively working on performance improvements and new capabilities for our range of models. Stay tuned…

… Today, all 7 of our products and platforms with more than 2 billion monthly users use Gemini models, that includes the latest product to surpass the 2 billion user milestone Google Maps…

…Today, we shared that Gemini is now available on GitHub Copilot with more to come…

…Gemini API calls have grown nearly 14x in a 6-month period. When Snap was looking to power more innovative experiences within their MyAI chatbot, they chose Gemini’s strong multimodal capabilities. Since then, Snap saw over 2.5x as much engagement with MyAI in the United States…

… Gemini’s deep integration is improving Android. For example, Gemini Live lets you have free-flowing conversations with Gemini. People love it. It’s available on Android, including Samsung Galaxy devices. We continue to work closely with them to deliver innovations across their newest devices with much more to come. At Made by Google, we unveiled our latest Pixel 9 series of devices featuring advanced AI models, including Gemini Nano. We have seen strong demand for these devices, and they’ve already received multiple awards…

…We’ve had 2 generations of Gemini model. We are working on the third generation, which is progressing well.

Alphabet’s Project Astra will allow AI to see and reason about the physical world around users, and management aims to ship it as early as 2025

When they’re building out experiences where AI can see and reason about the world around you, Project Astra is a glimpse of that future. We are working to ship experiences like this as early as 2025.

Alphabet is using AI internally to improve coding productivity and efficiency; a quarter of new code at Google is now generated by AI

We’re also using AI internally to improve our coding processes, which is boosting productivity and efficiency. Today, more than 1/4 of all new coated Google is generated by AI, then reviewed and accepted by engineers. This helps our engineers do more and move faster. 

Circle to Search is now available on more than 150 million Android devices; a third of users who have tried Circle to Search now use it weekly; Circle to Search has higher engagement with younger users

Circle to Search is now available on over 150 million Android devices with people using it to shop, translate text and learn more about the world around them. 1/3 of the people who have tried Circle to Search now use it weekly, a testament to its helpfulness and potential…

… For example, with Circle to Search, where we see higher engagement from users aged 18 to 24.  

Lens is now used in over 20 billion visual searches per month; Lens is one of the fastest-growing query types management has seen on Search; management started testing product search on Lens in October and found that shoppers are more likely to engage; management is seeing users use Lens for complex multimodal queries; Alphabet has rolled out shopping ads with Lens visual research results to better connect consumers and businesses.

Lens is now used for over 20 billion visual searches per month. Lens is one of the fastest-growing query types we see on search because of its ability to answer complex multimodal questions and help in product discovery and shopping…

…In early October, we announced product search on Google Lens and in testing this feature, we found that shoppers are more likely to engage with content in this new format. We’re also seeing that people are turning to Lens more often to run complex multimodal quarries, voicing a question or inputting text in addition to a visual. Given these new user behaviors earlier this month, we announced the rollout of shopping ads above and alongside relevant Lens visual search results to help better connect consumers and businesses. 

Customers are using Google Cloud’s AI products in 5 different ways: (1) for AI hardware and software infrastructure; (2) for building and customising AI models with Vertex; (3) for combining Google Cloud’s AI platform with its data platform; (4) for AI-powered cybersecurity solutions; and (5) for building AI agents to improve customer engagement

Customers are using our products in 5 different ways. First, our AI infrastructure. which we differentiate with leading performance driven by storage, compute and software advances as well as leading reliability and a leading number of accelerators…

…Second, our enterprise AI platform, Vertex is used to build and customize the best foundation models from Google and the industry…

…Third, customers use our AI platform together with our data platform, Big Query, because we analyze multimodal data no matter where it is stored with ultra low latency access to Gemini…

…Fourth, our AI-powered cybersecurity solutions Google threat intelligence and security operations are helping customers like BBVA and Deloitte, prevent deduct and respond to cybersecurity threats much faster…

… Fifth, in Q3, we broadened our applications portfolio with the introduction of our new customer engagement suite. It’s designed to improve the customer experience online and in mobile apps as well as in call centers, retail stores and more. 

Waymo is the biggest part of Alphabet’s Other Bets portfolio; Alphabet’s management thinks Waymo is the clear technical leader in autonomous vehicles; Waymo is now serving 150,000 paid rides weekly and driving 1 million fully autonomous miles, and is the first autonomous vehicle company to reach these milestones; Waymo is partnering with Uber and Hyundai to deliver autonomous vehicles to more consumers; Waymo is now on its sixth generation system

I want to highlight Waymo, the biggest part of our portfolio. Waymo is now a clear technical leader within the autonomous vehicle industry and creating a growing commercial opportunity. Over the years, Waymo has been infusing cutting edge AI into its work. Now each week, Waymo is driving more than 1 million fully autonomous miles and serves over 150,000 paid rides, the first time any AV company has reached this kind of mainstream use. Through its expanded network and operations partnership with Uber in Austin and Atlanta, plus a new multiyear partnership with Hyundai, Waymo will bring fully autonomous driving to more people and places. By developing a universal driver, Waymo has multiple paths to market. And with its sixth generation system, Waymo significantly reduced unit costs without compromising safety.

Alphabet’s management finds that AI helps Alphabet better understand consumer-intent and connect consumers with advertisers

AI is expanding our ability to understand intent and connect it to our advertisers. This allows us to connect highly relevant users with the most helpful ad and deliver business impact to our customers.

Advertisers are using Gemini to build and test more creatives at scale; Audi worked with Gemini tools to increase website visits by 80% and increase clicks by 2.7 times

Advertisers now use our Gemini power tools to build and test a larger variety of relevant creators at scale. Audi used our AI tools to generate multiple video image and text assets in different links and orientations out of existing long-form videos. They then fed the newly generated creatives into Demand Gen to drive reach, traffic and booking to their driving experience. The campaign increased website visits by 80% and increased clicks by 2.7x, delivering a lift in their sales. 

Alphabet is offering AI-powered campaigns to help advertisers achieve faster feedback on what is working; DoorDash saw a 15x higher conversion rate at a 50% more efficient cost per action

AI-powered campaigns help advertisers get faster feedback on what creatives workwear and redirect the media buying. Using Demand Gen, DoorDash tested a mix of image and video assets to drive more impact across Google and YouTube’s visually immersive surfaces. They saw a 15x higher conversion rate at a 50% more efficient cost per action when compared to video action campaigns alone. 

Alphabet is using AI to help advertisers better measure their advertising results

This quarter, we extended the availability of our open source marketing mix model, Meridian to more customers, helping to scale measurement of cross-channel budgets to drive better business outcomes.

Alphabet’s big jump capex in 2024 Q3 (was $8.1 billion in 2023 Q3) was mostly for technical infrastructure, in the form of servers and data centers; management expects Alphabet’s 2024 Q4 capex to be similar to what was seen in 2024 Q3; Alphabet announced more than $7 billion in planned data center investments in 2024 Q3, with $6 billion in the USA; management expects further growth in capex in 2025, but not at the same percentage increase seen from 2023 to 2024; the use of TPUs at Alphabet helps to drive efficiencies

With respect to CapEx, our reported CapEx in the third quarter was $13 billion, reflecting investment in our technical infrastructure with the largest component being investment in servers, followed by data centers and networking equipment. Looking ahead, we expect quarterly CapEx in the fourth quarter to be at similar levels to Q3…

…In the third quarter alone, we made announcements of over $7 billion in planned data center investments with nearly $6 billion of that in the U.S…

…As you saw in the quarter, we invested $13 billion in CapEx across the company. And as you think about it, it really is divided into 2 categories. One is our technical infrastructure, and that’s the majority of that $13 billion. And the other one goes into areas such as facilities, the bets and other areas across the company. Within TI, we have investments in servers, which includes both TPUs and GPUs. And then the second categories are data centers and networking equipment. This quarter, approximately 60% of that investments in technical infrastructure went towards servers and about 40% towards data center and networking equipment…

…And as you think about the next quarter and going into next year, as I mentioned in my prepared remarks, we will be investing in Q4 at approximately the same level of what we’ve invested in Q3, approximately $13 billion. And as we think into 2025, we do see an increase coming in 2025, and we will provide more color on that on the Q4 call, likely not the same percent step-up that we saw between ’23 and ’24, but additional increase…

…On your first part of the question on the TPUs. If you look at the flash pricing, we’ve been able to deliver externally, I think and how much more attractive it is compared to other models of that capability. I think probably that gives a good sense of the efficiencies we can generate from our architecture. And so — and we are doing the same that for internal use as well. The models for search while they keep going up in capability we’ve been able to really optimize them for the underlying architecture, and that’s where we are seeing a lot of efficiencies as well.  

Amazon (NASDAQ: AMZN)

Amazon’s management believes that AI will be a big piece of the company’s robotics efforts in its fulfilment network

We continue to innovate in robotics to speed delivery, lower cost to serve, and further improve safety in our fulfillment network…

…We really do believe that AI is going to be a big piece of what we do in our robotics network. We had a number of efforts going on there. We just hired a number of people from an incredibly strong robotics AI organization. And I think that will be a very central part of what we do moving forward, too. 

Amazon’s management sees customers focused on new cloud computing efforts again, and the modernisation of their infrastructure, by migrating to the cloud, is important if they want to work on generative AI at scale

Companies are focused on new efforts again, spending energy on modernizing their infrastructure from on-premises to the cloud. This modernization enables companies to save money, innovate more quickly, and get more productivity from their scarce engineering resources. However, it also allows them to organize their data in the right architecture and environment to do generative AI at scale. It’s much harder to be successful and competitive in generative AI if your data is not in the cloud.

AWS has released nearly twice as many AI features in the last 18 months as other leading cloud providers combined; AWS’s AI business is growing at a triple digit rate at a multi-billion revenue run rate; AWS’s AI business is currently growing more than 3x faster than AWS itself grew when AWS was at a similar stage; management sees AI as an unusually large opportunity

In the last 18 months, AWS has released nearly twice as many machine learning and gen AI features as the other leading cloud providers combined. AWS’ AI business is a multibillion-dollar revenue run rate business that continues to grow at a triple-digit year-over-year percentage and is growing more than 3x faster at this stage of its evolution as AWS itself grew, and we felt like AWS grew pretty quickly…

…It is a really unusually large, maybe once-in-a-lifetime type of opportunity. And I think our customers, the business, and our shareholders will feel good about this long term that we’re aggressively pursuing it.

Amazon has a good relationship with NVIDIA, but management have heard from customers that they want better price performance on their AI workloads, and so AWS developed its own AI chips for training and inference; AWS’s second version of its AI chip for model-training, Trainium 2, will ramp up in the next few weeks; management thinks Trainium 2 have very compelling price performance; management is seeing significant interest in Trainium 2, to the extent they have to increase manufacturing orders much more than originally planned

While we have a deep partnership with NVIDIA, we’ve also heard from customers that they want better price performance on their AI workloads. As customers approach higher scale in their implementations, they realize quickly that AI can get costly. It’s why we’ve invested in our own custom silicon in Trainium for training and Inferentia for inference. The second version of Trainium, Trainium2, is starting to ramp up in the next few weeks and will be very compelling for customers on price performance. We’re seeing significant interest in these chips, and we’ve gone back to our manufacturing partners multiple times to produce much more than we’d originally planned…

…We have a very deep partnership with NVIDIA, we tend to be their lead partner on most of their new chips. We were the first to offer H200s in EC2 instances. And I expect us to have a partnership for a very long time that matters.

Amazon’s management is seeing more model builders standardise on SageMaker,  AWS’s fully-managed AI service; SageMaker’s hyperpod capability helps save model-training time by up to 40%

We also continue to see increasingly more model builders standardize on Amazon SageMaker, our service that makes it much easier to manage your AI data, build models, experiment, and deploy to production. This team continues to add features at a rapid clip punctuated by SageMaker’s unique hyperpod capability, which automatically splits training workloads across more than 1,000 AI accelerators, prevents interruptions by periodically saving checkpoints, and automatically repairing faulty instances from their last saved checkpoint, and saving training time by up to 40%.

Amazon’s management believes Amazon Bedrock, AWS’s AI-models-as-a-service offering for companies that want to leverage existing foundation models for customisation, has the broadest selection of leading foundation models; Bedrock recently added Anthropic’s Claude 3.5 Sonnet model, Meta’s Llama 3.2 models, and more; management is seeing companies use models from different providers within the same application and Bedrock makes it easy to orchestrate the disparate models; Bedrock also helps companies with model-access, prompt engineering, and lowering inference costs

At the middle layer where teams want to leverage an existing foundation model, customized with their data, and then have features to deploy high-quality generative AI applications, Amazon Bedrock has the broadest selection of leading foundation models and most compelling modules for key capabilities like model valuation, guardrails, rag and agents. Recently, we’ve added Anthropic’s Claude 3.5 Sonnet model, Meta’s Llama 3.2 models, Mistral’s Large 2 models and multiple-stability AI models. We also continue to see teams use multiple model types from different model providers and multiple model sizes in the same application.  There’s mucking orchestration required to make this happen. And part of what makes Bedrock so appealing to customers and why it has so much traction is that Bedrock makes this much easier. Customers have many other requests: access to even more models, making prompt management easier, further optimizing inference costs. And our Bedrock team is hard at work making this happen.

Amazon’s management continues to see strong adoption of Amazon Q, Amazon’s generative AI assistant for software development; Amazon Q has the highest reported code acceptance rates in the industry; reminder that Amazon saved $260 million and 4,500 developer years when performing a large Java Development Kit migration through the use of Amazon Q

We’re continuing to see strong adoption of Amazon Q, the most capable generative AI-powered assistant for software development and to leverage your own data. Q has the highest reported code acceptance rates in the industry for multiline code suggestions. The team has added all sorts of capabilities in the last few months, but the very practical use case recently shared where Q Transform saving Amazon’s teams $260 million and 4,500 developer years in migrating over 30,000 applications to new versions of the Java JDK. As excited developers and prompted them to ask how else we could help them with tedious and painful transformations.

Amazon is using generative AI pervasively across its businesses, with hundreds of apps in use or in development; Rufus is a generative AI-powered shopping assistant available in parts of Europe, North America, and India; Amazon is using generative AI to improve personalisation and product-search for consumers when shopping; Project Amelia is an AI system offering tailored business insights to Amazon sellers; Alexa, Amazon’s virtual assistant technology, is being rearchitected with new foundation AI models; the new Kindle Scribe has a built-in AI-powered notebook 

We’re also using generative AI pervasively across Amazon’s other businesses with hundreds of apps in development or launched.

For consumers, we’ve expanded Rufus, our generative AI-powered expert shopping assistant, to the U.K., India, Germany, France, Italy, Spain, and Canada. And in the U.S., we’ve added more personalization, the ability to better narrow customer intent and real-time pricing and deal information. We’ve recently debuted AI shopping guides for consumers, which simplifies product research by using generative AI to pair key factors to consider in a product category with Amazon’s wide selection, making it easier for customers to find the right product for their needs. 

For sellers, we’ve recently launched Project Amelia, an AI system that offers tailored business insights to boost productivity and drive seller growth.

We continue to rearchitect the brain of Alexa with a new set of foundation models that we’ll share with customers in the near future, and we’re increasingly adding more AI into all of our devices. Take the new Kindle Scribe we just announced. The note-taking experience is much more powerful with the new built-in AI-powered notebook, which enables you to quickly summarize pages of notes into concise bullets in a script font that can easily be shared.

Amazon’s management expects capital expenditures of $75 billion for the whole of 2024; most of the capex will be for AWS infrastructure to support demand for AI services; the capex also includes investments in Amazon’s fulfilment and transportation network; management expects capex in 2025 to increase from 2024’s level, with most of the capex for AWS, specifically generative AI; reminder that the faster AWS grows, the faster Amazon needs to invest capital for hardware; many of the assets AWS’s capex is invested in have long, useful lives; management expects to deliver high returns on invested capital with AWS’s generative AI investments; management has a lot of experience, accumulated over the years, in predicting just the right amount of compute capacity to provide for AWS before the generative AI era, and they believe they can do so again for generative AI

Year-to-date capital investments were $51.9 billion. We expect to spend approximately $75 billion in CapEx in 2024. The majority of the spend is to support the growing need for technology infrastructure. This primarily relates to AWS as we invest to support demand for our AI services while also including technology infrastructure to support our North America and international segments. Additionally, we’re continuing to invest in our fulfillment and transportation network to support the growth of the business, improve delivery speeds and lower our cost to serve. This includes investments in same-day delivery facilities, in our inbound network and as well in robotics and automation…

… I’ll take the CapEx part of that. As Brian said in his opening comments, we expect to spend about $75 billion in 2024. I suspect we’ll spend more than that in 2025. And the majority of it is for AWS, and specifically, the increased bumps here are really driven by generative AI…

…The thing to remember about the AWS business is the cash life cycle is such that the faster we grow demand, the faster we have to invest capital in data centers and networking gear and hardware. And of course, in the hardware of AI, the accelerators or the chips are more expensive than the CPU hardware. And so we invest in all of that upfront in advance of when we can monetize it with customers using the resources…

…A lot of these assets are many-year useful life assets. Data centers, for instance, are useful assets for 20 to 30 years…

…I think we’ve proven over time that we can drive enough operating income and free cash flow to make this very successful return on invested capital business. And we expect the same thing will happen here with generative AI…

…One of the least understood parts about AWS over time is that it is a massive logistics challenge. If you think about we have 35-or-so regions around the world, which is an area of the world where we have multiple data centers, and then probably about 130 availability zone through data centers, and then we have thousands of SKUs we have to land in all those facilities. And if you land too little of them, you end up with shortages, which end up in outages for customers. So most don’t end up with too little, they end up with too much. And if you end up with too much, the economics are woefully inefficient. And I think you can see from our economics that we’ve done a pretty good job over time at managing those types of logistics and capacity. And it’s meant that we’ve had to develop very sophisticated models in anticipating how much capacity we need, where, in which SKUs and units.

And so I think that the AI space is, for sure, earlier stage, more fluid and dynamic than our non-AI part of AWS. But it’s also true that people aren’t showing up for 30,000 chips in a day. They’re planning in advance. So we have very significant demand signals giving us an idea about how much we need…

…There are some similarities in the early days here of AI, where the offerings are new and people are very excited about it. It’s moving very quickly and the margins are lower than what I think they will be over time. The same was true with AWS. If you looked at our margins around the time you were citing, in 2010, they were pretty different than they are now. I think as the market matures over time, there are going to be very healthy margins here in the generative AI space.

There are a few hundred million active Alexa devices; management had an initial vision of Alexa being the world’s best personal assistant and they believe now that Alexa’s re-architecture can give it a shot at fulfilling the initial vision

I think we have a really broad number of Alexa devices all over people’s homes and offices and automobiles and hospitality suites. We’ve about 0.5 billion devices out there with a couple of hundred million active end points. And when we first were pursuing Alexa, we had this vision of it being the world’s best personal assistant and people thought that was kind of a crazy idea. And I think if you look at what’s happened in generative AI over the last couple of years, I think you’re kind of missing the boat if you don’t believe that’s going to happen. It absolutely is going to happen. So we have a really broad footprint where we believe if we rearchitect the brains of Alexa with next-generation foundational models, which we’re in the process of doing, we have an opportunity to be the leader in that space.

Amazon’s management believes that AWS’s demand substantially outweighs capacity today; management believes AWS’s rate of growth can improve over time as capacity grows

[Question] On the cloud, are you at all capacity constrained, and will the new Trainium or NVIDIA chips maybe even drive sales growth faster?

[Answer] I believe we have more demand that we could fulfill if we had even more capacity today. I think pretty much everyone today has less capacity than they have demand for, and it’s really primarily chips that are the area where companies could use more supply…

…We’re growing at a very rapid rate and have grown a pretty big business here in the AI space. And it’s early days, but I actually believe that the rate of growth there has a chance to improve over time as we have bigger and bigger capacity.

Apple (NASDAQ: AAPL)

Apple announced Apple Intelligence in June 2024; Apple Intelligence redefines privacy in AI; Apple recently released the first set of Apple Intelligence features in US English for iPhone, iPad, and Mac users, and they include writing tools, an improved version of Siri, a more intelligent Photos App, and notification summaries and priority messages; more Apple Intelligence features will be released in December 2024 and early developer feedback is great; the adoption rate of iOS18 in its first three days is twice as fast as for iOS17, suggesting interest for Apple Intelligence; Apple will release support for additional languages in Apple Intelligence in April 2025

In June, we announced Apple Intelligence, a remarkable personal intelligent system that combines the power of generative models with personal context to deliver intelligence that is incredibly useful and relevant. Apple Intelligence marks the beginning of a new chapter for Apple Innovation and redefines privacy and AI by extending our groundbreaking approach to privacy into the cloud with private cloud compute. Earlier this week, we made the first set of Apple Intelligence features available in U.S. English for iPhone, iPad and Mac users with system-wide writing tools that help you refine your writing, a more natural and conversational Siri, a more intelligent Photos app, including the ability to create movies simply by typing a description, and new ways to prioritize and stay in the moment with notification summaries and priority messages.

And we look forward to additional intelligence features in December with even more powerful writing tools, a new visual intelligence experience that builds on Apple Intelligence and ChatGPT integration as well as localized English in several countries, including the U.K., Australia and Canada. These features have already been provided to developers, and we’re getting great feedback. More features will be rolling out in the coming months as well as support for more languages, and this is just the beginning…

…[Question] I was wondering if you could just expand a little bit on some of the early feedback to Apple Intelligence, both for iOS 18.1 but also the developer beta so far and whether you would attribute Apple Intelligence to any of the strong iPhone performance that we’ve seen to date.

[Answer] We’re getting a lot of positive feedback from developers and customers. And in fact, if you just look at the first 3 days, which is all we have obviously from Monday, the 18.1 adoption is twice as fast as the 17.1 adoption was in the year ago quarter. And so there’s definitely interest out there for Apple Intelligence…

…We started in the — with U.S. English. That started on Monday. There’s another release coming that adds additional features that I had referenced in December in not only U.S. English but also localized for U.K., Australia, Canada, Ireland and New Zealand. And then we will add more languages in April. We haven’t set the specifics yet in terms of the languages, but we’ll add more in April and then more as we step through the year. And so we’re moving just as fast as possible while ensuring quality.

Apple’s management is building the infrastructure to deliver Apple Intelligence, but it does not seem like Apple will need to significantly increase its capex budget from historical norms; management also does not see any significant change to the intensity of research & development (R&D) spending that Apple needs to invest in AI

[Question] Could you just talk a little bit about the CapEx outlook and whether investments in things like private cloud compute could change the historical CapEx range of roughly $10 billion a year?

[Answer] We are rolling out these features, Apple Intelligence features already now. And so we are making all the capacity that is needed available for these features. You will see in our 10-K the amount of CapEx that we’ve incurred during the course of fiscal ’24. And we will — in fiscal ’25, we will continue to make all the investments that are necessary, and of course, the investments in AI-related CapEx will be made…

…[Question] Given how much your tech peers are spending on AI, does this new era of Apple Intelligence actually require Apple to invest more in R&D beyond your current 7% to 8% of sales to capture this opportunity? 

[Answer] We’ve been investing heavily in R&D over the last several years. Our R&D growth has been significant during the last several years. And obviously, as we move through the course of fiscal ’24, we’ve also reallocated some of the existing resources to this new technology, to AI. And so the level of intensity that we’re putting into AI has increased a lot, and you maybe don’t see the full extent of it because we’ve also had some internal reallocation of the base of engineering resources that we have within the company.

Apple’s management thinks the introduction of Apple Intelligence will benefit the entire Apple ecosystem

[Question] I understand Apple Intelligence is a feature on the phone today. But do you think that in the future it could potentially have or benefit the services growth business? Or is that too — are those too bifurcated to even make a call on the — this early in the cycle?

[Answer] Keep in mind that we have released a lot of APIs, and developers will be taking advantage of those APIs. That release has occurred as well, and of course, more are coming. And so I definitely believe that a lot of developers will be taking advantage of Apple Intelligence in a big way. And what that does to services, I’ll not forecast, but I would say that from an ecosystem point of view, I think it will be great for the user and the user experience.

Arista Networks (NYSE: ANET)

Arista Networks’ management is seeing networking for AI gaining a lot of traction; trials that took place in 2023 are becoming pilots in 2024; management expects more production in 2025 and 2026

Networking for AI is gaining a lot of traction as we move from trials in 2023 to more pilots in 2024, connecting to thousands of GPUs, and we expect more production in 2025 and 2026.

AI data traffic is very different from traditional cloud workloads and smooth and consistent data flow is a crucial factor in AI networking

AI traffic differs greatly from cloud workloads in terms of diversity, duration and size of flow. The fidelity of AI traffic flows where the slowest flow matters and one slow flow could slow down the entire job completion time is a crucial factor in networking.

Arista Networks’ management sees the company becoming a pioneer in scale-out Ethernet accelerated networking for large AI workloads; Arista Networks’ new Etherlink portfolio scales well to networks with over 100,000 GPUs and can potentially even handle 1 million GPU clusters; Arista Networks’ latest 77R4 DES platform was launched in close collaboration with Meta Platforms

Our AI centers connect seamlessly from the back end to the front end of compute, storage, WAN and classic cloud networks. Arista is emerging as the a pioneer and scale-out Ethernet accelerated networking for large-scale training and AI workloads. Our new Etherlink portfolio with wire speed 800-gig throughput and non-blocking performance, scales from single tier to efficient 2-tier networks for over 100,000 GPUs, potentially even 1 million AI accelerators with multiple tiers. Our accelerated AI networking portfolio consists of 3 families with over 20 switching products and not just one point switch. At the recent OCP in mid-October 2024, we officially launched a very unique platform that distributed Etherlink 7700 to build 2 tier networks for up to 10,000 GPU clusters. The 77R4 DES platform was developed in close collaboration with Meta. And while it may physically look like and be cable like a 2-tier leaf spine network, DES provides a single-stage forwarding with highly efficient spine fabric, eliminating the need for tuning and encouraging fast failover for large AI accelerator-based clusters. 

Arista Networks’ management believes the company has the broadest set of 800 gigabit per second Ethernet products for AI networks

I’m pleased to report Arista 7700R4 distributed Etherlink switch, the 7800R4 Spine, along with the 7060X6 AI leaf that we announced in June have entered into production providing our customers the broadest set of 800 gigabit per second Ethernet products for their AI networks. Together with 800 gigabit per second parallel optics, our customers are able to connect to 400 gigabit per second GPUs to each port increasing the deployment density over current switching solutions. This broad range of Ethernet platforms allows our customers to optimize density and minimize tiers to best match the requirements of their AI workload.

New AI clusters require high-speed connections to existing backbones

New AI clusters require new high-speed port connections into the existing backbone. These new clusters also increased bandwidth on the backbone to access training data, capture snapshots and deliver results generated by the cluster. This trend is providing increased demand for 7800R3 400-gigabit solution.

Arista Networks’ management sees next-generation AI data centres needing significantly more power while doubling network performance

Next-generation data centers integrating AI will contend with significant increases in power consumption while looking to double network performance.

Arista Networks’ management thinks the adoption of AI networking will rest on specifications that the Ultra Ethernet Consortium (UEC) is expected to soon release; the UEC now has 97 members and Arista Networks is a founding member

Critical to the rapid adoption of AI networking is the Ultra Ethernet consortium specification expected imminently with Arista’s key contributions as a founding member. The UEC ecosystem for AI has evolved to over 97 members.

Arista Networks’ management thinks Ethernet is the only option for open standard space AI networking

In our view, Ethernet is the only long-term viable direction for open standard space AI networking.

Arista Networks’ business growth in 2024 was achieved partly with the help of AI; management is now projecting even more growth in 2025 and is confident of achieving its AI back-end revenue target of US$750 million; the adoption of Arista Networks’ AI back-end products influences the adoption of its front-end AI networking products too; management also expects Arista Networks’ front-end AI networking products to generate around US$750 million in revenue in 2025, but sometimes this gets hard to track; the US$750 million in AI back-end revenue that management expects are brand new for the company

We’ve experienced some pretty amazing growth years with 33.8% growth in ’23 and 2024 appears to be heading at least to 18%, exceeding our prior predictions of 10% to 12%. This is quite a jump in 2024, influenced by faster AI pilots. We are now projecting an annual growth of 15% to 17% next year, translating to approximately $8 billion in 2025 revenue with a healthy expectation of operating margin. Within that $8 billion revenue target, we are quite confident in achieving our campus and AI by back-end networking targets of $750 million each in 2025 that we set way back 1 or 2 years ago. It’s important to recognize though that the back end of AI will influence the front-end AI network and its ratios. This ratio can be anywhere from 30% to 100% and sometimes, we’ve seen it as high as 200% of the back-end network depending on the training requirements. Our comprehensive AI center networking number is therefore likely to be double of our back-end target of $750 million, now aiming for approximately $1.5 billion in 2025…

… I would expect in the back end, any share Arista gets, including that $750 million is incremental. It’s brand new to us. We were never there before…

…I think it all depends on their approach to AI. If they just want to build a back-end cluster and prove something out, then they just look for the highest job training completion and intense training models. And it’s a very narrow use case. But what we’re starting to see more and more, especially with the top 5, like I said, is for every dollar spent in the back end, you could spend 30% more, 100% more, and we’ve even seen a 200% more scenario, which is why our $750 million will carry over to, we believe, next year, another $750 million on front-end traffic that will include AI, but it will include other things as well. It won’t be unique to AI. So I wouldn’t be surprised if that number is anywhere between 30% and 100%, so the average is 100%., which is 2x our back-end number. So feeling pretty good about that. Don’t know how to exactly count that as pure AI, which is why I qualify it by saying increasingly, if you start having inference, training, front end, storage, WAN, classic cloud all come together, the AI — the pure AI number becomes difficult to track.

Arista Networks’ management is stocking up inventory in preparation for a rapid deployment of AI networking products

On the cash front, while we have experienced significant increases in operating cash over the last couple of quarters, we anticipate an increase in working capital requirements in Q4. This is primarily driven by increased inventory in order to respond to the rapid deployment of AI networks and to reduce overall lead times as we move into 2025.

Arista Networks’ management has been surprised by the acceleration of AI pilots by its customers in 2024; management would not be surprised going forward if its AI business grows faster than its classic data center and cloud business (in other words, management would not be surprised if the company’s customers cannibalise some of their classic data center and cloud buildouts for AI)

We were pleasantly surprised with the faster acceleration of AI pilots in 2024. So we definitely see that our large cloud customers are continuing to refresh on the cloud, but are pivoting very aggressively to AI. So it wouldn’t surprise me if we grow faster in AI and faster in campus in the new center markets and slower in our classic markets called that data center and cloud. 

The 4 major AI trials Arista Networks discussed in the 2024 Q1 earnings call have now become 5 trials; 3 of the 5 customers are progressing well and are transitioning from trials to pilots, and they will each grow their GPU clusters from 50,000 to 100,000 in 2025; the customer for the new trial that was started has historically been very focused on Infiniband so management is happy to have won the trial, and management hopes the trail will enter pilot and production in 2025; the last remaining customer is moving slower than management expected with delays in their data center buildout; management has good revenue visibility for 3 of the 5 trials for the next 6-12 months and Arista Networks’ revenue-guide for 2025 does not depend on the remaining 2 trials; a majority of the trials are currently on Arista Networks’ 400-gig products because the customers are waiting for the ecosystem to develop on the 800-gig products, but management expects more adoption of the 800-gig products in 2025; Arista Networks is participating in other smaller AI trials too, but the difference is that management expects the 5 major ones to scale to at least 100,000 GPU clusters 

Arista now believes we’re actually 5 out of 5, not 4 out of 5. We are progressing very well in 4 out of the 5 clusters. 3 of the customers are moving from trials to pilots this year, and we’re expecting those 3 to become 50,000 to 100,000 GPU clusters in 2025. We’re also pleased with the new Ethernet trial in 2024 with our fifth customer. This customer was historically very, very InfiniBand driven. And we are now moving in that particular fifth customer, we are largely in a trial mode in 2024, and we hope to go to pilots and production in 2025. There is one customer who — so 3 are going well. One is starting. The fifth customer is moving slower than we expected. They may get back on their feet. In 2025, they’re awaiting new GPUs, and they’ve got some challenges on power cooling, et cetera. So 3, I would give an A. The fourth one, we’re really glad we won, and we’re getting started and the fifth one, I’d say, steady-state, not quite as great as we would expect them — have expected them to be…

…[Question] I wanted to ask a little bit more about the $750 million in AI for next year. Has your visibility on that improved over the last few months? I wanted to reconcile your comment around the fifth customer not going slower than expected. And it sounds like you’re now in 5 of 5, but wondering if that fifth customer going slower is limiting upside or limiting your visibility there?

[Answer] I think on 3 out of the 5, we have good visibility, at least for the next 6 months, maybe even 12…

…On the fourth one, we are in early trials. We’ve got some improving to do. So let’s see, but we’re not looking for 2025 to be the bang up year on the fourth one. It’s probably 2026. And on the fifth one, we’re a little bit stalled, which may be why we’re being careful about predicting how they’ll do. They may step in nicely in the second half of ’25, in which case, we’ll let you know. But if they don’t, we’re still feeling good about our guide for ’25…

…A majority of the trials and pilots are on 400 because people are still waiting for the ecosystem at 800, including the NICs and the UEC and the packet spring capabilities, et cetera. So while we’re in some early trials on 800, majority of 400 — majority of 2024 is 400 gig. I expect as we go into 2025, we will see a better split between 400 and 800…

… So we’re not saying these 5 are the be-all, end-all, but these are the 5 we predict can go to 100,000 GPUs and more. That’s the way to look at this. So there are the largest AI Titans, if you will. And they can be in the cloud, hyperscaler Titan group, they could be in the Tier 2 as well, by the way, very rarely would they be in a classic enterprise. By the way, we do have at least 10 to 15 trials going on in the classic enterprise too, but they’re much smaller GPU counts, so we don’t talk about it.

Arista Networks’ management sees NVIDIA both as a partner and a competitor in the AI networking market; Arista Networks does see NVIDIA’s Infiniband as a competing solution, but rarely sees NVIDIA’s own Ethernet solution competing; management thinks customers, ranging from those building large GPU clusters to smaller ones, all see Arista Networks as the expert when it comes to AI networking

We view NVIDIA as a good partner. If we didn’t have the ability to connect to their GPUs, we wouldn’t have all this AI networking demand. So thank you, NVIDIA. Thank you, Jensen, for the partnership. Now as you know, NVIDIA sells the full stack and most of the time, it’s with InfiniBand, and with the Mellanox acquisition, they do have some Ethernet capability. We personally do not run into the Ethernet capability very much. We run into it, maybe in 1 or 2 customers. And so generally speaking, Arista has looked upon as the expert there. We have a full portfolio. We have full software. And whether it’s the large scale-out ethernet working customers like the Titans or even the smaller enterprises, we’re seeing a lot of smaller GPU clusters of the enterprise, Arista is looked upon as the expert there. But that’s not to say we’re going to win 100%. We certainly welcome NVIDIA as a partner on the GPU side and a fierce competitor, and we look to compete with them on the Ethernet switching.

The AI back-end market is where Arista Networks natively connects with GPU and where NVIDIA’s Infiniband is the market leader, but Arista Networks’ Ethernet solution is aiming to be the gold standard; for the AI front-end market, Arista Networks’ solutions are the gold standard and management is seeing some customers fail to run their AI application on competing solutions and want to replace them with Arista Networks’ solutions

So since you asked me specifically about AI as opposed to cloud, let me parse this problem into 2 halves, the back end and the front end, right? At the back end, we’re natively connecting to GPUs. And there can be many times, we just don’t say it because somebody just bundles it in the GPU in particular, NVIDIA. And you may remember a year ago, I was saying we’re outside looking in because most of the bundling is happening with InfiniBand…

…So we’ll take all we can get, but we are not claiming to be a market leader there. We’re, in fact, claiming that there are many incumbents there with InfiniBand and smaller versions of Ethernet that Arista is looking to gain more credibility and experience and become the gold standard for the back end.

On the front end, in many ways, we are viewed as the gold standard. So competitively, it’s a much more complex network. You have to build a leaf-spine architecture. John alluded to this, there’s a tremendous amount of scale with L2, L3, EVPN, VXLAN, visibility, telemetry, automation, routing at scale, encryption at scale. And this, what I would call accelerated networking portfolio complements NVIDIA’s accelerated compute portfolio. And compared to all the peers you mentioned, we have the absolute best portfolio of 20 switches and 3 families and the capability and the competitive differentiation is bar none. In fact, I am specifically aware of a couple of situations where the AI applications aren’t even running on some of the industry peers you talked about, and they want to swap theirs for ours. So feeling extremely bullish with the 7800 flagship product, the newly introduced 7700 that we worked closely with Meta, the 7060, this product line running today mostly at 400 gig because a lot of the NIC and the ecosystem isn’t there for 800. But moving forward into 800, this is why John and the team are building the supply chain to get ready for it.

ASML (NASDAQ: ASML)

While ASML’s management has seen the strong performance of AI continue – and expects the performance to continue for some time – other market segments have taken longer to recover than management expected; in the Memory segment, management is seeing limited capacity additions among customers, apart from AI, as the customers embark on technology transition to HBM and DDR5

There have been quite some market dynamics in the past couple of months. Very clearly, the strong performance of AI clearly continues and I think it continues to come with quite some upside. We will also see that in other market segments, it takes longer to recover. Recovery is there, but it’s more gradual than what we anticipated before and it will continue in 2025. That does lead to some customer cautiousness…

…If you look at the Memory business, this customer cautiousness that I talked about, leads to limited capacity additions. While at the same time, we do see a lot of focus and strong demand when it comes to technology transitions and particularly as it is related to High Bandwidth Memory and to DDR5. So again, there anything related to AI is strong, but other than that there are limited capacity additions.

The AI growth-driver is very strong over the long-term and ASML’s management sees that AI is increasing share in ASML’s customers’ business

If you look at the long-term outlook, I believe the growth drivers are still very much intact. The secular growth drivers are clear and they are strong. I think if you look at AI, very, very strong, very clear and undisputed. Taking an increasing share in the business of our customers. So I think that is going very strongly.

ASML’s management is seeing upside on AI because the overall demand for AI applications continues to increase, which has driven a recovery in server demand, but management does not have complete understanding on how the AI market will play out

We also mentioned some upside on AI, because we still believe that the overall demand for those application is there, continue to increase. So if we look at the server demand, we see there a very nice recovery. A lot of that has to do with AI application. So we talk about upside, which also means that the overall dynamic of the market is still playing. And we felt the need to provide an update for next year based on some of the development we have seen. I think in no way we are also saying that there is a complete understanding of how the entire market will continue to play out in the next few months. So I think on the second part of your question, I would say maybe this has not played out fully yet…

…[Question] You would expect to happen then, I guess, to — at some point will happen?

[Answer] Well, I think if everyone — and I think a lot of us still believe in a strong AI demand in the coming years, I think that demand has to be fulfilled. Therefore, yes, I will say mostly, we will see some development also on that front in the coming months.

Datadog (NASDAQ: DDOG)

Datadog’s management is seeing next-gen AI customers want to obtain visibility into their AI usage as they continue experimenting with the technology; around 3,000 customers used at least one of Datadog’s AI integrations at the end of 2024 Q3; management is starting to see Datadog’s LLM (large language model) observability products gain traction as AI experiments start becoming production applications; hundreds of customers are already using LLM observability, and some customers have reduced time spent on investigating LLM issues from days or hours to just minutes; management is seeing customers wanting to use APM (Application Performance Monitoring) alongside LLM observability 

In the next-gen AI space, customers continue to experiment with new AI technologies. And as they do, they want to get visibility into their AI use. At the end of Q3, about 3,000 customers used one or more Datadog AI integrations to send us data about their AI, machine learning and LLM usage. As some of these experiments start turning into production AI applications, we are seeing initial signs of traction for our LLM observability products.

Today, hundreds of customers are using LLM observability with more exploring it every day. And some of our first paying customers have told us that they have cut the time spent investigating LLM latency, errors and quality from days or hours to just minutes. Our customers not only want to understand the performance and cost of their LLM applications, they also want to understand the LLM model performance within the context of their entire application. So they are using APM alongside LLM observability to get fully integrated end-to-end visibility across all their applications and tech stacks

AI-native customers accounted for 6% of Datadog’s ARR in 2024 Q3 (was 6% 2024 Q2); AI-native customers contributed 4 percentage points to Datadog’s year-on-year growth in 2024 Q3, compared to 2 percentage points in 2023 Q3; management has seen a very rapid ramp in usage of Datadog among large customers in the AI-native cohort, and management thinks these customers will optimise cloud and observability usage in the future, while also asking for better terms; management is seeing Datadog’s production-minded LLM observability products being used by real paying customers with real volumes in real production workloads; AI-native companies are model providers or AI infrastructure providers that serve as a proxy for the AI industry

AI native customers who this quarter represented more than 6% of our Q3 ARR, up from more than 4% in Q2 and about 2.5% of our ARR in the year ago quarter. AI native customers contributed about 4 percentage points of year-over-year growth in Q3 versus about 2 percentage points in the year ago quarter. While we believe that adoption of AI will continue to benefit Datadog in the long term, we are mindful that some of the large customers in this cohort have ramped extremely rapidly and that these customers may optimize cloud and observability usage and increase their commitments to us over time with better terms. This may create volatility in our revenue growth in future quarters on the backdrop of long-term volume growth…

…We are seeing our production-minded LLM observability products, for example, being used by real paying customers with real volumes and real applications in real production workloads. So that’s exciting and healthy. I think it’s a great trend for the future…

… We have that group of AI, like smaller — relatively small number of AI companies or AI native companies. Many of them are model providers or infrastructure providers for AI that serve the rest of the industry and they are really a proxy for the future growth of the rest of the industry in AI.

Datadog signed a 7-figure expansion deal with a hyperscaler delivering next-gen AI models; the hyperscaler has its homegrown observability solution, but the solution needs time-consuming customisation and manual configuration; the hyperscaler chose Datadog because Datadog’s platform can scale flexibly

We signed a 7-figure annualized expansion with a division of a hyperscaler delivering next-gen AI models. This customer is very technically capable and already has a homegrown observability solution, which requires time-consuming customization and manual configuration. They will be launching new features for their large language models soon and need a platform that can scale flexibly while supporting proactive incident detection. By expanding the use of Datadog, they expect to efficiently onboard new teams and environments and support the rapidly increasing adoption of the LLMs.

Datadog’s management continues to believe that digital transformation, cloud migration, and AI adoption are long-term growth drivers of Datadog’s business

Overall, we continue to see no change to the multiyear trend towards digital transformation and cloud migration, which we continue to believe are still in early days. We are seeing continued experimentation with new advances such as next-gen AI, and we believe this is one of the many factors that will drive greater use of the cloud and other modern technologies.

Datadog’s management is starting to see more inference AI workloads, but they are still concentrated among API-driven providers and it’s still very early days in terms of customers putting their next-gen AI applications into production; management expects more diversification to occur in the future as more companies enter production with their applications and customise their models 

In terms of the workloads, you’re right that we’re starting to see more inference workloads, but they still tend to be more concentrated across a number of API-driven providers. So there are a few others, both on LLMs and other kinds of models. So this is where I think most of the usage in production at least is today. We expect that to diversify more over time as companies get further into production with their applications and they start to be customizing more on their models…

…We are excited to see what’s happening with the AI innovation as it gets further down the pipe and away from testing and experimenting and more into production applications. And we have some signs that it’s starting to happen. Again, we see that with our LLM observability product. We see that also with some of the workloads we monitor from our customers on the infrastructure side. But I would say it’s still very early days in terms of customers being in production with their next-gen AI applications.

Datadog’s management is seeing a small amount of cloud workloads of companies being cannibalised by their AI initiatives

You’re right that the — where the workloads could have grown maybe instead of growing 20%, they could grow 25%, maybe some of those 5% instead are being invested both in terms of infrastructure budget or innovation — time innovation budget. All that is going into AI, and that’s largely right now in experimentation and model training and that sort of thing. 

Datadog’s Management is working with customers with large inference workloads on how Datadog can be helpful on the GPU profiling side of inference; management is also experimenting with how Datadog can be helpful on side of training; management thinks that in a steady state, 60% of AI workloads will be inference and 40% will be training, so there’s still a lot of value to be found if Datadog can be useful in the training side too

Right now, we’re working with a number of customers that have real-world large inference workloads on how we can help on the GPU profiling side for inference. We’re doing less on the training side, mostly because the training jobs tend to be more bespoke and temporary, and there’s less of an application that’s attached to those that these are just very large clusters of GPUs. So it’s closer to HPC in a way than it is to traditional applications, though we are also experimenting with what we can do there. There is a world where maybe in a durable fashion, 60% of workloads are inference and 40% are training. And if that’s the case, there’s going to be a lot of value to be had by having repeatable training and repeatable tooling for that. So we are also looking into that.

Datadog is not monetising GPU instances as well as CPU instances today, but management thinks that could change in the future

As of today, we really don’t monetize GPU instances all that well compared to the other CPU instances. So GPU instance is many times the cost of a CPU instance, and we charge the same amount for it. That doesn’t have to be the case in the future. If we do things that are particularly interesting and it’s going to have a real impact on — and deliver value and how customers use and make the best of their GPUs and in the end, save money. 

Datadog’s management is seeing Datadog’s AI-native cohort grow faster than its cloud-native cohorts did in the late 2010s and early 2020s

What we’ve seen with cloud native in the late ’10s and early ’20s, where we had these numbers of cloud-native consumer companies that were growing very fast, with 2 differences. The first one is that the AI cohort is growing faster and there are larger individual ACVs [annual contract value] for these customers.

Datadog’s management thinks that workloads on Datadog’s platform could really accelerate when non-AI-native companies start bringing AI applications into production

In terms of the growth of workloads, look, I mean, as we said, we see growth across the customer base pretty much. We see growth of classical workloads in the cloud. We see large growth — very large growth on the AI native side. We think that the one big catalyst for future acceleration will be those AI native applications or those AI applications, I should say, going into production for non-AI native companies for a much broader set of customers than the customers that are deploying these kind of applications to their — in production. And as they do, they will also look less like just large cluster of GPUs and more like traditional applications because the GPU needs a database, it needs [ core ] application in front of it, it needs layers to secure it and authorize it and all the other things. So it’s going to look a lot more like a normal application with some additional more concentrated compute and GPUs.

Datadog’s management does not expect Datadog to make outsized investments in GPU clusters for compute

Unlike many others, we don’t expect at this point to have outsized investments in compute. We’re not building absolutely large GPU clusters.

dLocal (NASDAQ: DLO)

dLocal’s management launched the Smart Requests functionality in 2024 Q3 that improves conversion rates for merchants by 1.22 percentage points on average, which equates to a 1.2% increase in revenue for merchants; Smart Requests relies on localised machine learning models to maximise authorisation rates for merchant

During the quarter, we launched our smart requests functionality, boosting our transaction performance and therefore, improving conversion rates by an average of 1.22 percentage points across the board. It may sound minor, but it isn’t. It actually represents, in practical terms, 1.2% additional revenue to our merchants. Smart requests rely on per country machine learning models that optimize routing and chaining so as to maximize authorization rates for our merchants.

Fiverr (NYSE: FVRR)

Fiverr’s management believes that Fiverr’s next generation of products must empower its community to fully leverage AI, and that the best work will be done in the future by a combination of humans and AI

One thing that became clearer to me in the last year is that with the emergence of GenAI and the promise of AGI, the next generation of products we build must empower our community to fully leverage artificial intelligence. It also became clear to me that in the future, the best work will be done by humans and AI technology together, not humans alone or AI alone.

Fiverr’s management is providing Fiverr’s customers with an AI assistant to help them navigate the company’s platform 

This means that every business that comes to Fiverr will have a world-class AI assistant to help them get things done, from ideation, scoping and briefing to project management and workflow automation. It means that they can seamlessly leverage both human talent and machine intelligence to create the most beautiful results.

Fiverr’s management is building a new search experience on the Fiverr platform for buyers which incorporates Neo, its AI powered smart matching tool; Fiverr has launched Dynamic Matching to allow buyers to put together project briefs with an AI assistant to help them get matched to the most relevant freelancers; these new features have experienced enthusiastic reception in just a few weeks; projects that use these new features are bigger projects than the typical scope of projects on Fiverr

On the buyer side, we are building a new search experience that not only includes more dynamic catalogs but also incorporates Neo, an AI-powered smart matching tool, to help customers match with more contextual information. We launched Dynamic Matching to allow buyers to put together comprehensive briefs with a powerful AI assistant and then get matched with the most relevant freelancer with a tailored proposal…

…Even in the few weeks since we launched these products, we have already seen an enthusiastic reception from our community and promising performance. The projects that come through these products are several times larger than a typical project on Fiverr, and we believe it has a lot more potential down the road as the awareness and trust of these products grow on the platform.

Mastercard (NYSE: MA)

Mastercard acquired Brighterion in 2017 to use AI capabilities for decision intelligence; after boosting the product with generative AI, Mastercard has seen a 20% lift in the product 

One of the more recent ones that we talked about that we invested heavily in using our Brighterion acquisition from back in 2017 to use our AI capabilities is decision intelligence. We’ve now boosted the product with Gen AI and the outcome that we see is tremendous. This is up to a 20% lift that we see.

Meta Platforms (NASDAQ: META)

Meta’s management is seeing rapid adoption of Meta AI and Llama; Meta AI now has more than 500 million monthly actives; Llama token usage has grown exponentially in 2024 so far; Meta released Llama 3.2 in 2024 Q3; the public sector is adopting Llama; management is seeing higher usage of Meta AI as the models improve; Meta AI is built on Llama 3.2; voice functions for Meta AI are now available in English in the USA, Australia, Canada, and New Zealand; image editing through simple text prompts, and the ability to learn about images, are now available in Meta AI in the USA; Meta AI remains on track to be the most-used AI assistant in the world by end-2024; early use cases for Meta AI are for information gathering, help with how-to tasks, explore interests, look for content, and generate images

We’re seeing rapid adoption of Meta AI and Llama, which is quickly becoming a standard across the industry…

…Meta AI now has more than 500 million monthly actives…

…Llama token usage has grown exponentially this year and the more widely that Llama gets adopted and becomes the industry standard the more that the improvements to its quality and efficiency will flow back to all of our products. This quarter, we released Llama 3.2, including the leading small models that run on device and open source multimodal models…

…We’re also working with the public sector to adopt Llama across the U.S. government…

…We’re seeing lifts in usage as we improve our models and have introduced a number of enhancements in recent months to make Meta AI more helpful in engaging. Last month, we began introducing voice, so you can speak with Meta AI more naturally, and it’s now fully available in English to people in the U.S., Australia, Canada and New Zealand. In the U.S., people can now also upload photos to Meta AI to learn more about them, write captions for post and add, remove or change things about their images with a simple text prompt. These are all built with our first multimodal foundation model, Llama 3.2…

…We’re excited about the progress of Meta AI. It’s obviously very early in its journey, but it continues to be on track to be the most used AI assistant in the world by end of year…

… Number of the frequent use cases we’re seeing include information gathering, help with how-to tasks, which is the largest use case. But we also see people using it to go deeper on interests, to look for content on our services, for image generation, that’s also been another pretty popular use case so far.

Meta’s management is seeing AI have a positive impact on nearly all aspects of Meta; improvements to Meta’s AI-driven feed and video recommendations have driven increases in time spent on Facebook this year by 8% and on Instagram by 6%; more than 1 million advertisers are using Meta’s Gen AI tools and advertisers using image generation are enjoying a 7% increase in conversions; management sees plenty of new opportunities for new AI advances to accelerate Meta’s core business, so they want to invest more there

We’re seeing AI have a positive impact on nearly all aspects of our work from our core business engagement and monetization to our long-term road maps for new services and computing platforms…

…Improvements to our AI-driven feed and video recommendations have led to an 8% increase in time spent on Facebook and a 6% increase on Instagram this year alone. More than 1 million advertisers used our Gen AI tools to create more than 15 million ads in the last month. And we estimate that businesses using image generation are seeing a 7% increase in conversions and we believe that there’s a lot more upside here…

…It’s clear that there are a lot of new opportunities to use new AI advances to accelerate our core business that should have strong ROI over the next few years. So I think we should invest more there.

 The development of Llama 4 is progressing well; Llama 4 is being trained on more than 100,000 H100s and it’s the biggest training cluster in the world management is aware of; management expects the smaller Llama 4 models to be ready in early-2025; management thinks Llama 4 will be much faster and will have new modalities, stronger capabilities and reasoning

I’m even more excited about Llama 4, which is now well into its development. We’re training the Llama 4 models on a cluster that is bigger than 100,000 H100s or bigger than anything that I’ve seen reported for what others are doing. I expect that the smaller Llama 4 models will be ready first, and they’ll be ready — we expect sometime early next year. And I think that there are going to be a big deal on several fronts, new modalities, capabilities, stronger reasoning and much faster. 

Meta’s management remains convinced that open source is the way to go for AI development; the more developers use Llama, the more Llama improves in both quality and efficiency; in terms of efficiency, with higher adoption of Llama, management is seeing NVIDIA and AMD optimise their chip designs to better run Llama

It seems pretty clear to me that open source will be the most cost-effective, customizable, trustworthy performance and easiest to use option that is available to developers. And I am proud that Llama is leading the way on this…

…[Question] You said something along the lines of the more standardized Llama becomes the more improvements will flow back to the core meta business. And I guess, could you just dig in a little bit more on that?

[Answer] The improvements to Llama, I’d say come in a couple of flavors. There’s sort of the quality flavor and the efficiency flavor. There are a lot of researchers and independent developers who do work and because Llama is available, they do the work on Llama and they make improvements and then they publish it and it becomes — it’s very easy for us to then incorporate that both back into Llama and into our Meta products like Meta AI or AI Studio or Business AIs because the work — the examples that are being shown are people doing it on our stack.

Perhaps more importantly, is just the efficiency and cost. I mean this stuff is obviously very expensive. When someone figures out a way to run this better if that — if they can run it 20% more effectively, then that will save us a huge amount of money. And that was sort of the experience that we had with open compute and why — part of why we are leaning so much into open source here in the first place, is that we found counterintuitively with open compute that by publishing and sharing the architectures and designs that we had for our compute, the industry standardized around it a bit more. We got some suggestions also that helped us save costs and that just ended up being really valuable for us. Here, one of the big costs is chips — a lot of the infrastructure there. What we’re seeing is that as Llama gets adopted more, you’re seeing folks like NVIDIA and AMD optimize their chips more to run Llama specifically well, which clearly benefits us. 

Meta’s management expects to continue investing seriously into AI infrastructure

Our AI investments continue to require serious infrastructure, and I expect to continue investing significantly there too. We haven’t decided on the final budget yet, but those are some of the directional trends that I’m seeing.

Meta’s management thinks the integration of Meta AI into the Meta Ray-Ban glasses is what truly makes the glasses special; the Meta Ray-Ban glasses can answer questions throughout the day, help wearers remember things, give suggestions to wearers in real-time using multi-modal AI, and translate languages directly into the ear of wearers; management continues to think glasses are the ideal form-factor for AI because glasses lets AI see what you see and hear what you hear; demand for the Meta Ray-Ban glasses continues to be really strong; a recent release of the glasses was sold out almost immediately; Meta has deepened its partnership with EssilorLuxottica to build future generations of the glasses; Meta recently showcased Orion, its first full holographic AR glasses

This quarter, we also had several milestones around Reality Labs and the integration of AI and wearables. Ray-Ban meta glasses are the prime example here. They’re great booking glasses that let you take photos and videos, listen to music and take calls. But what makes them really special is the Meta AI integration. With our new updates, it will be able to not only answer your questions throughout the day, but also help you remember things, give you suggestions as you’re doing things using real-time multi-modal AI and even translate other languages right in your ear for you. I continue to think that glasses are the ideal form factor for AI because you can let your AI see what you see, hear what you hear and talk to you.

Demand for the glasses continues to be very strong. The new clear addition that we released at Connect sold out almost immediately and has been trading online for over $1,000. We’ve deepened our partnership with EssilorLuxottica to build future generations of smart eyewear that deliver both cutting-edge technology and style.

At Connect, we also showed Orion, our first full holographic AR glasses. We’ve been working on this one for about a decade, and it gives you a sense of where this is all going. We’re not too far off from being able to deliver great-looking glasses to let you seamlessly blend the physical and digital worlds so you can feel present with anyone no matter where they are. And we’re starting to see the next computing platform come together and it’s pretty exciting.

Newer scaling laws seen with Meta’s large language models inspired management to develop new ranking model architectures that can learn more effectively from significantly larger data sets; the new ranking model architectures have been deployed to Facebook’s video ranking models, helping to deliver more relevant recommendations; management is exploring the use of the new ranking model architectures on other services and the introduction of cross-surface data to the models, with the view that these moves will unlock more relevant recommendations and lead to better engineering efficiency

Previously, we operated separate ranking and recommendation systems for each of our products because we found that performance did not scale if we expanded the model size and compute power beyond a certain point. However, inspired by the scaling laws we were observing with our large language models, last year, we developed new ranking model architectures capable of learning more effectively from significantly larger data sets.

To start, we have been deploying these new architectures to our Facebook ranking video ranking models, which has enabled us to deliver more relevant recommendations and unlock meaningful gains in launch time. Now we’re exploring whether these new models can unlock similar improvements to recommendations on other services. After that, we will look to introduce cross-surface data to these models, so our systems can learn from what is interesting to someone on one surface of our apps and use it to improve their recommendations on another. This will take time to execute and there are other explorations that we will pursue in parallel. However, over time, we are optimistic that this will unlock more relevant recommendations while also leading to higher engineering efficiency as we operate a smaller number of recommendations.

Meta’s management is using new approaches to AI modelling to allow Meta’s ad systems to consider a person’s sequence of actions before and after seeing an ad, which allow the systems to better predict a person’s response to specific ads; the new approaches to AI modelling have delivered a 2%-4% increase in conversions in tests; Meta is seeing strong user-retention with its generative AI tools for image expansion, background generation, and text generation; Meta has started testing its first generative AI tools for video expansion and image animation and plans to roll them out broadly by early-2025

The second part of improving monetization efficiency is enhancing marketing performance. Similar to organic content ranking, we are finding opportunities to achieve meaningful ads performance gains by adopting new approaches to modeling. For example, we recently deployed new learning and modeling techniques that enable our ad systems to consider the sequence of actions a person takes before and after seeing an ad. Previously, our ad system could only aggregate those actions together without mapping the sequence. This new approach allows our systems to better anticipate how audiences will respond to specific ads. Since we adopted the new models in the first half of this year, we’ve already seen a 2% to 4% increase in conversions based on testing within selected segments…

…Finally, there is continued momentum with our Advantage+ solutions, including our ad creative tools. We’re seeing strong retention with advertisers using our Generative AI-powered image expansion, background generation and text generation tools, and they’re already driving improved performance for advertisers even at this early stage. Earlier this month, we began testing our first video generation features, video expansion and image animation. We expect to make them more broadly available by early next year.

Meta’s management expects to significantly increase Meta’s infrastructure for generative AI while prioritising fungibility

Given the lead time of our longer-term investments, we also continue to maximize our flexibility so that we can react to market developments. Within Reality Labs, this has benefited us as we’ve evolved our road map to respond to the earlier-than-expected success of smart glasses. Within Generative AI, we expect significantly scaling up our infrastructure capacity now while also prioritizing its fungibility will similarly position us well to respond to how the technology and market develop in the years ahead.

Meta’s management continues to develop tools for individuals and businesses to create AI agents easily; management thinks that Meta’s progress with AI agent tools is currently at where Meta was with Meta AI a year ago; management wants the AI agent tools to be widely used in 2025

There are also other new products like that, things around AI Studio. This year, we really focused on rolling out Meta AI as kind of our are kind of single assistant that people can ask any question to, but I think there’s a lot of opportunities that I think we’ll see ramp more over the next year in terms of both consumer and business use cases, for people interacting with a wide variety of different AI agents, consumer ones with AI Studio around whether it’s different creators or kind of different agents that people create for entertainment. Or on the business side, we do want to continue making progress on this vision of making it set any small business or any business over time can with a few clicks stand up in AI agent that can help do customer service and sell things to all of their customers around the world, and I think that’s a huge opportunity. So it’s very broad…

…But I’d say that we’re — today, with AI Studio and business AIs about where we were with Meta AI about a year ago. So I think in the next year, our goal around that is going to be to try to make those pretty widespread use cases, even though there’s going to be a multiyear path to getting kind of the depth of usage and the business results around that we want. 

Meta’s management is not currently sharing quantitative metrics on productivity improvements with the internal use of AI, but management is excited about the internal adoption they are seeing and the future opportunities for doing so

On the use of AI and employee productivity, it’s certainly something that we’re very excited about. I don’t know that we have anything particularly quantitative that we’re sharing right now. I think there are different efficiency opportunities with AI that we’ve been focused on in terms of where we can reduce costs over time and generate savings through increasing internal productivity in areas like coding. For example, it’s early, but we’re seeing a lot of adoption internally of our internal assistant and coding agent, and we continue to make Llama more effective at coding, which should also make this use case increasingly valuable to developers over time.

There are also places where we hope over time that we’ll be able to deploy these tools against a lot of our content moderation efforts to help make the big body of content moderation work that we undertake, to help it make it more efficient and effective for us to do so. And there are lots of other places around the company where I would say we’re relatively early in exploring the way that we can use LLM based tools to make different types of work streams more efficient.

It appears that Meta has achieved more than management expected in terms of developing its own AI infrastructure (in other words, developing its own AI chips)

So I think part of what we’re seeing this year is the infra team is executing quite well. And I think that’s, why over the course of the year, we’ve been able to build out more capacity. I mean going into the year, we had a range for what we thought we could potentially do. And we have been able to do, I think, more than, I think, we’d kind of hoped and expected at the beginning of the year. And while that reflects as higher expenses, it’s actually something that I’m quite happy that the team is executing well on. And I think that will — so that execution makes me somewhat more optimistic that we’re going to be able to keep on building this out at a good pace but that’s part of the whole thing. 

Meta’s management is starting to test the addition of AI-generated or AI-augmented content to users of Instagram and Facebook; management has high confidence that AI-generated and/or AI-augmented content will be an important trend in the future

I think we’re going to add a whole new category of content, which is AI generated or AI summarized content or kind of existing content pulled together by AI in some way. And I think that, that’s going to be just very exciting for the — for Facebook and Instagram and maybe Threads or other kind of feed experiences over time. It’s something that we’re starting to test different things around this. I don’t know if we know exactly what’s going to work really well yet. Some things are promising. I don’t know that this isn’t going to be a big impact on the business in ’25 would be my guess. But I think that there is I have high confidence that over the next several years, this is going to be an important trend and one of the important applications.

Meta’s management is currently focused on the engagement and user-experience of Meta AI; the monetisation of Meta AI will come later

Right now, we’re really focused on making Meta AI as engaging and valuable a consumer experience as possible. Over time, we think there will be a broadening set of queries that people use it for. And I think that the monetization opportunities will exist when over time as we get there. But right now, I would say we are really focused on the consumer experience above all and this is sort of a playbook for us with products that we put out in the world where we really dial in the consumer experience before we focus on what the monetization could look like.

Microsoft (NASDAQ: MSFT)

Microsoft’s AI business is on track to exceed $10 billion in annual revenue run rate in 2024 Q4 after being started for just 2.5 years; it will be the fastest business in the company’s history to do so; Microsoft’s AI business is nearly all inference (see Point 32 for more)

All up, our AI business is on track to surpass an annual revenue run rate of $10 billion next quarter, which will make it the fastest business in our history to reach this milestone…

…We’re excited that only 2.5 years in, our AI business is on track to surpass $10 billion of annual revenue run rate in Q2…

…If you sort of think about the point we even made that this is going to be the fastest growth to $10 billion of any business in our history, it’s all inference, right? 

Azure took share in 2024 Q3 (FY2025 Q1), driven by AI; Azure grew revenue by 33% in 2024 Q3 (was 29% in 2024 Q2), with 12 points of growth from AI services (was 8 points in 2024 Q2); Azure’s AI business has higher demand than available capacity

Azure took share this quarter…. 

… Azure and other cloud services revenue grew 33% and 34% in constant currency, with healthy consumption trends that were in line with expectations. The better-than-expected result was due to the small benefit from in-period revenue recognition noted earlier. Azure growth included roughly 12 points from AI services similar to last quarter. Demand continues to be higher than our available capacity. 

Microsoft’s management thinks Azure offers the broadest selection of AI chips, from Microsoft’s own Maia 100 chip to AMD and NVIDIA’s latest GPUs; Azure is the first cloud provider to offer NVIDIA’s GB200 chips

We are building out our next-generation AI infrastructure, innovating across the full stack to optimize our fleet for AI workloads. We offer the broadest selection of AI accelerators, including our first-party accelerator, Maia 100 as well as the latest GPUs from AMD and NVIDIA. In fact, we are the first cloud to bring up NVIDIA’s Blackwell system with GB200-powered AI servers.

Azure OpenAI usage more than doubled in the past 6 months, as both startups and enterprises move apps from test to production; GE Aerospace used Azure OpenAI to build a digital assistant for its 52,000 employees and in 3 months, the assistant has processed 500,000 internal queries and 200,000 documents; Azure recently added support for OpenAI’s newest o1 family of AI models; Azure AI is offering industry-specific models, including multi-modal models for medical imaging; Azure AI is increasingly an on-ramp for Azure’s data and analytics services, driving acceleration of Azure Cosmos DB and Azure SQL DB hyperscale usage

More broadly with Azure AI, we are building an end-to-end app platform to help customers build their own copilots and agents. Azure OpenAI usage more than doubled over the past 6 months as both digital natives like Grammarly and Harvey as well as established enterprises like Bajaj Finance, Hitachi, KT and LG move apps from test to production. GE Aerospace, for example, used Azure OpenAI to build a new digital assistant for all 52,000 of its employees. In just 3 months, it has been used to conduct over 500,000 internal queries and process more than 200,000 documents…

…This quarter, we added support for OpenAI’s newest model family, o1. We’re also bringing industry-specific models through Azure AI, including a collection of best-in-class multimodal models for medical imaging…

…Azure AI is also increasingly an on-ramp to our data and analytics services. As developers build new AI apps on Azure, we have seen an acceleration of Azure Cosmos DB and Azure SQL DB hyperscale usage as customers like Air India, Novo Nordisk, Telefonica, Toyota Motor North America and Uniper take advantage of capabilities purpose built for AI applications. 

Azure is offering its full catalog of AI models directly within the GitHub developer workflow; GitHub Copilot enterprise customers grew 55% sequentially in 2024 Q3; GitHub Copilot now has agentic workflows, such as Copilot Autofix, which helps users fix code 3x faster than it would take them on their own

And with the GitHub models, we now provide access to our full model catalog directly within the GitHub developer workflow…

… GitHub Copilot is changing the way the world builds software. Copilot enterprise customers increased 55% quarter-over-quarter as companies like AMD and Flutter Entertainment tailor Copilot to their own code base. And we are introducing the next phase of AI code generation, making GitHub Copilot agentic across the developer workflow. GitHub Copilot Workspace is a developer environment, which leverages agents from start to finish so developers can go from spec to plan to code all in natural language. Copilot Autofix is an AI agent that helps developers at companies like Asurion and Auto Group fix vulnerabilities in their code over 3x faster than it would take them on their own. We’re also continuing to build on GitHub’s open platform ethos by making more models available via GitHub Copilot. And we are expanding the reach of GitHub to a new segment of developers introducing GitHub Spark, which enables anyone to build apps in natural language.

Microsoft 365 Copilot has a new Pages feature, which management thinks is the first new digital artefact for the AI age; Pages helps users brainstorm with AI and collaborate with other users; Microsoft 365 Copilot responses are now 2x faster and 3x better; daily users of Microsoft 365 have more than doubled sequentially; Microsoft 365 copilot saves Vodafone employees 3 hours per person per week, and will be rolled out to 68,000 employees; 70% of the Fortune 500 now use Microsoft 365 Copilot; Microsoft 365 copilot is being adopted at a faster rate than any other new Microsoft 365 feature; with Copilot Studio, organisations can build autonomous agents to connect with Microsoft 365 Copilot; more than 10,000 organisations have used Copilot Studio, up 2x sequentially; monthly active users of Copilot across Microsoft’s CRM and ERP portfolio grew 60% sequentially

We launched the next wave of Microsoft 365 Copilot innovation last month, bringing together web, work, and Pages as the new design system for knowledge work. Pages is the first new digital artifact for the AI age, and it’s designed to help you ideate with AI and collaborate with other people. We’ve also made Microsoft 365 Copilot responses 2x faster and improved response quality by nearly 3x. This innovation is driving accelerated usage, and the number of people using Microsoft 365 daily more than doubled quarter-over-quarter. We are also seeing increased adoption from customers in every industry as they use Microsoft 365 Copilot to drive real business value. Vodafone, for example, will roll out Microsoft 365 Copilot to 68,000 employees after a trial showed that, on average, they save 3 hours per person per week. And UBS will deploy 50,000 seats in our largest finserve deal to date. And we continue to see enterprise customers coming back to buy more seats. All up, nearly 70% of the Fortune 500 now use Microsoft 365 Copilot, and customers continue to adopt it at a faster rate than any other new Microsoft 365 suite…

…With Copilot Studio, organizations can build and connect Microsoft 365 Copilot to autonomous agents, which then delegate to Copilot when there is an exception. More than 100,000 organizations from Nsure, Standard Bank and Thomson Reuters to Virgin Money and Zurich Insurance have used Copilot Studio to date, up over 2x quarter-over-quarter…

…Monthly active users of Copilot across our CRM and ERP portfolio increased over 60% quarter-over-quarter. 

Azure is bringing AI to industry-specific workflows; DAX Copilot is used in over 500 healthcare organisations to document more than 1.3 million physician-patient encounters each month; DAX Copilot is growing revenue faster than GitHub Copilot did in its first year

We’re also bringing AI to industry-specific workflows. One year in, DAX Copilot is now documenting over 1.3 million physician-patient encounters each month at over 500 health care organizations like Baptist Medical Group, Baylor Scott & White, Greater Baltimore Medical Center, Novant Health and Overlake Medical Center. It is showing faster revenue growth than GitHub Copilot did in this first year. And new features extend DAX beyond notes, helping physicians automatically draft referrals, after-visit instructions and diagnostic evidence.

LinkedIn’s AI tools help hirers find qualified candidates faster, and hirers who use AI assistant messages see a 44% higher acceptance rate

LinkedIn’s first agent hiring assistant will help hirers find qualified candidates faster by tackling the most time-consuming task. Already hirers who use AI assistant messages see a 44% higher acceptance rate compared to those who don’t. And our hiring business continues to take share.

In September 2024, Microsoft introduced a new AI companion experience – powered by Copilot – that includes voice and vision capabilities, allowing users to browse and converse with Copilot simultaneously

With Copilot, we are seeing the first step towards creating a new AI companion for everyone with new Copilot experience we introduced earlier this month, includes a refreshed design and tone along with improved speed and fluency across the web and mobile. And it includes advanced capabilities like voice and vision that make it more delightful and useful and feel more natural. You can both browse and converse with Copilot simultaneously because Copilot sees what you see. 

Roughly half of Microsoft’s cloud and AI-related capex in 2024 Q3 (FY2025 Q1) are for long-lived assets that will support monetisation over the next 15 years and more, while the other half are for CPUs and GPUs; the capex spend for CPUs and GPUs are made based on demand signals; management will be looking at inference demand to govern the level of AI capex for training; management sees that growth in capex will eventually slow and revenue growth will increase, but how fast that happens will depend on the pace of adoption of AI; the capex that Microsoft has been committing is a sign of management’s commitment to grow together with OpenAI, and to grow Azure beyond OpenAI; Microsoft is currently not interested at all in selling GPUs for companies to train AI models and has turned such business away, and this gives management conviction about the company’s AI-related capex

Capital expenditures including finance leases were $20 billion, in line with expectations, and cash paid for PP&E was $14.9 billion. Roughly half of our cloud and AI-related spend continues to be for long-lived assets that will support monetization over the next 15 years and beyond. The remaining cloud and AI spend is primarily for servers, both CPUs and GPUs, to serve customers based on demand signals…

…The inference demand ultimately will govern how much we invest in training because that’s, I think, at the end of the day, you’re all subject to ultimately demand…

…I think in some ways, it’s helpful to go back to the cloud transition that we worked on over a decade ago, I think, in the early stages. And what you did see and you’ll see us do in the same time is you have to build to meet demand. Unlike the cloud transition, we’re doing it on a global basis in parallel as opposed to sequential given the nature of the demand. And then as long as we continue to see that demand grow, you’re right, the growth in CapEx will slow and the revenue growth will increase. And those 2 things, to your point, get closer and closer together over time. The pace of that entirely depends really on the pace of adoption…

…[Question] How does Microsoft manage the demands on CapEx from helping OpenAI with its scaling ambitions?

[Answer] I’m thrilled with their success and need for supply from Azure and infrastructure and really what it’s meant in terms of being able to also serve other customers for us. It’s important that we continue to invest capital to meet not only their demand signal and needs for compute but also from our broader customers. That’s partially why you’ve seen us committing the amount of capital we’ve seen over the past few quarters, is our commitment to both grow together and for us to continue to grow the Azure platform for customers beyond them…

…One of the things that may not be as evident is that we’re not actually selling raw GPUs for other people to train. In fact, that’s sort of a business we turn away because we have so much demand on inference that we are not taking what I would — in fact, there’s a huge adverse selection problem today where people — it’s just a bunch of tech companies still using VC money to buy a bunch of GPUs. We kind of really are not even participating in most of that because we are literally going to the real demand, which is in the enterprise space or our own products like GitHub Copilot or M365 Copilot. So I feel the quality of our revenue is also pretty superior in that context. And that’s what gives us even the conviction, to even Amy’s answers previously, about our capital spend, is if this was just all about sort of a bunch of people training large models and that was all we got, then that would be ultimately still waiting, to your point, for someone to actually have demand, which is real. And in our case, the good news here is we have a diversified portfolio. We’re seeing real demand across all of that portfolio.

Microsoft’s management continues to expect Azure’s growth to accelerate in FY2025 H2, driven by increase in AI capacity to meet growing demand

In H2, we still expect Azure growth to accelerate from H1 as our capital investments create an increase in available AI capacity to serve more of the growing demand.

Microsoft’s management thinks that the level of supply and demand for AI compute will match up in FY2025 H2

But I feel pretty good that going into the second half of even this fiscal year, that some of that supply/demand will match up…

…I do, as you heard, have confidence, as we get a good influx of supply across the second half of the year particularly on the AI side, that we’ll be better able to do some supply-demand matching and hence, while we’re talking about acceleration in the back half.

Microsoft’s management sees Microsoft’s partnership with OpenAI as having been super beneficial to both parties; Microsoft provides the infrastructure for OpenAI to innovate on models; Microsoft takes OpenAI’s models and innovates further, through post-training of the models, building smaller models, and building products on top of the models; management developed conviction on the OpenAI partnership after seeing products such as GitHub Copilot and DAX Copilot get built; management feels very good about Microsoft’s investment in OpenAI; Microsoft accounts for OpenAI’s financials under the equity method

The partnership for both sides, that’s OpenAI and Microsoft, has been super beneficial. After all, we were the — we effectively sponsored what is one of the most highest-valued private companies today when we invested in them and really took a bet on them and their innovation 4, 5 years ago. And that has led to great success for Microsoft. That’s led to great success for OpenAI. And we continue to build on it, right? So we serve them with world-class infrastructure on which they do their innovation in terms of models, on top of which we innovate on both the model layer with some of the post-training stuff we do as well as some of the small models we build and then, of course, all of the product innovation, right? One of the things that my own sort of conviction of OpenAI and what they were doing came about when I started seeing something like GitHub Copilot as a product get built or DAX Copilot get built or M365 Copilot get built…

… And the same also, I would say, we are investors. We feel very, very good about sort of our investment stake in OpenAI…

…  I would say, just a reminder, this is under the equity method, which means we just take our percentage of losses every quarter. And those losses, of course, are capped by the amount of investment we make in total, which we did talk about in the Q this quarter as being $13 billion. And so over time, that’s just the constraint, and it’s a bit of a mechanical entry. And so I don’t really think about managing that. That’s the investment and acceleration that OpenAI is making in themselves, and we take a percentage of that.

Microsoft’s management sees Copilot as the UI layer for humans to interact with AI; Copilot Studio is used to build AI agents to connect Copilot to other systems of the user’s choice; Copilot Studio can also be used to create autonomous AI agents but these AI agents are not fully autonomous because at some point, they will need to notify a human or require an input and that is where Copilot comes in again

The system we have built is Copilot, Copilot Studio, agents and autonomous agents. You should think of that as the spectrum of things, right? So ultimately, the way we think about how this all comes together is you need humans to be able to interface with AI. So the UI layer for AI is Copilot. You can then use Copilot Studio to extend Copilot. For example, you want to connect it to your CRM system, to your office system, to your HR system. You do that through Copilot Studio by building agents effectively.

You also build autonomous agents. So you can use even — that’s the announcement we made a couple of weeks ago, is you can even use Copilot Studio to build autonomous agents. Now these autonomous agents are working independently, but from time to time, they need to raise an exception, right? So autonomous agents are not fully autonomous because, at some point, they need to either notify someone or have someone input something. And when they need to do that, they need a UI layer, and that’s where, again, it’s Copilot.

So Copilot, Copilot agents built-in Copilot Studio, autonomous agents built in Copilot Studio, that’s the full system, we think, that comes together.

Netflix (NASDAQ: NFLX)

Within entertainment, Netflix’s management thinks the most important question for AI is whether it can help creators produce even better content; the ability of AI to reduce costs in content creation is of secondary importance

 Lots of hype, good and bad, about how AI is going to impact or transform the entertainment industry. I think that the history has been that entertainment and technology have worked hand-in-hand throughout the history of time. And it’s very important, I think, for creators to be very curious about what these new tools are and what they could do. But AI needs to pass a very important test. Actually, can it help make better shows and better films? That is the test and that’s what they got to figure out. But I’ve said this before and I will say it again. We benefit greatly from improving the quality of the movies and the shows much more so than we do from making them a little cheaper. So any tool that can go to enhance the quality, making them better is something that is going to actually help the industry a great deal.

Paycom Software (NYSE: PAYC)

Paycom’s management developed an AI agent internally for the company’s service team to help the team provide even better service; the AI agent improved Paycom’s immediate response rates by 25% without any additional human interaction; the AI agent was built in house; Paycom is using AI in other areas, such as in several existing and upcoming products

Internally, we developed and deployed an AI agent for our service team. This technology utilizes our own knowledge-based semantic search model and enables us to provide service to help our clients more quickly and consistently than ever before.The AI agent continually improves over time and is having an impact on helping our clients achieve even more value out of their relationship with Paycom. By utilizing our own AI agent, we were able to connect our clients to the right solution faster, improving our immediate response rates by 25% without any additional human interaction…

…[Question] Interesting to hear about using AI in the customer service organization. I’m curious if that’s technology that Paycom has built or if you’re using a third party.

[Answer] So that’s internal. We built it ourselves, and we’ve been using it. And so it gets better and better as we mentioned on the call. It’s sped up our process by 25% as far as being able to connect clients to the solution quicker, whether that be a configuration question, a tax question or what have you. And so that’s really been helpful to us, and it continues to do more and more from that perspective…

…[Question] A follow-up on the AI agent or the AI technology that you’re developing. Do you see an opportunity in the future to productize what you’re developing internally, maybe like in your — in future versions of your recruiting product or other products in your platform?

[Answer] I would say this isn’t the only area in which we’re using AI. We have it in several products that we both have released and will be releasing. And so there’s definitely opportunities to monetize AI. As far as this particular solution, it’s really helping us on the back end and helping our client as well. So I think we’re going to see results and benefits from that in other areas of efficiency across the board within our own organization.

Shopify (NASDAQ: SHOP)

Shopify recently enhanced Shopify Flow, a low-code workflow automation app, with a new admin API connector that provides an additional 304 new automation actions

Let’s start with Shopify Flow. A low-code workflow automation app that empowers merchants to build custom automations and help them run their businesses more efficiently. This includes a new automation trigger based on the merchant’s custom data and newly completed admin API connector that provides an additional 304 new actions to use in their automations. And as a result, Flow has become a much more powerful tool, enabling merchants to update products, process customer form submissions, edit orders and so much more.

The Shopify Inbox feature now uses AI to suggest personalised replies for merchants to respond to customer inquiries; half of merchants’ responses are now using the AI-suggested replies; fast customer response helps lift conversion rates for merchants; the replies feature may not seem like a big deal, but it actually helps free up a lot of time for merchants to focus on building products

Within Shopify Inbox, this product now uses AI to suggest replies based on each merchant’s unique store information making it super easy for merchants to respond quickly and accurately to customer inquiries. In fact, on average, merchants are using the Suggest Replies for about half of their responses, edited or not, showing just how effective this feature has become. Replying can quickly boost conversion rates, which means more sales for our merchants and in turn, for Shopify…

…I mentioned suggest replies in Shopify Inbox, which may not seem like a big deal, but it’s a huge deal because it means merchants can spend more of their time focused on the things that they need to be focused on like building our products.  

The Shop App has a new merchant-focused home feed that is powered by machine learning models to increase shopper engagement; the new home feed has led to an 18% increase in sessions where a buyer engaged with a recommendation; management thinks the combination of search with AI will make the search function on the Shop App a lot more relevant and personalised

This quarter, the Shop App launched a new merchant-focused home feed, showcasing the diversity and the richness of brands on Shop. The experience uses new machine learning models to help buyers keep up with the brands they love and discover new brands based on their preferences. These changes have already led to early success with an 18% increase in sessions where a buyer engaged with a recommendation…

…We also think Search and AI together makes the Shop search way more relevant, way more personalized. That is also very compelling.

Essentially every Shopify internal department is using AI to be more productive

Support engineering, sales, finance, just about every department internally is using AI in some way to get more efficient, more productive.

Shopify’s management thinks the integration of AI in search will change how consumers find merchants and products, but Shopify has helped merchants navigate many similar changes before, and Shopify will continue to help merchants navigate the AI-related changes

In terms of where consumers find merchants or find products, yes, AI and search is going to change. But to be clear, this entire flow and discovery process has been changing for many years. It’s the reason that you saw us integrate with places like YouTube or more recently, Roblox or TikTok or Instagram…

…You can rest assured that when consumers shift their buying preferences, their discovery preferences, their search preferences, and they’re looking for great products from great brands, Shopify will ensure that our merchants are able to do so. And that’s the reason even some of the more nuanced or some of the more — as you know, Shopify has an integration to Spotify. Why? Because some merchants that also have very large followings as a musician have massive followings on their artist profile, the fact that so you can now show Shopify products on your artist profile means for that particular segment of merchants, they can easily — they now have a new surface area in which to conduct business. And that’s the same thing when it comes to AI and search. 

Taiwan Semiconductor Manufacturing Company (NYSE: TSM)

TSMC’s management expects TMSC’s business in 2024 Q4 to be supported by strong AI-related demand; management sees very strong AI demand in 2024 H2, leading to higher capacity utilisation rate for TSMC’s leading-edge 3nm and 5nm process technologies; management now expects server AI processors to account for mid-teens percent of TSMC’s total revenue in 2024 (previous expectation was for low-teens percent)

Moving into fourth quarter. We expect our business to continue to be supported by strong demand for our leading-edge process technologies. We continue to observe extremely robust AI-related demand from our customers throughout the second half of 2024, leading to increasing overall capacity utilization rate for our leading-edge 3-nanometer and 5-nanometer process technologies…

…We now forecast the revenue contribution from server AI processors to more than triple this year and account for mid-teens percentage of our total revenue in 2024.

TSMC’s management defines server AI processors as GPUs, AI accelerators, and CPUs for training and inference

At TSMC, we defined server AI processor as GPUs, AI accelerators and CPUs performing training and inference functions and do not including — include networking, edge or on-device AI.

TSMC’s management thinks AI demand is real, based on TSMC’s own experience of using AI and machine learning in its operations; a 1% productivity gain for TSMC is equal to a tangible NT$1 billion return on investment (ROI); management thinks TSMC is not the only company that has benefitted from AI applications

Whether this AI demand is real or not, okay, and my judgment is real, we have talked to our customers all the time, including hyperscaler customers who are building their own chips, and almost every AI innovators is working with TSMC. And so we probably get the deepest and widest look of anyone in this industry. And why I say it’s real? Because we have our real experience. We have using the AI and machine learning in our fab and R&D operations. By using AI, we are able to create more value by driving greater productivity, efficiency, speed, qualities. And think about it, let me use, 1% productivity gain, that was almost equal to about TWD 1 billion to TSMC. And this is a tangible ROI benefit. And I believe we cannot be the only one company that have benefited from this AI application. So I believe a lot of companies right now are using AI and — for their own improving productivity, efficiency and everything.

TSMC’s management thinks AI demand is just at the beginning

[Question] Talk a little bit about what you think about the duration of this current semiconductor up-cycle? Do you think it will continue into the next couple of years? Or are we getting closer to the peak of the cycle?

[Answer] The demand is real and I believe it’s just the beginning of this demand, all right? So one of my key customers said, the demand right now is insane, that it’s just the beginning. It’s [ a form of scientific ] to be engineering, okay? And it will continue for many years.

When TSMC builds fabs to meet AI demand, management has a picture in mind of what the long-term demand picture looks like

[Question] Keen to understand how TSMC gets comfortable with customer demand for AI beyond 2025. And I ask this because it takes a couple of years before you can build a fab, so you need to be taking early — an early view on what does AI look like in 2026, 2027. So how are you specifically cooperating on long-term plans for capacity with these AI customers? And what commitments are these customers giving you?

[Answer]  let me say again that we did talk to a lot of our customers. Almost every AI innovator are working with us and that’s including the hyperscalers. So if you look at the long-term market — long-term structure and market demand profile, I think we have some picture in our mind and we make some judgment, of course, and we work with them on a rolling basis. So how we prepare our capacity, actually, just like Wendell said, we have a disciplined and [ a rollout ] system to plan the appropriate level of capacity. And that — to support our customers’ need, also to maximize our shareholders’ value. That’s what we’re always keeping our mind.

There’s more AI content that goes into the chips in PCs (personal computers) and smartphones; management expects the PC and smartphone business of TSMC to be healthy in the next few years because of AI-related applications 

The unit growth of PC and smartphone is still in the low single digit. But more importantly is the content. The content now we put more AI into that, they are cheap and so the silicon area increased faster than the unit growth. So again, I would like to say that for this PC and the smartphone business, not — is gradually increased and we expect it to be healthy in the next few years because of our AI-related applications.

Advanced packaging is currently a high single-digit percentage of TSMC’s revenue and management expects it to grow faster than TSMC’s overall business over the next 5 years; the margins of advanced packaging are improving, but it’s not at the corporate average level yet

Advanced packaging in the next several years, let’s say, 5 years, will be growing faster than the corporate average. This year, it accounts for about high single digit of our revenue. In terms of margins, yes, it is also improving. However, it’s still — it’s approaching corporate, but not there yet.

Demand for TSMC’s CoWoS (advanced packaging) continues to far exceed supply, even though TSMC has doubled CoWoS capacity compared to a year ago and will double it again

Let me share with you today’s situation is our customer’s demand far exceed our ability to supply. So even we work very hard and increase the capacity by about more than twice, more than 2x as of this year compared with last year and probably double again, but still not enough. And — but anyway, we are working very hard to meet the customers’ requirement.

Tencent (NASDAQ: TCEHY)

Tencent’s management is increasingly seeing tangible benefits from deploying AI across the company’s business; management wants to continue investing in AI; the most significant benefits are in content recommendation and targeting, which directly benefits Tencent’s business and advertising revenue; management also sees AI as a productivity tool, as Tencent’s Copilot is being used by Tencent’s software engineers frequently and is helping them generate efficiency gains; management is trying to incorporate AI into a lot of Tencent’s products, but they think it will take a few more quarters before real use cases show up

We are increasingly seeing a tangible benefit of deploying AI across our products and operations, including marketing services and cloud. And we’ll continue investing in AI technology, tools and solutions that assist users and partners…

…I think that the most significant one right now is actually around content recommendation and at targeting because the AI in — the AI engine in those two use cases are generating a significant amount of additional user time and at the same time, it’s generating a higher incremental targeting rate, response rate for our apps and both of them actually are direct benefits to the business and direct benefit to ad revenue. and both of the video accounts and our performance at revenue actually at scale…

… It’s actually a productivity tool that everybody is using on a frequent basis, for example, our Copilot is being used by our engineers across the board on a very frequent basis, and it’s actually generating efficiency gains for our business. and different businesses, a lot of our products are actually testing our Hunyuan and trying to incorporate AI into the — either the production process, right, so that they would gain efficiency or in the user experience use case so that it can actually make their user experience better. So I would say, right now, we are seeing more and more adoption among all our different products and services. It would take probably a few more quarters for us to see some real use cases at scale. 

Tencent’s management used the company’s foundation AI model, Tencent Hunyuan, to facilitate tagging and categorisation of content and advertising materials; Tencent also upgraded its machine learning platforms to deliver better advertising targeting; marketing services revenue from video accounts was up 60% year-on-year; Mini Programs marketing services revenue had robust growth; Tencent used large language models (LLMs) to improve the relevance of Weixin Search results, leading to higher commercial queries and click-through rates, and consequently, an increase in search revenue of more than 100%

Our Marketing Services revenue grew 17% year-on-year. Strength in games and e-commerce categories outweighed weakness in real estate and food and beverage. The Paris Olympics somewhat cushioned industry-wide weakness in brand ad revenue during the third quarter but this positive factor will be absent in the fourth quarter. We leveraged our foundation model, Tencent Hunyuan to facilitate tagging and categorization of content and ad materials. And we upgraded our machine learning platforms to deliver more accurate ad targeting.

By property, video accounts marketing services revenue increased over 60% year-on-year. As we systematically strengthen transaction capabilities in Weixin, advertisers increasingly utilize our marketing tools to boost their exposure and drive sales conversion. Mini Programs marketing services revenue grew robustly year-on-year as our Mini Games and Mini Dramas provided high-value rewarded video ad inventory and generated incremental closed-loop demand. And for Weixin Search, we utilized large language model capabilities to facilitate understanding of complex queries and content, enhancing the relevance of search results. Commercial queries increased and click-through rate improved, and our search revenue more than doubled year-on-year.

Tencent enjoyed swift year-on-year growth in GPU-focused cloud revenue and this revenue stream is now a teens percentage of Tencent’s infrastructure as a services revenue; Tencent has released Tencent Hunyuan Turbo, the new generation of its foundation AI model, which uses a heterogeneous mixture of experts architecture; compared to the previous generation, Hunyuan Turbo’s training and inference efficiency has doubled while its inference costs has halved; Hunyuan Turbo is ranked first for general capabilities among foundation AI models in China; Tencent has open-sourced Hunyuan models; management sees Tencent’s AI revenue being lesser than US cloud companies because China does not have a large enterprise, SaaS, and startup markets for AI services 

Our cloud revenue from GPUs primarily used for AI grew swiftly year-on-year and now represents a teens percentage of our infrastructure as a services revenue. We released Tencent Hunyuan Turbo, which utilizes a heterogeneous mixture of experts architecture, doubling our training and inference efficiency and halving inference cost versus its predecessor Hunyuan Pro. SuperCLUE ranked Hunyuan Turbo first for general capabilities among domestic peers. Last week, we made the Hunyuan large model and the Hunyuan 3D generation models available on an open-source basis. Our international cloud revenue increased significantly year-on-year. We leveraged domain expertise in areas such as games and live streaming and competitive pricing to win international customers…

…The IAS revenue is now in the teens generated by AI. But having said that, we think the amount of AI revenue is actually less than U.S. cloud companies. And the main reason is because, number one, China doesn’t really have a every big enterprise market. And if you look at the U.S., a lot of enterprises are actually sort of fitted in with AI and the — in testing out how AI can do for their business that they’re actually buying a lot of compute, which is not happening in China yet. There’s a very big SaaS ecosystem in the U.S., which everybody is actually trying to add AI to their functionality and thus charge the customers more. And that SaaS ecosystem is not really that vibrant in China. And thirdly, there are also fewer AI start-ups in China, which are actually buying a lot of compute. So as a result, the AI revenue in China on the cloud side is somewhat sort of at scale for us, but I think it will not be exploding like in the U.S. 

Tencent’s management does not want to embed commercial search results into the company’s AI chatbot, YongBao right now; the current focus for YongBao is on growing usage, not monetisation

[Question] Will you ramp up the Gen AI chatbot, would that eventually embed with the commercial sponsor answer as well?

[Answer] In terms of whether YongBao will embed commercial search results, the answer is no. for the current time, we’re focused on making YongBao be as appealing and attractive to users as it can be and we’re not focused on premature monetization.

Tencent’s management plans to invest in capex for AI, but the amount of investment will be small compared to the companies in the USA

If you look at CapEx, right, we believe we have a progressive CapEx plan, especially given that the development of a cloud business and the advent of AI, but at the same time, it’s measured compared to a lot of the U.S. companies. 

Tencent’s management sees the company’s advertising business being driven by 3 factors, namely consumer spending, Tencent’s ability to utilise AI to continue boosting click-through rates from currently low levels, and deployment of more inventory

In terms of the drivers for 2025, the overall macro environment would obviously be important accelerator or decelerator or neutral force for the aggregate advertising market. And that in turn will be a function primarily of consumer confidence. And consumer spending behavior. Now within that overall environment, our relative performance will be a function of, first of all, our advertising technology and our ability to utilize GPUs, utilize neural networks to continue boosting click-through rates from the current very low levels to higher levels that mechanically translates into more revenue. And then secondly, our deployment of specific inventories, in particular, video accounts, in particular, Weixin Search.

Tesla (NASDAQ: TSLA)

Tesla’s management released FSD v12.5 in 2024 Q3, which has increased data and training compute, and 5x increase in parameter count; Tesla also released Actually Smart Summon (your vehicle will autonomously drive to you in parking lots) and FSD for Cybertruck, which includes end-to-end neural nets for highway driving for the first time; version 13 of FSD is coming soon and it is expected to have a 5-6 fold improvement in miles between interventions compared to version 12.5; over the course of 2024, FSD’s improvement in miles between interventions has been at least 3 orders of magnitude; management expects FSD to become safer than human in 2025 Q2; Tesla vehicles on autopilot have 1 crash per 7 million miles, compared to 1 crash per 700,000 miles for the US average; Tesla has earned $236 million in revenue in 2024 Q3 from the release of FSD for Cybertruck and Actually Smart Summon

In Q3, we released the 12.5 series of FSD (Supervised)1 with improved safety and comfort thanks to increased data and training compute, a 5x increase in parameter count, and other architectural choices that we plan to continue scaling in Q4. We released Actually Smart Summon, which enables your vehicle to autonomously drive to you in parking lots, and FSD (Supervised) to Cybertruck customers, including end-to-end neural nets for highway driving for the first time…

…Version 13 of FSD is going out soon… We expect to see roughly a 5- or 6-fold improvement in miles between interventions compared to 12.5. And actually, looking at the year as whole, the improvement in miles between interventions, we think will be at least 3 orders of magnitude. So that’s a very dramatic improvement in the course of the year, and we expect that trend to continue next year.  The current total expectation, internal expectation for the Tesla FSD having longer miles between interventions [indecipherable] is the second quarter of next year, which means it may end up being in the third quarter but it’s next — it seems extremely likely to be next year…

…miles between critical interventions, mentioned by Elon already made 100x improvement with 12.5 from starting of this year and then with v13 release, we expect to be 1,000x from the beginning, from January of this year on production software. And this came in because of technology improvements going to end-to-end, having higher frame rate, partly also helped by hardware force, more capabilities, so on. And we hope that we continue to scale the neural network, the data, the training compute, et cetera. By Q2 next year, we should cross over the average, even in miles per critical intervention [indiscernible] in that case…

…Our internal estimate is Q2 of next year to be safer than human and then to continue with rapid improvements thereafter…

… So we published Q3 vehicle safety report, which shows 1 crash for every 7 million miles on autopilot that compares with the U.S. average of crash roughly every 700,000 miles. So it’s currently showing a 10x safety improvement relative to the U.S. average…

…We released FSD for Cybertruck and other features like actually small [indiscernible] like Elon talked about in North America, which contributed $326 million of revenues in the quarter. 

Tesla has deployed a 29,000 H100 cluster and expects to have a 50,000 H100 cluster by the end of October 2024, to support FSD and Optimus; Tesla is not training compute-constrained; Tesla’s AI has gotten so good that it now takes a long time to decide which version of the software is better because mistakes happen so infrequently and that is the big bottleneck to Tesla’s AI development; management is being very careful with AI-spending

We deployed and are training ahead of schedule on a 29k H100 cluster at Gigafactory Texas – where we expect to have 50k H100 capacity by the end of October…

…We continue to expand our AI training capacity to accommodate the needs of both FSD and Optimus. We are currently not training compute-constrained. [indiscernible] probably the big limiting factors of the FSD is actually getting so good that it takes us a while to actually find mistakes. And when you start getting to where it can take 10,000 miles to find a mistake, it takes a while to actually figure out which it is, is software A better than software B? It actually takes a while to figure it out because neither 1 of them makes the mistakes, would take a long time to make mistakes. So it’s actually the single biggest limiting factor is how long does it take us to figure out which version is better? Sort of a high-class problem…

… One thing which I’d like to elaborate is that we’re being really judicious on our AI compute spend to and saying how best we can utilize the existing infrastructure before making further investments…

…We still got to take which models are performing better. So the validation network to picking the models because as mentioned the miles between intervention is pretty large. We had to drive a lot of miles going close to. We do have simulation and other ways to get those metrics. Those 2 help, but in the end, that’s a big bottleneck. That’s why we’re not training-compete constrained alone. 

In the 10 October 2024 “We, Robot” event by Tesla, the company had showcased 50 autonomous vehicles, including 20 Cybercabs; the Cybercabs had no steering wheel, brake, or accelerator pedals, so they were truly autonomous

On October 10, we laid out a vision for an autonomous and future that I think is very compelling that the Tesla team did a phenomenal job there with actually giving people an option to experience the future, where you have humanoid robots working among the craft, not with a canned video and a presentation or anything but walking among crowd so he drinks and whatnot. And we had 50 autonomous vehicles. There were 20 Cybercabs but there were an additional 30 Model Ys, operating fully autonomously the entire night, carrying thousands of people with no incidents the entire night…

…Worth emphasizing that the Cybercab had no steering wheel or brake or accelerator panels, meaning there was no way for anyone to intervene manually a unit if they wanted to and the whole night went very smoothly.

Tesla is already offering autonomous ridehailing for Tesla employees in the Bay Area; the ridehailing service currently has a safety driver; Tesla has been testing autonomous ridehailing for some time; Elon Musk expects ridehailing to be rolled out to the public in California and Texas in 2025, and maybe other states in the USA; California has a lot of regulations around ridehailing, but there’s still a regulatory pathway; Tesla actually has passed Federal regulations for ridehailing, but it’s the state level where there are problems

We have for Tesla employees in the Bay Area, we already are offering ridehailing capabilities. So you can actually, with the development app, you can request a ride and it will take you anywhere in the Bay Area. We do have a safety driver for now but it’s not required to do that…

… We’ve been testing it for the good part of the year. And the building blocks that we needed in order to build this functionality and deliver it to production, we’ve been thinking about working on for years…

…So it’s not like we’re just starting to think about this stuff right now while we’re building out the early stages of our ridehailing network. We’ve been thinking about this for quite a long time, and we’re excited to get the functionality out there…

…We do expect to roll out ridehailing in California and Texas next year to the public. Now California is somewhere — there’s quite a long regulatory approval process. I think we should get approval next year but it’s contingent upon regulatory approval. Texas is a lot faster so it’s — we’ll definitely have available in Texas and probably have it available in California, subject to regulatory approval. And then — and maybe some other states actually next year as well, but at least California and Texas…

…[Question] Elon mentioned unsupervised FSD in California and Texas next year. Does that mean regulators have agreed to it in the entire state for existing hardware 3 and 4 vehicles?

[Answer] As I said earlier, California loves regulation… here’s a pathway. Obviously, Waymo operates in California so there’s just a lot of forms and a lot of approvals that are required. I mean, I’d be shocked if we don’t get approved next year, but it’s just not something we totally control. But I think we will get approval next year in California and Texas. And towards the Bay Area, branch out beyond California and Texas…

…I think it’s important to reiterate this like on our certifying a vehicle at the federal level in the U.S. is done by meeting FMVSS regulations. Our vehicles today that are produced there capable to meet all those regulations, the Cybercab regulations. And so the deployment of the vehicle to the road is no limitation, but its limitation is what you said at the state level where they control autonomous vehicle deployment. Some states are relatively easy, as you mentioned, for Texas. It’s other ones have always like California that may take a little longer. The other ones hadn’t set up anything yet. 

Tesla’s management acknowledges that there’s a chance that Tesla vehicles with Hardware Version 3 may not support unsupervised full self-dricing, and if so, Tesla will replace the hardware for those vehicle fleets for free into Hardware Version 4

By some measure, Hardware 4 has really several times the capability of Hardware 3. It’s easier to get things to work with then it takes a lot of effort to sort of squeeze that box analyst hat Hardware 3. And there is some chance that Hardware 3 is — does not achieve the safety level that allows for unsupervised FSD. There is some chance of that. And if that turns out to be the case, we will upgrade those group bought Hardware 3 FSD for free. And we have designed the system to be upgradeable so it’s really just to sort of switch out the computer thing, the camera, the cameras are capable. But we don’t actually know the answers of that. But if it does turn out, we’ll make sure we take care of those who are.

Tesla’s management thinks real-world AI in self-driving cars is different from LLMs (large language models) in that (1) real-world AI requires massive amounts of context that needs to be processed with a small amount of compute power and the way around this limitation is to do massive amounts of training so that the amount of inference that needs to be done is tiny, and (2) it’s difficult to sort out what data coming in from the video feed is important for the training

The nature of real world AI is different from LLM in that you have a massive amount of context. So like the — you’ve got a case of Tesla cameras that [indiscernible] if you include tunnel camera that — so you’ve got some context. And that is then distilled down into a small number of control outputs, whereas it’s like it’s very rare to have, in fact, I’m not sure any LLM out there can do gigabytes of context. And then you’ve got to then process that in the car with a very small amount of compute power. It’s all doable and it’s happening, but it is a different problem than what, say, a Gemini or OpenAI is doing.

And now part of the way you can make up for the fact that the inference computer is quite small, it is by spending a lot of effort on training. And just like a human the way you train on something, the less metal work takes when you try to — when you do it, like when the first time like a driving it absorbs your whole mind. But then as you train more and more on driving then the driving becomes a background task. It doesn’t — it only absorbs a small amount of your mental capacity because you have a lot of training. So we can make up for the fact that the inference computers — it’s tiny compared to a 10-kilowatt bank of GPUs because you’ve got a few hundred watts of inference compute. We can make up that with heavy training.

And then there’s also vast amounts to the actual petabytes of data coming in are tremendous. And then sorting out what training is important, of the vast amounts of video data coming in the feed, what is actually most important for training. That’s also quite difficult.

Tesla’s management thinks Elon Musk’s xAI AI-startup has been helpful to Tesla, but the 2 companies are focused on very different kinds of AI problems 

Well, I should say that xAI has been helpful to Tesla AI quite a few times in terms of things like scaling it, like training, just even like recently in the last week or so, improvements in training, where if you’re doing a big training run and it fails, being able to continue training and to recover from a training run, has been pretty helpful. But there are different problems. xAI actually is working on artificial general intelligence or artificial super intelligence. Tesla is autonomous cars and autonomous robots. There are different problems…

…Yes, Tesla is focused on real-world AI. And I was saying earlier, it is quite a bit different from LLM. But you have massive context in the form of video and some amount of audio, that’s going to be distilled like extremely efficient inference compute. I do think Tesla is the most efficient in the world in terms of inference compute because out of necessity, we have to be very good at efficient inference. We can’t put 10 kilowatts of GPUs in a car. We’ve got a couple of hundred watts. And it’s a pretty well designed Tesla AI chip, but it’s still a couple hundred watts. But there are different problems. I mean, the stuff at xAI. We’re running inference. I mean, it is running inference, answering questions on a 10-kilowatt rack. It’s like you can’t put that in a car. It’s a different problem.

Elon Musk created xAI because he thought there wasn’t a truth-seeking AI company being built

xAI is because I felt there wasn’t there wasn’t a truth-seeking digital super intelligence company out there, like that’s what it came down to. There needed to be a truth-seeking AI company that is very [indiscernible] about being truthful. I’m not saying xAI is perfect, but that is truth, but that is at least the explicit aspiration, even if something is politically incorrect, it would still be truhtful. I think this is very important for AI safety. So I think xAI has been helpful to Tesla and will continue to be helpful to Tesla, but they are very different problems.

There are no other car companies that has a world-class AI and chip-design team like Tesla

And like what other car company has a world-class chip design team? Like zero. What other car company has a world-class AI team like Tesla does? 0. Those were all startups that were created from scratch.

The Trade Desk (NASDAQ: TTD)

The incorporation of AI into Kokai, Trade Desk’s ad-buying platform, is encouraging adoption of Trade Desk by CFOs and CMOs

While there has been a lot of macro focus on the reduction in inflation rates, historic highs for stock market indices and growing indications of a soft landing, that’s not necessarily translating to consumer confidence, which is why CMOs are becoming much more closely aligned with their CFOs. CFOs want more evidence than ever that marketing is working. And for CFOs that doesn’t just mean traditional marketing KPIs. It means growing the top line business. All of our AI and data science injection into Kokai, our latest product release, is encouraging CMOs and CFOs to lean more and more on TTD to deliver real, measured growth…

…When CMOs faced pressure to achieve more with less, they turn to platforms like ours for flexibility, precision and measurable results.

Companies need an AI strategy, and Trade Desk’s AI product, Koa, is a great copilot for advertising traders; Trade Desk has plenty of opportunities in an AI-world because of the data assets it has, and management wants to improve all aspects of the company through AI

Every company needs an AI strategy. Our AI product, Koa, is a great copilot for traders. But this is only the beginning. There are endless possibilities for us as we have 1 of the best data assets on the Internet. The learnings that come from buying the global open Internet outside of walled gardens. To win in this new frontier, we’re looking across our entire suite of products, algorithms and features and asking how they all can be advanced by AI.

Visa (NYSE: V)

For Risk and Identity Solutions within value-added services, Visa wants to acquire Featurespace, an AI payments protection tech company that will enable Visa to enhance fraud prevention tools to clients and protect consumers in real time; Worldline, a Visa partner, will be using Decision Manager to provide businesses with AI-based e-commerce fraud detection abilities; Featurespace is a world leader in providing AI solutions to fight fraud

In Risk and Identity Solutions, we recently announced our intent to acquire Featurespace, a developer of real-time artificial intelligence payments protection technology. It will enable Visa to provide enhanced fraud prevention tools to our clients and protect consumers in real-time across various payment methods.  And Worldline, already a Visa partner and leading European acquirer, will soon be launching an optimized fraud management solution, utilizing Decision Manager to provide businesses with AI-based e-commerce fraud detection capabilities…

…Featurespace is a world leader in providing AI-driven solutions to combat that fraud, to reduce that fraud, to enable our clients and partners to continue to serve their customers in a safe way.

Visa’s management sees AI as being a driver of productivity across multiple functions in the company, and as a differentiator in its products and services

[Question] I just wanted to ask how you see AI playing into the business model. Do you see it more as driving VAS or incremental business model, uplift revenue or cost improvement? Or is it more of a competitive differentiator that will just keep you ahead of your competition?

[Answer] As it relates more broadly to especially generative AI at Visa, I see it really in 2 different buckets. The first is we are adopting it aggressively across our company to drive productivity. And we’ve seen some great results from everywhere to our engineering teams, to our accounting teams, to our sales teams, our client service teams. And we’re still in the early stages of, I think, the very significant impact this will have on the productivity of our business. I also see it as a real differentiator to the products and services that we’re putting in market. You’ve heard me talk about some of the new risk capabilities, risk management capabilities, for example, that we’ve deployed in the account-to-account space, which are all enabled with generative AI. You mentioned Featurespace. We’ve had some really good success in other parts of both our value-added services business and the broader consumer payments business as well. And we’ve got a product pipeline that is very heavily tilted towards some, we think, very exciting generative AI capabilities that hopefully you’ll hear more from us on soon.


Disclaimer: The Good Investors is the personal investing blog of two simple guys who are passionate about educating Singaporeans about stock market investing. By using this Site, you specifically agree that none of the information provided constitutes financial, investment, or other professional advice. It is only intended to provide education. Speak with a professional before making important decisions about your money, your professional life, or even your personal life. I have a vested interest in Alphabet, Amazon, Apple, ASML, Coupang, Datadog, Fiverr, Mastercard, Meta Platforms, Microsoft, Netflix, Shopify, TSMC, Tesla, and Visa. Holdings are subject to change at any time.

The US Stock Market And US Presidents

History’s verdict on how US stocks have performed under different US presidents

The US presidential election is just a few weeks away. And as usual, large swathes of participants in the US stock market are trying to predict the victor because they think it will have significant consequences on how US stocks perform. I don’t have a crystal ball. But I do have history’s verdict, thanks to excellent research from the US-based wealth management firm, Ritholtz Wealth Management, that I came across recently.

Here’s a table showing the annualised returns of the S&P 500 for each US President, going back to Theodore Roosevelt’s first term in 1901:

Table 1; Source: Ritholtz Wealth Management 

I think the key takeaway from the table is that how the US stock market performs does not depend on what political party the US President belongs to. Republican presidents have presided over bad episodes for US stocks (Herbert Hoover, Richard Nixon, and George W. Bush, for example) as well as fantastic times (Calvin Coolidge, Dwight Eisenhower, and Ronald Reagan, for example). The same goes for Democrat presidents, who have led the country through both poor stock market returns (Woodrow Wilson and Franklin Roosevelt, for example) as well as great gains (Franklin Roosevelt, Lyndon Johnson, and Barack Obama, for example). Presidents do not have that much power over the financial markets. Don’t let politics influence your investing decision-making.


Disclaimer: The Good Investors is the personal investing blog of two simple guys who are passionate about educating Singaporeans about stock market investing. By using this Site, you specifically agree that none of the information provided constitutes financial, investment, or other professional advice. It is only intended to provide education. Speak with a professional before making important decisions about your money, your professional life, or even your personal life. I don’t have a vested interest in any company mentioned. Holdings are subject to change at any time.

More Of The Latest Thoughts From American Technology Companies On AI (2024 Q2)

A collection of quotes on artificial intelligence, or AI, from the management teams of US-listed technology companies in the 2024 Q2 earnings season.

Last month, I published The Latest Thoughts From American Technology Companies On AI (2024 Q2). In it, I shared commentary in earnings conference calls for the second quarter of 2024, from the leaders of technology companies that I follow or have a vested interest in, on the topic of AI and how the technology could impact their industry and the business world writ large. 

A few more technology companies I’m watching hosted earnings conference calls for 2024’s first quarter after I prepared the article. The leaders of these companies also had insights on AI that I think would be useful to share. This is an ongoing series. For the older commentary:

Here they are, in no particular order:

Adobe (NASDAQ: ADBE)

Adobe’s management believes that Adobe’s approach to AI is highly differentiated; the greatest differentiation is at the interface layer, as Adobe is able to rapidly integrate AI across its product portfolio and allow users to realise value

Adobe’s customer-centric approach to AI is highly differentiated across data, models and interfaces…

…Our greatest differentiation comes at the interface layer with our ability to rapidly integrate AI across our industry-leading product portfolio, making it easy for customers of all sizes to adopt and realize value from AI. 

Adobe’s Firefly models are trained on data that allow outputs to be commercially safe and management thinks this feature of being commercially safe is really important to enterprises; Adobe now has Firefly models for imaging, vector, design, and video; Firefly is in a wide range of Adobe products; Firefly has powered more than 12 billion generations since its launch in March 2023 (was 9 billion in 2024 Q1); management’s strategy is to build Firefly models into more streamlined and precise workflows within Adobe’s products; Adobe has Firefly Service APIs for organisations to generate content at scale, and the API calls tripled quarter-on-quarter; Firefly Service APIs are gaining real traction

We train our Firefly models on data that allows us to offer customers a solution designed to be commercially safe. We have now released Firefly models for imaging, vector and design and just previewed a new Firefly video model…

… Firefly-powered features in Adobe Photoshop, Illustrator, Lightroom, and Premier Pro help creators expand upon their natural creativity and accelerate productivity. Adobe Express is a quick and easy create anything application, unlocking creative expression for millions of users. Acrobat AI Assistant helps extract greater value from PDF documents. Adobe Experience Platform AI Assistant empowers brands to automate workflows and generate new audiences and journeys. Adobe GenStudio brings together content and data, integrating high-velocity creative expression with the enterprise activation needed to deliver personalization at scale…

…We have now surpassed 12 billion Firefly-powered generations across Adobe tools…

… Our strategy is to build technology that will create more streamlined and precise workflows within our tools through features like text-to-template in Express, Generative Fill in Photoshop, Generative Recolor in Illustrator, Generative Remove in Lightroom and the upcoming Generative Extend for Video and Premier Pro. We’re exposing the power of our creative tools and the magic of generative AI through Firefly Service APIs so organizations can generate and assemble content at scale…

…The introduction of the new Firefly video model earlier this week at IBC is another important milestone in our journey. Our video model, like the other models in the Firefly family, is built to be commercially safe with fine-grain control and application integration at its core. This will empower editors to realize their creative vision more productively in our video products, including Premier Pro…

…Strong demand for Firefly Services, which provide APIs, tools and services for content generation, editing and assembly, empowering organizations to automate content production while maintaining quality and control. Total API calls tripled quarter over quarter…

…Firefly Services, which is you can think of that also as a consumption model where we have that, it’s off to a really good start. Our ability to give enterprises the ability to automate content, create custom models within enterprises, we’re seeing real traction because it’s a differentiated solution and that it’s designed to be commercially safe…

…One other thing I’d just emphasize there is that the commercial safety is so important to businesses of all sizes, frankly, and that is something that we feel very, very differentiated.

Adobe released significant advancements in AI Assistant across Adobe Acrobat and Reader in 2024 Q2 (FY2024 Q3) and saw 70% sequential growth in AI interactions in AI Assistant; the advancements in AI Assistant include content creation capabilities; Tata Consultancy Services used AI Assistant in Adobe Acrobat to create event summaries of hours of conference videos in minutes; management intends to actively promote subscription plans for Adobe Acrobat and Reader that include generative AI capabilities

For decades, PDF has been the de facto standard for storing unstructured data, resulting in the creation and sharing of trillions of PDFs. The introduction of AI Assistant across Adobe Acrobat and Reader has transformed the way people interact with and extract value from these documents. In Q3, we released significant advancements, including the ability to have conversations across multiple documents and support for different document formats, saving users valuable time and providing important insights. We are thrilled to see this value translate into AI Assistant usage with over 70% quarter-over-quarter growth in AI interactions. 

In addition to consumption, we’re focused on leveraging generative AI to expand content creation in Adobe Acrobat. We’ve integrated Adobe Firefly Image Generation into our edit PDF workflows. We’ve optimized AI Assistant in Acrobat to generate content fit for presentations, e-mails and other forms of communication, and we’re laying the groundwork for richer content creation, including the generation of Adobe Express projects.

The application of this technology across verticals and industries is virtually limitless. Tata Consultancy Services recently used Adobe Premiere Pro to transcribe hours of conference videos and then used AI Assistant in Acrobat to create digestible event summaries in minutes. This allowed them to distribute newsletters on session content to attendees in real-time.

We’re excited to leverage generative AI to add value to content creation and consumption in Acrobat and Reader in the months ahead. Given the early adoption of AI Assistant, we intend to actively promote subscription plans that include generative AI capabilities over legacy perpetual plans that do not.

Adobe GenStudio is integrated across Experience Cloud and Creative Cloud and helps marketers quickly plan, create, store, deliver, and measure marketing content; Vanguard used Adobe GenStudio to increase quality engagement with investors by 176% through one-to-one personalisation, and to enjoy millions in savings

Customers are embracing the opportunity to address their content supply chain challenges with Adobe GenStudio. With native integrations across Experience Cloud and Creative Cloud, GenStudio empowers marketers to quickly plan, create, store, deliver, and measure marketing content and drive greater efficiency in their organizations. Financial services leader Vanguard is creating an integrated content supply chain to serve the strategic goal of deepening their relationships with a broad range of investors. Leveraging the GenStudio solution, Vanguard was able to increase quality engagement by 176% by focusing on one-to-one personalization and to realize millions in savings by improving content velocity and resource allocation with an end-to-end content creation workflow.

Adobe’s management has been very consistent over the past 1-1.5 years in how they have approached AI, and that is, Adobe would be developing a broad set of models for the creative community, and the models would be highly differentiated based on quality, commercial safety, and integrability into Adobe’s product portfolio

I think we’ve been incredibly consistent with what we’ve said, dating back 1 year, 1.5 years ago, where we talked about the fact that we were going to develop the broadest set of models for the creative community. And we were going to differentiate the models based on quality, commercial safety, integratability into our tools and controllability. And as you’ve seen very methodically over the last 18 months, we continue to bring more and more of that innovation to life. And that fundamentally is working as we’ve now started to integrate it much more actively into our base. If you look at it with photography, we now have in our tool, Generative Remove, we have AI-assisted edits in design, we have Generative Pattern, Generative Fill Shape. We have, in Photoshop, we have Gen Remove. We also have Gen Fill, and I can continue on with all the generations, but we’ve also now started to integrate it in Firefly Services for what we’re enabling enterprises to be able to access and use in terms of batch work and through APIs.

Adobe’s management is seeing the accelerated use and consumption of generative AI credits in Adobe’s products play out the way they expected it to; total consumption credits are going up with the introduction of each new generative AI capability 

If you look at sort of how that’s played out, as we talked about, we’re seeing accelerated use and generative credits being consumed because of that deeper integration into all of our tools, and that is playing out as we expected…

…And we do see with every subsequent capability we integrate into the tool, total credits consumed going up. 

Adobe’s management is seeing Adobe enjoying indirect monetisation from the AI features of its products, such as (1) the products having more value and pricing, (2) users being retained better when they use generative AI features, and (3) higher conversion of users when they try out Adobe products

When you look at then how that converts to monetization, first and foremost, we’ve integrated it a lot of that value into our core products with more value and more pricing. We’re also seeing that when people use these generative features, they retain better. We’re also seeing that when people come to Adobe to try our Creative Cloud applications or Express application, they’re able to convert better. And so there are all these ancillary implied benefits that we’re getting. 

For direct monetisation of the AI features in Adobe’s products, management is thinking of (1) instituting caps on generative AI credit consumption, (2) having AI plans with different AI capabilities; but direct monetisation is currently still not the key focus that management has, because they want to focus on proliferation and usage of generative AI across the user base

In terms of direct monetization, what we’ve said in the past is that the current model is around generative credits, which is I think where you’re going with this. And we do see with every subsequent capability we integrate into the tool, total credits consumed going up. Now what we are trying to do as we go forward, we haven’t started instituting the caps yet. And part of this is, as we’ve said all along, we want to really focus our attention on proliferation and usage across our base. We see a lot of users excited about it. It’s some of the most actively used features that we’ve ever released. And we want to avoid the generation anxiety that people feel. But we’re watching very closely as the economy of generative credits evolves, and we’re going to look at instituting those caps at some point when we feel the time is right and/or we’re also looking at other alternative models. What we did with Acrobat AI Assistant has proven to be very effective. And so we’re also considering other opportunities like having standard CC plans that have a core set of generative capabilities but also having premium API — sorry, premium AI plans that will include things more like video and other things.

Adobe’s management thinks Adobe’s generative AI video models are already pretty capable, but they are going to get better over time; management thinks that the real value of generative AI video models is not in their ability to create a video through a description the user gives, but in their ability to extend the video

I don’t know if you had a chance to see some of the videos we put out there integrated directly into premier, also text to video, images to video, more controllability. We have also the ability now to generate not just themes with humans and dogs and organic animals, but all these like overlays and things that creative professionals actually want to work with. And so we’re very excited about the set of things that they can get out of the box that get going. And human faces and things will just continue to get better…

…I spend a couple of hours with our video team. They have just absolutely hit it out of the park. I mean, the work that they have done, which is leveraging the image models with video, and again, I think to David’s point, the integration with Premier, that’s where we’ve always said, it’s the integration of the model and the application that differentiates it. I think when other models first came out, people were like, “Wow, you can describe it.” That’s just such a small part of where the value is. And the real value is, you have a video, you want to extend it. It’s a game changer in terms of what we can do. So really excited about the stuff that we’re doing in video. 

MongoDB (NASDAQ: MDB)

MongoDB’s management sees AI as a longer-term opportunity for MongoDB; management is seeing companies largely still experimenting with AI applications currently; management thinks inference workloads will come, but monetisation of AI apps will take time

AI continues to be an additional long-term opportunity for our business. At the start of the fiscal year, we told you that we didn’t expect AI to be a meaningful tailwind for our business in fiscal year 2025, which has proven accurate. Based on recent peer commentary, it seems that the industry now mostly agrees with this view. Companies are currently focusing their spending on the infrastructure layer of AI and are still largely experimenting with AI applications. Inference workloads will come and should benefit MongoDB greatly in the long run, but we are still very early when the monetization of AI apps will take time. AI demand is a question of when, not if.

MongoDB’s management has been talking to customers and they think MongoDB is the ideal database for AI apps for five reasons: (1) AI workloads involve a wide variety of data types and MongoDB’s document-model database is meant to handle this variety well, thus providing a well-rounded one-stop solution, (2) MongoDB’s database is high-performance and scalable, and allows AI workloads to utilise real-time operational data, (3) MongoDB’s database is integrated with leading app development frameworks and AI platforms, (4) MongoDB’s database has enterprise-grade security and compliance features, and (5) MongoDB’s database can be run anywhere on the customer’s choice; management feels very good about MongoDB’s positioning for AI

Our discussions with customers and partners give us increasing conviction that we are the ideal data layer for AI apps for a number of key reasons.

First, more than any other type of modern workload, AI-driven workloads require the underlying database to be capable of processing queries against rich and complex data structures quickly and efficiently. Our flexible document model is uniquely positioned to help customers build sophisticated AI applications because it is designed to handle different data types, your source data, vector data, metadata and generated data right alongside your live operational data, outdating the need for multiple database systems and complex back-end architectures.

Second, MongoDB offers a high performance and scalable architecture. As the latency of LLMs improve, the value of using real-time operational data for AI apps will become even more important.

Third, we are seamlessly integrated with leading app development frameworks and AI platforms, enabling developers to incorporate MongoDB into their existing workflows while having the flexibility to choose the LLM and other specific tools that best suit their needs.

Fourth, we meet or exceed the security and compliance requirements expected from an enterprise database, including enterprise-grade encryption, authorization and auditability.

Lastly, customers can run MongoDB anywhere, on-premise or as a fully managed service in 1 of the 118 global cloud regions across 3 hyperscalers giving them the flexibility to run workloads to meet — to best meet their application use cases and business needs…

… As the performance of these LLMs and latency of these LLMs increase, accessing real-time data becomes really important like, say, you’re calling and talking to a customer support chatbot, that you want that chatbot to have up-to-date information about that customer so that they can provide the most relevant and accurate information possible…

…I think it’s a quickly evolving space, but we feel very good about our positioning for AI, even though it’s still very early days.

MongoDB’s management sees 3 ways AI can accelerate MongoDB’s business over time: (1) AI will drive the cost of building applications, as all past platform shifts have done, thus leading to more apps and higher demand for databases, (2) MongoDB can be the database of choice for developers building AI applications (see Point 9 on MongoDB’s new AI Applications Program), and (3) MongoDB can help customers modernise their application estate (see Point 10 for more on this opportunity)

We see 3 main opportunities where we believe AI will accelerate our business over time. The first is that the cost of building applications in the world of AI will come down as we’ve seen with every previous platform shift, creating more applications and more data requiring more databases. The second opportunity is for us to be the database of choice for customers building greenfield AI applications…

…The third opportunity is to help customers modernize their legacy application estate. 

MongoDB’s management made the MongoDB AI Applications Program (MAAP) generally available in July 2024; MAAP brings the cloud computing hyperscalers and prominent AI model-building startups into one ecosystem to reduce the complexity and difficulty for MongoDB’s customers when they build new AI applications 

While we see that there’s tremendous amount of interest in and planning for new AI-powered applications, the complexity and fast-moving nature of the AI ecosystem slows customers down. That’s why we launched the MongoDB AI Applications Program, or MAAP, which became generally available to customers last month. MAAP brings together a unique ecosystem, including the 3 major cloud providers, AWS, Azure and GCP as well as Accenture and AI pioneers like Anthropic and Cohere. MAAP offers customers reference architectures and end-to-end technology stack that includes prebuilt integrations, professional services and a unified support system to help customers quickly build and deploy AI applications.

Modernising legacy application estates is a big opportunity, as most of the $80 billion database market is still in legacy relational databases; MongoDB has the Relational Migrator product to help customers migrate from legacy relational databases to the company’s document-model database; management thinks AI can significantly improve the process of modernising legacy applications by helping with understanding legacy code and rewriting them as modern versions; MongoDB launched a few pilots with customers earlier in 2024 to modernise their legacy applications with the help of AI and the results are exciting; the CIO (Chief Information Officer) of an insurance company in the pilots said the modernisation process was the first tangible return he had seen in his AI investments; management thinks it will take time for the modernisation program to contribute meaningful revenue to MongoDB, but they are excited 

Most of the existing $80-billion-plus database industry is built on dated relational architecture. Modernizing legacy applications has always been part of our business, and we have taken steps over the years to simplify and demystify this complex process through partnerships, education and most recently, our Relational Migrator product. AI offers a potential step function improvement, lowering the cost and reducing their time and risk to modernize legacy applications…

…Earlier this year, we launched several pilots with our customers where we work with them to modernize mission-critical applications, leveraging both AI tooling and services. The early results from these pilots are very exciting as our customers are experiencing significant reductions in time and cost of modernization. In particular, we have seen dramatic improvements in time and cost to rewrite application code and generate test suites. We see increasing interest from customers that want to modernize their legacy application estate, including large enterprise customers. As a CIO of one of the world’s largest insurance companies said about our pilot, this is the first tangible return he’s seen on his AI investments. While it’s still early days and generating meaningful revenue from this program will take time, we are excited about the results of our pilots and the growing pipeline of customers eager to modernize their legacy estate…

…Since day one, since our IPO, we’ve been getting customers to migrate off relational to MongoDB. But one of the biggest friction points has been that while it’s easy to move the data, you can map the schema from a relational schema to a document schema and you can automate that, the biggest stumbling block is that the customer has to or some third party has to rewrite the application, which, by definition, creates more costs, more time and in some cases, more risk especially for older apps, where the development teams who built those apps no longer exist. So what’s been compelling about AI is that AI has finally created a shortcut to overcome that big hurdle. And so essentially, you can start basically diagnosing the code, understand the code, recreate a modern version of that code and generate test suites to make sure the new code performs like the old code. So that definitely gets people’s interest because now, all of a sudden, what may take years or multiyears, you can do in a lot less time. And the pilots that we have done, the time and cost savings have been very, very compelling.

That being said, we’re in the very early days. There’s a lot of interest. We have a growing pipeline of customers across, frankly, all parts of the world from North America to EMEA and even the Pac Rim. And so we’re quite excited about the opportunity. But again, I would say it’s very early days.

Delivery Hero, a leading local delivery platform, is using MongoDB Atlas Vector Search to provide AI-powered hypersonalised results to users; Delivery Hero found that MongoDB Atlas Vector Search helped it build solutions for less cost than alternative technologies

Delivery Hero, a long-time MongoDB Atlas customer is the world’s leading local delivery platform, operating in 70-plus countries across 4 continents. Their quick commerce service enables customers to select fresh produce for delivery from local grocery stores. Approximately 10% of the inventory is fast-moving perishable produce that can go quickly out of stock. The company risks losing revenue and increasing customer churn if the customer didn’t have viable alternatives to their first choice. To address these risks, they are now using state-of-the-art AI models and MongoDB Atlas Vector Search to give hyperpersonalized alternatives to customers in real time if items they want to order are out of stock. With the introduction of MongoDB Atlas Vector Search, the data science team recognized that they could build a highly performant, real-time solution more quickly and for less cost than alternative technologies. 

MongoDB’s management believes that general-purpose LLMs (large language models) will win and will use RAG (retrieval augmented generation) as the primary way to combine generally available data to proprietary data; management is seeing advanced RAG use-cases in answering complex questions

There are some questions about LLMs, whether a general-purpose LLM or a fine-tune LLM, what the trade-offs are. Our belief is that given the performance of LLMs, you’re going to see the general purpose LLMs probably win and will use RAG as the predominant approach to marry generally available data with proprietary data. And then you are starting to see things like advanced RAG use cases where you get much more sophisticated ways to ask complex questions, provide more accurate and detailed answers and better adapt to different types of information and queries.

MongoDB’s management is seeing most AI workloads happen in the cloud, but they also see a lot of customers using open-source LLMs and running those workloads locally

We predominantly see most of the AI workloads in the cloud, but there are definitely lots of customers who are looking at using open source LLMs, in particular, things like Llama, and running those workloads locally.

MongoDB’s management believes MongoDB wins against Postgres for winning AI workloads because MongoDB can handle complex data types whereas Postgres, which is a relational – or SQL – database, struggles

MongoDB is designed to handle these different data structures. And I talked about we can help unify metadata, operational data, vector data and generate it all in one platform. Relational databases, and Postgres is one of them, have limitations in terms of what they can — how they can handle different types of data. In fact, when the data gets too large, these relational databases have to do what’s called off-row storage. And it becomes — it creates a performance overhead on these relational platforms. Postgres has this thing called TOAST, which is — stands for The Oversized-Attribute Storage Technique. And it’s basically a way to handle these different data types, but it creates a massive performance overhead. So we believe that we are architecturally far better for these more complex AI workloads than relational databases.

MongoDB’s management is seeing growing adoption of Vector, and Vector is helping attract new customers to MongoDB; an existing Atlas customer, a financial news organisation, migrated from Elastic Search to Atlas Search in order to use MongoDB’s Vector Search capabilities; an European energy company is using Vector Search for a geospatial search application

On Vector, we’re continuing to see growth in adoption, and we see Vector is effective in attracting new customers to the MongoDB platform. A world-renowned financial news organization, which is already running in Atlas, migrated from Elasticsearch to Atlas Search using Search Nodes to take advantage of our Vector Search capabilities to build a site search that combines lexical search with semantic search to find the most relevant articles for user query. And a European energy company built a geospatial search application using Atlas Search and Vector search and the app was built on-prem but — and to clouds to vectorize geospatial data and facilitate research and discovery.

MongoDB’s management is seeing MongoDB’s customers improve their software development productivity with the help of AI, but the rate of improvement is all over the place

[Question] We’ve talked before in the past that AI is just driving a lot of new code, making developers significantly more productive. Have you seen that behavior in any of your existing customers on Atlas where maybe their utilization rate goes up or the number of applications built per customer goes up?

[Answer] A common question I ask our customers when I meet with them in terms of what code generation tools that they’re using and what benefits they’re gaining. The answers tend to be a little bit all over the map. Some people see 10%, 15% productivity improvement. Some people say 20%, 25% productivity improvement. Some people say it helps my senior developers be more productive. Some people say it helps my junior developers become more like senior developers. So the answers tend to be all over the map.

Nvidia (NASDAQ: NVDA)

Nvidia’s Data Center revenue had incredibly strong growth in 2024 Q2, driven by demand for the Hopper GPU computing platform; compute revenue was up by 2.5x while networking revenue was up by 2x

Data Center revenue of $26.3 billion was a record, up 16% sequentially and up 154% year-on-year, driven by strong demand for NVIDIA Hopper, GPU computing and our networking platforms. Compute revenue grew more than 2.5x. Networking revenue grew more than 2x from the last year.

Even as Nvidia is getting ready to launch its Blackwell-architecture GPUs, customers are still buying the Hopper-architecture GPUs; the H200 platform, based on the Hopper architecture, started ramping in 2024 Q2 and offers 40% more memory bandwidth than the H100; management thinks that the reasons why the Hopper-architecture chips still enjoy strong demand despite the imminent arrival of the Blackwell-architecture chips are (1) AI companies need chips today to process data right now, and (2) AI companies are in a race to build the best model and they’re all racing to be the first

Customers continue to accelerate their Hopper architecture purchases while gearing up to adopt Blackwell…

…NVIDIA H200 platform began ramping in Q2, shipping to large CSPs, consumer Internet and enterprise companies. The NVIDIA H200 builds upon the strength of our Hopper architecture and offering over 40% more memory bandwidth compared to the H100…

…The demand for Hopper is really strong. And it’s true, the demand for Blackwell is incredible. There’s a couple of reasons for that. The first reason is, if you just look at the world’s cloud service providers and the amount of GPU capacity they have available, it’s basically none…

…A generative AI company spends the vast majority of their invested capital into infrastructure so that they could use an AI to help them create products. And so these companies need it now. They just simply can’t afford — you just raise money, they want you to put it to use now. You have processing that you have to do. You can’t do it next year. You got to do it today. And so that’s one reason. The second reason for Hopper demand right now is because of the race to the next plateau. The first person to the next plateau gets to introduce some revolutionary level of AI. The second person who gets there is incrementally better or about the same. And so the ability to systematically and consistently race to the next plateau and be the first one there is how you establish leadership…

…We believe our Hopper will continue to grow into the second half. We have many new products for Hopper or existing products for Hopper that we believe will start continuing to ramp in the next quarters, including our Q3 and those new products moving to Q4. So let’s say, Hopper, therefore, versus H1 is a growth opportunity for that. 

Nvidia’s management thinks that the next generation of AI models will need 10-20 times more compute to train

Next-generation models will require 10 to 20x more compute to train with significantly more data. The trend is expected to continue.

Nvidia’s management sees inferencing accounting for 40% of Data Center revenue over the last 4 quarters (was 40% as of 2024 Q1)

Over the trailing 4 quarters, we estimate that inference drove more than 40% of our Data Center revenue.

Nvidia’s management is seeing demand coming from builders of frontier AI models, consumer Internet companies, and companies building generative AI applications for a wide range of use cases

Demand for NVIDIA is coming from frontier model makers, consumer Internet services, and tens of thousands of companies and start-ups building generative AI applications for consumers, advertising, education, enterprise and health care, and robotics. 

Nvidia’s Data Center revenue in China grew sequentially in 2024 Q2, but still remains below the level seen prior to export controls; management expects tough competition in China

Our Data Center revenue in China grew sequentially in Q2 and a significant contributor to our Data Center revenue. As a percentage of total Data Center revenue, it remains below levels seen prior to the imposition of export controls. We continue to expect the China market to be very competitive going forward.

Nvidia has leadership in inference

The latest round of MLPerf inference benchmarks highlighted NVIDIA’s inference leadership, with both NVIDIA Hopper and Blackwell platforms combining to win gold medals on all tests.

Nvidia’s Blackwell family of chips combines GPUs, CPUs, DPUs (data processing units), NVLink, and networking; the GB200 NVL72 system in the Blackwell family links up 72 GPUs to act as 1 GPU and is up to 30 times faster for LLM (large language model) inference workloads; Nvidia has made a change to the Blackwell architecture to improve production yields; Blackwell’s production is expected to ramp in the fourth quarter of 2024; management sees demand for Blackwell exceeding supply by a wide margin up to 2025; there are more than 100 different Blackwell architecture systems; Nvidia’s Blackwell systems come in both air-cooled and liquid-cooled flavours; management expects Nvidia’s Data Center business to grow significantly in 2025 and 2026, powered by the Blackwell system; management sees Blackwell as a step-function improvement over Hopper that delivers 3-5 times more AI throughput than Hopper; Blackwell required 7 one-of-a-kind chips to build; Nvidia designed and optimised the Blackwell system end-to-end

The NVIDIA GB200 NVL72 system with the fifth-generation NVLink enables all 72 GPUs to act as a single GPU and deliver up to 30x faster inference for LLM’s workloads and unlocking the ability to run trillion-parameter models in real time…

…We executed a change to the Blackwell GPU mass to improve production yields. Blackwell production ramp is scheduled to begin in the fourth quarter and continue into fiscal year ’26. In Q4, we expect to get several billion dollars in Blackwell revenue…

Demand for Blackwell platforms is well above supply, and we expect this to continue into next year…

…There are something like 100 different types of Blackwell-based systems that are built that were shown at Computex, and we’re enabling our ecosystem to start sampling those…

…We offer multiple configurations of Blackwell. Blackwell comes in either a Blackwell classic, if you will, that uses the HGX form factor that we pioneered with Volta. I think it was Volta. And so we’ve been shipping the HGX form factor for some time. It is air cooled. The Grace Blackwell is liquid cooled…

…We expect to grow our Data Center business quite significantly next year. Blackwell is going to be a complete game changer for the industry. And Blackwell is going to carry into the following year…

…Blackwall is a step-function leap over Hopper. Blackwell is an AI infrastructure platform, not just the GPU. It also happens to be the name of our GPU, but it’s an AI infrastructure platform. As we reveal more of Blackwell and sample systems to our partners and customers, the extent of Blackwell’s lead becomes clear. The Blackwell vision took nearly 5 years and 7 one-of-a-kind chips to realize: the Gray CPU, the Blackwell dual GPU and a colos package, ConnectX DPU for East-West traffic, BlueField DPU for North-South and storage traffic, NVLink switch for all-to-all GPU communications, and Quantum and Spectrum-X for both InfiniBand and Ethernet can support the massive burst traffic of AI. Blackwell AI factories are building size computers. NVIDIA designed and optimized the Blackwell platform full stack, end-to-end, from chips, systems, networking, even structured cables, power and cooling and mounts of software to make it fast for customers to build AI factories. These are very capital-intensive infrastructures. Customers want to deploy it as soon as they get their hands on the equipment and deliver the best performance and TCO. Blackwell provides 3 to 5x more AI throughput in a power-limited data center than Hopper…

…The Blackwell system lets us connect 144 GPUs in 72 GB200 packages into 1 NVLink domain, with an aggregate NVLink bandwidth of 259 terabytes per second in 1 rack. Just to put that in perspective, that’s about 10x higher than Hopper.  

Nvidia’s Ethernet for AI revenue doubled sequentially; management sees Nvidia’s ethernet product, Spectrum-X, enjoying wide support from the AI ecosystem; Spectrum-X performs 1.6 times better than traditional Ethernet; management plans to launch new Spectrum-X products every year and thinks that Spectrum-X will soon become a multi-billion dollar product

Our Ethernet for AI revenue, which includes our Spectrum-X end-to-end Ethernet platform, doubled sequentially with hundreds of customers adopting our Ethernet offerings. Spectrum-X has broad market support from OEM and ODM partners and is being adopted by CSPs, GPU cloud providers and enterprises, including xAI to connect the largest GPU compute cluster in the world. Spectrum-X supercharges Ethernet for AI processing and delivers 1.6x the performance of traditional Ethernet. We plan to launch new Spectrum-X products every year to support demand for scaling compute clusters from tens of thousands of GPUs today to millions of DPUs in the near future. Spectrum-X is well on track to begin a multibillion-dollar product line within a year.

Japan’s government is working with Nvidia to build an AI supercomputer; Nvidia’s management thinks sovereign AI revenue will be in the low-teens billion-range this year; management is seeing countries want to build their own generative AI that incorporates their own language, culture, and data

Japan’s National Institute of Advanced Industrial Science and Technology is building its AI Bridging Cloud Infrastructure 3.0 supercomputer with NVIDIA. We believe sovereign AI revenue will reach low double-digit billions this year…

…It certainly is a unique and growing opportunity, something that surfaced with generative AI and the desires of countries around the world to have their own generative AI that would be able to incorporate their own language, incorporate their own culture, incorporate their own data in that country.

Most of the Fortune 100 companies are working with Nvidia on AI projects

We are working with most of the Fortune 100 companies on AI initiatives across industries and geographies. 

Nvidia’s management is seeing a range of applications driving the company’s growth; these applications include (1) Amdocs’ smart agent which is reducing customer service costs by 30%, and (2) Wistron’s usage of Nvidia AI Ominiverse to reduce cycle times in its factories by 50%

A range of applications are fueling our growth, including AI-powered chatbots, generative AI copilots and agents to build new, monetizable business applications and enhance employee productivity. Amdocs is using NVIDIA generative AI for their smart agent, transforming the customer experience and reducing customer service costs by 30%. ServiceNow is using NVIDIA for its Now Assist offering, the fastest-growing new product in the company’s history. SAP is using NVIDIA to build Joule copilot. Cohesity is using NVIDIA to build their generative AI agent and lower generative AI development costs. Snowflake, who serves over 3 billion queries a day for over 10,000 enterprise customers, is working with NVIDIA to build copilots. And lastly, Wistron is using NVIDIA AI Omniverse to reduce end-to-end cycle times for their factories by 50%.

Every automobile company that is developing autonomous vehicle technology is working with Nvidia; management thinks that automotive will account for multi-billions in revenue for Nvidia; Nvidia won the Autonomous Brand Challenge at the recent Computer Vision and Pattern Recognition Conference

Every automaker developing autonomous vehicle technology is using NVIDIA in their data centers. Automotive will drive multibillion dollars in revenue across on-prem and cloud consumption and will grow as next-generation AV models require significantly more compute…

…At the Computer Vision and Pattern Recognition Conference, NVIDIA won the Autonomous Brand Challenge in the end-to-end driving at scale category, outperforming more than 400 entries worldwide. 

Nvidia’s management announced Nvidia AI Foundry – a platform for building custom AI models – in 2024 Q2; users of Nvidia AI Foundry are able to customise Meta’s Llama 3.1 foundation AI model; Nvidia AI Foundry is the first platform where users are able to customise an open-source, frontier-level foundation AI model; Accenture is already using Nvidia AI Foundry 

During the quarter, we announced a new NVIDIA AI foundry service to supercharge generative AI for the world’s enterprises with Meta’s Llama 3.1 collection of models… 

…Companies for the first time can leverage the capabilities of an open source, frontier-level model to develop customized AI applications to encode their institutional knowledge into an AI flywheel to automate and accelerate their business. Accenture is the first to adopt the new service to build custom Llama 3.1 models for both its own use and to assist clients seeking to deploy generative AI applications.

Companies from many industries are using NIMs (Nvidia inference microservices) for deployment of generative AI; AT&T saw 70% cost savings and 8 times latency reduction with NIM; there are 150 organisations using NIMs; Nvidia recently announced NIM Agent Blueprints, a catalog of reference AI applications; Nvidia is using NIMs to open the Nvidia Omniverse to new industries

NVIDIA NIMs accelerate and simplify model deployment. Companies across health care, energy, financial services, retail, transportation, and telecommunications are adopting NIMs, including Aramco, Lowes, and Uber. AT&T realized 70% cost savings and 8x latency reduction affter moving into NIMs for generative AI, call transcription and classification. Over 150 partners are embedding NIMs across every layer of the AI ecosystem. 

We announced NIM Agent Blueprints, a catalog of customizable reference applications that include a full suite of software for building and deploying enterprise generative AI applications. With NIM Agent Blueprints, enterprises can refine their AI applications over time, creating a data-driven AI flywheel. The first NIM Agent Blueprints include workloads for customer service, computer-aided drug discovery, and enterprise retrieval augmented generation. Our system integrators, technology solution providers, and system builders are bringing NVIDIA NIM Agent Blueprints to enterprises…

…We announced new NVIDIA USD NIMs and connectors to open Omniverse to new industries and enable developers to incorporate generative AI copilots and agents into USD workloads, accelerating our ability to build highly accurate virtual worlds.

Nvidia’s AI Enterprise software platform is powering Nvidia’s software-related business to approach a $2 billion annual revenue run-rate by the end of this year; management thinks Nvidia AI Enterprise represents great value for customers by providing GPUs at a price of $4,500 per GPU per year; management thinks the TAM (total addressable market) for Nvidia’s AI software business can be significant

NVIDIA NIM and NIM Agent Blueprints are available through the NVIDIA AI Enterprise software platform, which has great momentum. We expect our software, SaaS and support revenue to approach a $2 billion annual run rate exiting this year, with NVIDIA AI Enterprise notably contributing to growth…

…At $4,500 per GPU per year, NVIDIA AI Enterprise is an exceptional value for deploying AI anywhere. And for NVIDIA’s software TAM, it can be significant as the CUDA-compatible GPU installed base grows from millions to tens of millions. 

Computers that contain Nvidia’s RTX chip can deliver up to 1,300 AI TOPS (tera operations per second); there are more than 200 RTX AI computer models from computer manufacturers; there is an installed base of 100 million RTX AI computers; a game called Mecha BREAK is the first game to use Nvidia ACE, a generative AI service for creating digital humans

Every PC with RTX is an AI PC. RTX PCs can deliver up to 1,300 AI tops and there are now over 200 RTX AI laptops designed from leading PC manufacturers. With 600 AI-powered applications and games and an installed base of 100 million devices, RTX is set to revolutionize consumer experiences with generative AI. NVIDIA ACE, a suite of generative AI technologies is available for RTX AI PCs. Mecha BREAK is the first game to use NVIDIA ACE, including our small language model, Nemotron-4 4B, optimized on device inference. 

Foxconn, the largest electronics manufacturer in the world, and Mercedes-Benz, the well-known auto manufacturer, are using Nvidia Omniverse to produce digital twins of their manufacturing plants

The world’s largest electronics manufacturer, Foxconn, is using NVIDIA Omniverse to power digital twins of the physical plants that produce NVIDIA Blackwell systems. And several large global enterprises, including Mercedes-Benz, signed multiyear contracts for NVIDIA Omniverse Cloud to build industrial digital twins of factories.

Many robotics companies are using Nvidia’s AI robot software

Boston Dynamics, BYD Electronics, Figure, Intrinsyc, Siemens, Skilled AI and Teradyne Robotics are using the NVIDIA Isaac robotics platform for autonomous robot arms, humanoids and mobile robots.

Nvidia’s management is seeing some customers save up to 90% in computing costs by transitioning from genera-purpose computing (CPUs) to accelerated computing (GPUs)

We know that accelerated computing, of course, speeds up applications. It also enables you to do computing at a much larger scale, for example, scientific simulations or database processing. But what that translates directly to is lower cost and lower energy consumed. And in fact, this week, there’s a blog that came out that talked about a whole bunch of new libraries that we offer. And that’s really the core of the first platform transition, going from general-purpose computing to accelerated computing. And it’s not unusual to see someone save 90% of their computing cost. And the reason for that is, of course, you just sped up an application 50x. You would expect the computing cost to decline quite significantly.

Nvidia’s management believes that generative AI is a new way to write software and is changing how every layer of computing is done

Generative AI, taking a step back about why it is that we went so deeply into it, is because it’s not just a feature, it’s not just a capability, it’s a fundamental new way of doing software. Instead of human-engineered algorithms, we now have data. We tell the AI, we tell the model, we tell the computer what are the expected answers, what are our previous observations, and then for it to figure out what the algorithm is, what’s the function. It learns a universal — AI is a bit of a universal function approximator and it learns the function. And so you could learn the function of almost anything, and anything that you have that’s predictable, anything that has structure, anything that you have previous examples of. And so now here we are with generative AI. It’s a fundamental new form of computer science. It’s affecting how every layer of computing is done from CPU to GPU, from human-engineered algorithms to machine-learned algorithms, and the type of applications you could now develop and produce is fundamentally remarkable.

Nvidia’s management thinks AI models are still seeing the benefits of scaling

There are several things that are happening in generative AI. So the first thing that’s happening is the frontier models are growing in quite substantial scale. And we’re still all seeing the benefits of scaling.

The amount of compute needed to train an AI model goes up much faster than the size of the model; management thinks the next generation of AI models could require 10-40 times more compute 

Whenever you double the size of a model, you also have to more than double the size of the data set to go train it. And so the amount of flops necessary in order to create that model goes up quadratically. And so it’s not unexpected to see that the next-generation models could take 10x, 20x, 40x more compute than last generation.

Nvidia’s management is seeing more frontier model makers in 2024 than in 2023

Surprisingly, there are more frontier model makers than last year.

Nvidia’s management is seeing advertising-related computing needs shifting from being powered by CPUs to being powered by GPUs and generative AI

The largest systems, largest computing systems in the world today, and you’ve heard me talk about this in the past, which are recommender systems moving from CPUs. It’s now moving from CPUs to generative AI. So recommender systems, ad generation, custom ad generation targeting ads at very large scale and quite hyper-targeting, search and user-generated content, these are all very large-scale applications that have now evolved to generative AI.

Nvidia’s management is seeing generative AI startups generating tens of billions of revenue-opportunities for cloud computing providers

The number of generative AI start-ups is generating tens of billions of dollars of cloud renting opportunities for our cloud partners

Nvidia’s management is seeing that cloud computing providers have zero GPU capacity available because they are using it for internal workloads (such as accelerating data processing) and renting it out to model makers and other AI startups

If you just look at the world’s cloud service providers and the amount of GPU capacity they have available, it’s basically none. And the reason for that is because they’re either being deployed internally for accelerating their own workloads, data processing, for example…

…The second is, of course, the rentals. They’re renting capacity to model makers. They’re renting it to start-up companies. 

Nvidia’s management thinks Nvidia’s GPUs are the only AI GPUs that process and accelerate data; before the advent of generative AI, the number one use case of Nvidia’s GPUs was to accelerate data processing

NVIDIA’s GPUs are the only accelerators on the planet that process and accelerate data. SQL data, Panda’s data, data science toolkits like Panda’s, and the new one, Polar’s, these are the ones that are the most popular data processing platforms in the world, and aside from CPUs which, as I’ve mentioned before, are really running out of steam, NVIDIA’s accelerated computing is really the only way to get boosting performance out of that. And so the #1 use case long before generative AI came along is the migration of applications one after another to accelerated computing.

Nvidia’s management thinks that those who purchase Nvidia AI chips are getting immediate ROI (return on investment) for a few reasons: (1) GPUs are a better way to build data centers compared to CPUs because GPUs save money on data processing compared to CPUs, (2) cloud computing providers who rent out GPUs are able to rent out their GPUs the moment they are built up in the data center because there are many generative AI companies clamouring for the chips, and (3) generative AI improves a company’s own services, which delivers a fast ROI

The people who are investing in NVIDIA infrastructure are getting returns on it right away. It’s the best ROI infrastructure, computing infrastructure investment you can make today. And so one way to think through it, probably the easiest way to think through it is just to go back to first principles. You have $1 trillion worth of general-purpose computing infrastructure. And the question is, do you want to build more of that or not?

And for every $1 billion worth of Juniper CPU-based infrastructure that you stand up, you probably rent it for less than $1 billion. And so because it’s commoditized, there’s already $1 trillion on the ground. What’s the point of getting more? And so the people who are clamoring to get this infrastructure, one, when they build out Hopper-based infrastructure and soon, Blackwell-based infrastructure, they start saving money. That’s tremendous return on investment. And the reason why they start saving money is because data processing saves money, and data processing is probably just a giant part of it already. And so recommender systems save money, so on and so forth, okay? And so you start saving money.

The second thing is everything you stand up are going to get rented because so many companies are being founded to create generative AI. And so your capacity gets rented right away and the return on investment of that is really good.

And then the third reason is your own business. Do you want to either create the next frontier yourself or your own Internet services, benefit from a next-generation ad system or a next-generation recommender system or a next-generation search system? So for your own services, for your own stores, for your own user-generated content, social media platforms, for your own services, generative AI is also a fast ROI.

Nvidia’s management is seeing a significant number of data centers wanting liquid-cooled GPU systems because the use of liquid cooling enables 3-5 times more AI throughput compared to the past, resulting in cheaper TCO (total cost of ownership)

The number of data centers that want to go to liquid cooled is quite significant. And the reason for that is because we can, in a liquid-cooled data center, in any power-limited data center, whatever size of data center you choose, you could install and deploy anywhere from 3 to 5x the AI throughput compared to the past. And so liquid cooling is cheaper. Our TCO is better, and liquid cooling allows you to have the benefit of this capability we call NVLink, which allows us to expand it to 72 Grace Blackwell packages, which has essentially 144 GPUs.

Nvidia does not do the full integration of its GPU systems into a data center because it is not the company’s area of expertise

Our customers hate that we do integration. The supply chain hates us doing integration. They want to do the integration. That’s their value-add. There’s a final design-in, if you will. It’s not quite as simple as shimmying into a data center, but the design fit-in is really complicated. And so the design fit-in, the installation, the bring-up, the repair-and-replace, that entire cycle is done all over the world. And we have a sprawling network of ODM and OEM partners that does this incredibly well.

Nvidia has released many new libraries for CUDA, across a wide variety of use cases, for AI software developers to work with

Accelerated computing starts with CUDA-X libraries. New libraries open new markets for NVIDIA. We released many new libraries, including CUDA-X Accelerated Polars, Pandas and Spark, the leading data science and data processing libraries; CUVI-S for vector databases, this is incredibly hot right now; Ariel and Ciona for 5G wireless base station, a whole world of data centers that we can go into now; Parabricks for gene sequencing and AlphaFold2 for protein structure prediction is now CUDA accelerated.

Nvidia now has 3 networking platforms for GPUs

We now have 3 networking platforms, NVLink for GPU scale-up, Quantum InfiniBand for supercomputing and dedicated AI factories, and Spectrum-X for AI on Ethernet. NVIDIA’s networking footprint is much bigger than before. 

Salesforce (NYSE: CRM)

Agentforce is a new architecture and product that management believes will be fundamental to Salesforce’s AI leadership in the next decade; Salseforce will be introducing Agentforce in its upcoming Dreamforce customer-event; Agentforce is an autonomous AI and management will be getting every attendee at Dreamforce to turn on their own AI agents; Salesforce is already building agents for the company, Workday, and Workday will be Salseforce’s first Agentforce partner; Agentforce allows companies to build custom agents for sales, service, marketing, and commerce; management believes that within a year, most companies will be deploying autonomous AI agents at scale, and these agents will have a big positive impact on companies’ operations; Agentforce is currently management’s singular focus; many companies are already using Agentforce, including one of the world’s largest healthcare companies, which is resolving more than 90% of patient inquiries with Agentforce, and thinks Agentforce is much better than any other competing AI platform; a very large media company is using Agentforce to resolve 90% of employee and consumer issues; management thinks Salesforce is the first company to deploy high-quality enterprise AI agents at scale; Agentforce is not a co-pilot, it is an autonomous agent that is accurate and can be deployed right out of the box; users of Agentforce can do advanced planning and reasoning with minimal input; management sees Agentforce as being a trusted colleague that will complement human users; management sees thousands of companies using Agentforce by January 2025; early trials of Agentforce has showed remarkable success

We’re going to talk about a whole different kind of sales force today, a different kind of architecture and a product that we didn’t even talk about on the last earnings call that is going to be fundamental to our future and a manifestation of our decade of AI leadership, which is Agentforce. Now in just a few weeks, we’re going to kick off Dreamforce, and I hope all of you are planning to be there, the largest AI event in the world with more than 45,000 trailblazers in San Francisco. And this year, Dreamforce is really becoming Agentforce…

…We’re going to show our new Agentforce agents and how we’ve reimagined enterprise software for this new world of autonomous AI. And every customer, I’m going to try to get every customer who comes to Dreamforce to turn agents on while they’re there…

…This idea that you’re not just going to have sales agents and service agents who probably read, heard maybe you saw in CBC, we’re building the agents for Workday and we’re going to be building custom agents for so many of you as well with Agentforce, because it is a development platform as well as this incredible capability to radically extend your sales and service organizations.

So when you arrive at the Dreamforce campus, you’re going to see a big sign outside that says, humans with agents drive customer success together. And that’s because we now so strongly believe the future isn’t about having a sales force or a service force or a marketing force or a commerce force or an analytics force. The future is about also having an Agentforce. And while many customers today don’t yet have agent forces, but they do have sales forces or service forces, I assure you that within a year, we’re all going to have agent forces, and we’re going to have them at scale. And it’s going to radically extend our companies and it’s going to augment our employees, make us more productive. It’s going to turn us into these incredible margin and revenue machines. It’s going to be pretty awesome…

…with this Agentforce platform, we’re making it easy to build these powerful autonomous agents for sales, for service, for marketing, for commerce, automating the entire workflow on their own, embedding agents in the flow of work and getting our customers to the agent future first. And this is our primary goal of our company right now. This is my singular focus…

…We’re going to talk about the customers who have it, customers like OpenTable and Wiley and — and ADP and RBC and so many others who are deploying these agents and running them on top of our Data Cloud and our apps…

…At Dreamforce, you’re going to hear one of the very largest health care companies in the world. It’s got 20 million consumers here in the United States who is resolving more than 90% of all patient inquiries with Agentforce and they’re benchmarking us significantly higher than any other competing AI platform, and that’s based on some incredible AI breakthroughs that we have had at Salesforce…

…One of these very large media companies that we work with, a lot of probably know who have everything, every possible media asset, while they’re just resolving 90% of all of their employee and consumer issues with Agentforce, pretty awesome. So there’s nothing more transformational than agents on the technology horizon that I can see and Salesforce is going to be the first company at scale to deploy enterprise agents and not just any enterprise agents, the highest quality, most accurate agents in the world…

…We’re seeing that breakthrough occur because with our new Agentforce platform, we’re going to make a quantum leap for in AI, and that’s why it wants you all at Dreamforce, because I want you to have your hands on this technology to really understand this. This is not co-pilots…

…These agents are autonomous. They’re able to act with accuracy. They’re able to come right out of the box. They’re able to go right out of the platform…

…These agents don’t require a conversational prompt to take action. You can do advanced planning, reasoning with minimal human input. And the example of this incredible health care company, you’re going to be able to say to the agent, “Hey, I want to look at my labs, I want to understand this. It looks like I need repeat labs. Can you reschedule those for me? It looks like I need to see my doctor, can you schedule that for me? I also want to get an MRI, I want to get this.” And the level of automation that we’re going to be able to provide and unleash the productivity back into these organizations is awesome…

…This is going to be like having these trusted colleagues can handle these time-consuming tasks engaging with these — whether it’s inbound lead or resolving this customer, patient inquiry or whatever it is, this is humans with agents driving customer success together and Agentforce agents can be set up in minutes, easily scalable, work around with the block, any language. And by the beginning of next fiscal year, we will have thousands of customers using this platform. And we will have hand help them to make it successful for them, deploy it. The early trials have been remarkable to see these customers have the success, it has been just awesome…

…We’re just at the beginning of building an Agentforce ecosystem with companies able to build agents on our platform for their workforce and use cases, and we’re excited to have Workday as our first agent force partner.

Salesforce has been able to significantly reduce hallucinations with its AI products, and thus deliver highly accurate results, through the use of new retrieval augmented generation (RAG) techniques

The accuracy of our results, the reduction of hallucinations and the level of capability of AI is unlike anything I think that any of us have ever seen, and we’ve got some incredible new techniques, especially incredible new augmented RAG techniques that are delivering us the capability to deliver this accuracy with our — for our customers.

Salesforce’s management still sees the company as the No.1 AI CRM in the world

Of course, Salesforce is the #1 AI CRM.

 In 2024 Q2, Einstein delivered 25 trillion transactions and 1 trillion workflows; Wyndham is using Einstein to reduce average call times to free up time for service agents for higher-value work

We’re just operating at this incredible scale, delivering 25 trillion Einstein transactions across all of the clouds during the quarter, that’s 25 trillion and more than 1 trillion workflows…

…MuleSoft allows Wyndham to unlock business-critical data from various platforms and onboard future franchisees faster. And with Einstein generated recommended service replies, average call times have been reduced and service agents can focus on higher priority work

Salesforce’s management thinks many of the company’s customers have a misconception about AI in that they need to build and train their own AI models; management is able to use Salesforce’s AI models and resolve issues much better than its customers’ own models; management thinks Salesforce’s AI models have the highest efficacy

I think that there’s a lot of misconceptions about AI with my customers. I have been out there very disappointed with the huge amount of money that so many of these customers have wasted on AI. They are trying to DIY their AI…

…This idea that our customers are going to have to build their own models, train their own models, retrain their own models, retrain them again and I’m meeting with these customers, and they’re so excited when they and they say, “Oh, I built this model, and we’re resolving 10%, 20%, 30%, 40% and — of this or that and whatever. ” And I’m like, really, we’ll take a look at our models and our capability where you don’t have to train or retrain anything and you’re going to get more than 90%. And then they say, wait a minute, how do you do that? And this is a moment where every customer needs to realize you don’t need the DIY your AI. You can use a platform like Salesforce to get the highest efficacy of artificial intelligence, the best capability to fully automate your company, achieve all of your goals and you can do it with professional enterprise software…

…We’re in met with one of the largest CIOs in the world is telling me how excited he was for the B2C part of this business, he built this model and accuracy rates, than I was like, really, let me show you what we’re doing here. And then he said to me, why am I doing this? Why am I not just using your platform? And I said good question. So these customers are spending millions of dollars, but are they really getting the results that they want? It feels like this early days of cloud. It’s just early days of social mobile. Customers feel like they have to DIY it, they don’t need to, they can make it all happen themselves. And you can just see that to deliver this very high-quality capability they can use a deeply integrated platform like Salesforce.

Salesforce’s management is seeing the company’s customers get immediate ROI (return on investment) from deploying AI automation solutions because the solutions can be easily and quickly customised and configured

We’ve created out-of-the-box platform to deliver all of this for them. So this could be service reply recommendations, account summaries, report generation, you’ve seen in Slack, this kind of auto summarization, recaps, all of these amazing things, the level of automation, the amount of code that our team has written, the transformation of our platform in the last 18 months, it’s remarkable. And customers love it because they can take the platform and then all of this generative AI use case customize it for their own needs or configure it using our capability because they’re doing that without writing a line of code. It’s clicks, not code, deploy them in days, not weeks. They’re doing this in months, not years, and you’re getting immediate ROI. 

Salesforce’s management thinks many of the company’s customers are really disappointed with Microsoft Co-pilot because of a lack of accuracy

So many customers are so disappointed in what they bought from Microsoft CoPilots because they’re not getting the accuracy and the response that they want. Microsoft is disappointed so many customers with AI. 

Wiley is achieving double-digit percentage increase in customer satisfaction and deflection rate, and 50% case resolution with the first generation of Agentforce; Royal Bank of Canada and AP are seeing 90% case resolution rates with the second generation of Agentforce; OpenTable is using Agentforce to support its interactions with 60,000 restaurants and 160 million diners

Wiley is a long-standing Salesforce customer. It’s one of our first deployments in the first agent force trial. It’s pretty awesome. And you all know they make textbooks and it’s back-to-school. But maybe you don’t know that Wiley has to like surge their sales and service organization at back-to-school time when everyone’s buying these textbooks. Well, now they can use agents to do that surge. They don’t have to go buy a bunch of gig workers and bring them in. and that age and capacity is so exciting for them. What we saw with Wiley was, this is a quote from them, “we’re seeing double-digit percentage increase in customer satisfaction and deflection rate compared to older technologies and in these early weeks of our busiest season. ” So that was very reassuring to us. that we have the right thing that’s happening. And Wiley has already seen a 50% increase in case resolution. That’s with our first generation of Agentforce.

As I mentioned, the second generation of Agentforce, which we have with customers already, including some of these amazing organizations like Royal Bank of Canada, ADP and others is 90% case resolution. It is an awesome moment in this tech business.

OpenTable is another super great story. You all know they are managing 60,000 restaurants, 160 million diners to support. They’re on Agentforce now. They require that incredible scale to deliver top-notch customer service. That’s why they’re using the product. It’s been awesome to get their results and it can be all kinds of questions resolving basic issues, account activations, reservation management, loyalty point expiration. Agentforce for service can easily answer all of these questions like when do my points expire for a diner asset, a follow-up question like, what about in Mexico? What about — can I make this change? That’s where we’re delivering those incredible moments for OpenTable, giving them this kind of productivity enhancement. 

Agentforce is driving growth in cloud products’ sales for Salesforce

Agentforce for sales, you can imagine extending your sales force with SDRs, BDRs who are agents that are going out and building pipeline for you and generating all kind of demand and even really closing deals. So, this is going to drive sales cloud growth. It already is, service cloud growth. It already is because customers are going to extend their sales and service organizations and become a lot more productive with these agents.

Salesforce will be releasing industry-specific AI agents in the coming months

In the coming months, we’re going to release Agentforce agents for other roles, including industry-specific agents, health agents, as I mentioned. 

Data Cloud provides the foundation for Agentforce because it holds a huge amount of data and metadata; management continues to believe that data is the foundation of AI; Data Cloud federates and connects to all other data clouds of a user to deliver super accurate AI; Data Cloud is Salesforce’s fastest-growing organic product and will be the fastest to hit $1 billion, $5 billion, and $10 billion in revenue; Data Cloud customers were up 130% year-on-year in 2024 Q2; number of Data Cloud customers spending more than $1 million annually have doubled; Data Cloud processed 2.3 quadrillion records in 2024 Q2 (was 2 quadrillion in 2024 Q1); Data Cloud consumption was up 110% year-on-year in 2024 Q2; American Family Insurance is using Data Cloud to create a 360-degree view of customers; Adecco Group is using Data Cloud to create seamless access to information for 27,000 of its employees; Windhma is using Data Cloud to unify profiles of 165 million guest records, many of which are duplicates across multiple sources

This type of performance from our Agentforce platform wouldn’t be possible without Data Cloud. One of the reasons that our agents are so accurate is because of the huge amount of data and metadata that we had. And data is the foundation for every AI transformation. And with Data Cloud, we’re providing a high-performance data lake that brings together all our customer and business data, federating data from external repositories through this credible zero-copy alliance. So customers can use our Data Cloud and then federate and connect to all their other data clouds and then we can bring it all together to deliver the super accurate AI. 

That’s why Data Cloud is absolutely our fastest-growing organic product in history. It will be the fastest product to $1 billion — it’s going to probably be the fastest product of $5 billion, $10 billion. In Q2, the number of paid Data Cloud customers grew 130% year-over-year and the number of customers spending more than $1 million annually have already doubled. In the second quarter alone, and this is amazing, data Cloud processed 2.3 quadrillion records with 110% platform consumption growth year-over-year…

…American Family Insurance with millions of policyholders nationwide is using Data Cloud to consolidate data from multiple sources through our zero-copy partner network, creating a 360-view of the customers, enabling quick segmentation and activating lead data, including their real-time web interactions. The Adecco Group expanded their data cloud in the quarter, a great example of a company leveraging its gold mine of data to gain a unified view of its customers. Connecting all this data means that 27,000 Adecco employees using Salesforce will have seamless access to key information, including financial metrics and job fulfillment status, to help Adecco improve their job fill rate ratio and reduce their cost to serve…

…Wyndham utilizes Data Cloud to unify profiles of 165 million guest records, many of which were duplicates across many sources like Amazon Redshift and the Sabre Reservation System as well as Sales Cloud, Marketing Cloud and Service Cloud. 

Salesforce has rewritten all of its software to be under one unified platform; management thinks building AI agents without a unified platform is risky; the decision to unite all of Salesforce’s software was made 18 months ago with the shift to AI

We’ve automated every customer touch point and now we’re bringing these apps, data and agents together. It’s these 3 levels, and this is in 3 separate pieces of code or 3 different platforms or 3 different systems. This is 1 platform. We’ve rewritten all of our acquisitions, all of our core modules, our Data Cloud and our agents as 1 unified platform, which is how we are delivering not only this incredible functionality but this high level of accuracy and capability. And from this first-hand experience in meeting with these customers around the globe, I can unequivocably tell you that building these agents without a complete integrated platform is like trying to assemble a plane mid-flight, it’s risky chaotic and it’s not likely to succeed…

…With the shift to AI, it just became clear 18 months ago, we need to hit the accelerator pedal and rewrite all these things onto the core platform because customers are going to get this incredible value by having 1 integrated system, and it scales from small companies to extremely large companies. 

Bookings for Salesforce’s AI products was up more than 100% year-on-year in 2024 Q2; Salesforce signed 1,500 AI deals in 2024 Q2; aircraft maker Bombardier is using Salesforce’s AI products to arm sales reps with better information on, and recommendations for, prospects

We’re already accelerating this move from AI hype to AI reality for thousands of customers with amazing capabilities across our entire AI product portfolio. New bookings for these products more than doubled quarter-over-quarter. We signed 1,500 AI deals in Q2 alone. Some of the world’s largest brands are using AI solutions, including Alliant, Bombardier and CMA CGM. Bombardier, the maker of some of the world’s top performing aircraft, is enabling sales reps to sell smarter by consolidating need to know information on prospects in advance of meetings and providing recommendations on how to best engage with them through the Einstein copilot and prompt builder. 

Salesforce has a new team called Salesforce CTOs that will work alongside customers in deploying AI agents

To help our customers navigate this new world, we just launched a new team called Salesforce CTOs. These are deeply technical individuals who work alongside our customers to help them create and execute a plan for every stage of their AI journey to become agent first companies. 

Salesforce sees itself as customer zero for all its AI products, including Agentforce, and it is deploying its own AI products internally with success; 35,000 Salesforce employees are using Einstein as an AI assistant; Salesforce has already used Slack AI to create 500,000 channel summaries since February 2024, saving 3 million hours of work

We’re continuing our own AI journey internally as a Customer Zero of all of our products with great results. We now have 35,000 employees using Einstein as a trusted AI assistant, helping them work smarter and close deals faster. And since we launched Slack AI in February, our employees have created more than 500,000 channels — channel summaries, saving nearly 3 million hours of work. We’ll, of course, deploy Agentforce agents soon in a variety of different roles and tasks to augment, automate and deliver productivity and unmatched experiences for all employees and customers at scale.

Salesforce will be introducing Industry Toolkit at Dreamforce; Industry Toolkit contains more than 100 ready-to-use AI-powered actions; Industry Toolkit can be used with Agentforce 

At Dreamforce, we’re excited to share our new AI tool kit — industry toolkit, which features more than 100 ready-to-use customizable AI-powered actions. All of these actions can be applied to build industry-specific agents with Agentforce.

Salesforce’s management wants to see 1 billion AI agents by FY2026; there are already 200 million agents identified in trials

I’ll just give you my own personal goals. So I’m not giving any guidance here. My goal is that by the end of fiscal year ’26 that we will have 1 billion agents. Already in just looking at the number of consumers identified just in the trials that we have going on, we have like 100 million identified or more. Okay. call it, 200 million. But the funny thing is, of course, it’s only 1 agent. But let’s just think it’s like a manifestation of all these agents talking to all these consumers.

Salesforce already has a long history of selling non-human consumption-based products; with AI agents, management sees pricing on a consumption basis or on a per conversation basis (at $2 per conversation); management thinks AI agents is a very high-margin opportunity

On pricing. When you think about — when you think about apps and you think about humans, because humans use apps, not in all cases. So for example, the Data Cloud is a consumption product. The Commerce Cloud is a consumption product. Of course, the e-mail product, Marketing Cloud is a consumption product. Heroku is a consumption product. So of course, we’ve had non-human consumption-based products for quite a long time at Salesforce…

…When we look at pricing, it will be on a consumption basis. And when we think about that, we think about saying to our customers, and we have, it’s about $2 per conversation. So, that is kind of how we think about it, that we’re going to have a lot of agents out there, even though it’s only 1 agent. It’s a very high margin opportunity, as you can imagine, and we’re going to be reaching — look, you have to think about these agents are like, this is the new website. This is your new phone number. This is how your customers are going to be connecting with you in this new way, and we’re going to be helping our customers to manage these conversations. And it’s probably a per conversational charge as a good way to look at it or we’re selling additional consumption credits like we do with our data cloud. 

Veeva Systems (NYSE: VEEV)

Veeva’s management is seeing the company’s customers appreciate the patience they have displayed in adopting AI; customers started using Veeva’s Vault Direct Data API for AI use cases in 2024 Q2; Vault Direct Data API provides data access 100 times faster than traditional APIs; management thinks that the advantage of providing API access for AI use cases is the possibility of partners developing use cases that management could not even forsee; customers have to pay a fee to turn on Vault Direct Data API and the fee is for covering Veeva’s compute costs; there’s no heavy implementation tasks needed for Vault Direct Data API

When groundbreaking technology like GenAI is first released, it takes time for things to settle and become clearer. That’s starting to happen now. Customers have appreciated our taking the long view on AI and our orientation to tangible value rather than hype. In Q2, our first early customers started using the Vault Direct Data API to power AI and other use cases. The ability to retrieve data 100 times faster than traditional APIs is a major software platform innovation and will be a big enabler of AI that uses data from Vault applications…

… When you make an API like the Direct Data API, you don’t know the innovation you’re unleashing. And that’s the whole point because the data can be consumed so fast and transactionally accurately, use cases that weren’t practical before can become practical. I mean if I step back way back when to designing the first salesforce.com API, I knew it was going to unleash a lot of innovation, and you just don’t know. It’s not predictable, and that’s the good thing…

…[Question] Looking at Vault Direct Data API, how seamless is it for customers to turn it on and start using it? Is it something that needs an implementation? 

[Answer] That is something that’s purchased by the customer, so that is something that is not free for the customers to use. They purchase it. The fee is not that large. It covers our compute cost, that type of thing… 

…After that, no, there’s no implementation. You turn it on, and it’s on. And that’s that.

Veeva’s AI Partner Program is progressing well and has seen 30 AI use cases being developed by 10 partners across Veeva Development Cloud and Veeva Commercial Cloud; the AI use cases in Veeva Commercial Cloud are mostly related to data science while the use cases in Veeva Development Cloud are mostly related to generation of documents and reports; management does not want to compete with the partners that are in the AI Partner Program 

Our AI Partner Program is also progressing well. We now have more than 10 AI partners supporting roughly 30 use cases across R&D and Commercial. We also continue to explore additional AI application opportunities beyond our current AI solutions…

… [Question] You talked about some of the early traction you’re seeing with the AI Partner Program. Can you maybe talk about what are some of the use cases you’ve seen so far?

[Answer] The types of use cases in commercial often have to do with data science. So things like next best action, dynamic targeting, pre-call planning, things like that. And then in R&D, they can be more things like document generation, generate a clinical study report or doing specific medical coding, things like that. So those are the type of use cases…

…In terms of us monitoring that and informing our own road map, I guess there may be some of that. But mostly, that type of innovation really comes from internally our own thinking with our customers. We don’t want to really disrupt our partners, especially when the partners are having customer success. If there’s a major use case that we’re very clear that customers need and for some reason, the ecosystem is not delivering customer success, yes, maybe we might step in there. But I would guess that what we would do would be more holistic, I guess, in some sense and not specifically something a partner would tackle because we’re generally going to have more resources and more ability to sway our own road map than a partner would, and we want to be respectful to the ecosystem.

Zoom Video Communications (NASDAQ: ZM)

Zoom’s management is seeing customers seeking out the AI capabilities of Zoom’s Contact Center packages; Zoom’s management saw the ASP (average selling price) for its Contact Center product double sequentially because of the product’s AI-tier, which comes with higher pricing

We are seeing increased adoption of our advanced Contact Center packages, as customers seek to utilize our AI capabilities to enhance agent performance…

…If you remember, we started with one pricing tier. We eventually added two more and the AI agent is like that Eric was speaking about earlier, is in the highest tier. We actually saw our ASPs for Contact Center almost double quarter-over-quarter because it’s such a premium feature. And when I look at the Q2 deals, the majority of them were purchasing in one of the top 2 tiers, so all of that is contributing to what I would say is not only expansion in terms of seat count but expansion in terms of value being derived from the product.

Zoom’s AI companion uses generative AI to produce meeting summaries, live translations, image generation and more; Zoom AI Companion is now enabled on over 1.2 million accounts; management wIll be upgrading AI companion as Zoom transitions into the 2.0 phase of AI-enabled work; customers really like ZoomAI Companion; Zoom AI Companion is provided at no additional cost; in Zoom meetings, the No.1 use case of Zoom AI Companion is to create meeting summaries; management is constantly improving the quality of Zoom AI Companion’s meeting summaries; customers are giving positive feedback on Zoom AI companion

Today, AI Companion enhances an employee’s capabilities using generative AI to boost productivity through features like meeting summary, chat compose, image generation, live translation and enhanced features in Contact Center. As these features have grown in popularity, we are happy to share that Zoom AI Companion is now enabled on over 1.2 million accounts…

…Our progress broadening Zoom Workplace, building out enhanced AI tools for Contact Center and amassing a large base of AI users sets us up well to transition into the 2.0 phase of AI-enabled work. In this phase, Zoom AI Companion will move beyond enhancing skills to simplifying your workday, providing contextual insights, and performing tasks on your behalf. It will do this by operating across our collaboration platform to ensure your day is interconnected and productive…

…Our customers really like Zoom AI Companion. First of all, it works so well. Secondly, at no additional cost, not like some of other vendors who got to charge the customer a lot. And in our case, this is a part of our package…

… You take a Meeting, for example, right? For sure, the #1 use case like a meeting summary, right? And we keep improving that quality like in the [indiscernible] and or meeting summary are getting better and better. Like in July, we had another upgrade quarter-wise, even better than previous deliveries, right?..

… [Question] One question I had is when you’re looking at Zoom AI Companion, we’ve heard a lot of great things in the field if customers kind of comparing that to other products that are offered out there. Can you kind of remind us about how you guys think about tracking success with the product internally, given that you don’t kind of charge for it directly beyond having millions of people using it?

[Answer] The metrics that we’ve been talking about on here is account activation. So looking at how many — it’s not individual users, it’s actual customer accounts that have activated it… And also they share the stories like how Zoom AI Companion like is very accurate summary, action items are helping their employees’ productivity as well. And yes, a lot of very positive feedback about adopting Zoom AI Companion.

Zoom’s management has intention to monetise AI services for the Contact Center product, but not for Zoom Workplace

[Question] Now that you’re seeing more adoption, Kelly, of Zoom Companion, how do you think about the cost of providing these generative AI features and capabilities? And do you think Zoom could eventually charge on a usage basis for power users of the generally just trying to weigh cost versus revenue opportunities here?

[Answer] I mean when we launched AI Companion, right? So we do not want to charge the customer. However, that’s for the workplace for the business services like a Contact Center, all those new offerings. And I think for sure, we are going to monetize. As I mentioned in the previous earnings calls, new — new solutions or the billing services, AI, I think we are going to charge. They are AI Companion, right? But the workplace and our core you see offering and collaboration offering we do not want to charge. I want to see — I really appreciate our AI team’s great effort, right? And focus on the quality, focus on the cost reduction and so on and forth.

AI services are a drag on Zoom’s margins at the moment (as the company is providing a lot of AI services for free now) but management sees them as important investments for growth

[Question] Just on gross margins, like the impact of generative AI and maybe what you can do to alleviate some of that off there.

[Answer] I mean we’re guiding to 79% for this year, which we will, reflects the prioritization of AI, but also the very strong discipline that we continue to apply. And we are holding to our long-term target for gross margins of 80%. But of course, we think at this point in time, it’s very important to prioritize these investments as they really set us up for future growth.

Zoom’s dev ops team is saving costs for the company to make room for more AI investments

I also want to give a credit to our dev ops team. On the right hand, for sure, we are going to buy more and more GPUs, right? And also leverage that. Our team tried to save the money from other areas, fully automated, and so on and so forth, right? So that’s another way for us to save the cost, right, to make some room for AI.

The regulatory environment for AI in the US and Europe has so far had very little impact on Zoom’s business because Zoom’s management has been adamant and clear that it is not going to use customer’s data to train its AI models
[Question] Are you seeing anything in the broad sweep of AI regulation in the U.S. or Europe that you think can dampen innovation?

[Answer] That’s the reason why we launch AI Companion, we already mentioned, we are not going to use any of our customer data to train our AI models, right? And we take customers data very, very seriously, right? And as a customer, they know that they trust our brand and trust of what we’re doing. And so far, I do not see any impact in terms of like regulation. And again, this AI is moving rapidly, right? So almost the EMEA here and we all look at the potential regulation. But so far, impact actually to us, to our business, I think it’s extremely limited. So like meeting summary, and it’s a very important feature, customer like that. I think we do not use our customer data to train our AI model. And so why not keep using the feature? I think there’s no impact so far.


Disclaimer: The Good Investors is the personal investing blog of two simple guys who are passionate about educating Singaporeans about stock market investing. By using this Site, you specifically agree that none of the information provided constitutes financial, investment, or other professional advice. It is only intended to provide education. Speak with a professional before making important decisions about your money, your professional life, or even your personal life. I have a vested interest in Adobe, MongoDB, Salesforce, Veeva Systems, and Zoom Video Communications. Holdings are subject to change at any time.

Does News Move The Stock Market?

If we’re constantly looking for news to explain short-term stock price movements, how often can we be right?

A great book I started reading recently is Making Sense of Chaos by economist J. Doyne Farmer. In the book, Farmer discusses his ideas for understanding economies through the lens of complexity science, which is the study of complex adaptive systems. The book referenced an interesting academic finance paper published in 1988 titled What Moves Stock Prices. The paper, authored by David Cutler, James Poterba, and Larry Summers, investigated the influence of news on stock prices.

Farmer described their work as such:

“Cutler, Poterba and Summers began by finding the 100 largest daily fluctuations in the S&P 500 index between 1946 and 1987. They then looked at the New York Times on the day after each move and recorded a summary of the paper’s explanation for the price change. The authors made a subjective judgement as to whether these explanations could plausibly be considered ‘real news’ – or at least real enough to have triggered a sizable change in stock price.”

The largest daily move in the paper’s dataset occurred on 19 October 1987 – now famously known as Black Monday – when the S&P 500 fell by 20.5%. Interestingly, there was no substantial news to explain the collapse. Farmer mentioned in his book:

“The explanations for the 20 per cent drop on October 19, 1987, were ‘worry over dollar decline and rate deficit’ and ‘fear of US not supporting dollar’. Cutler, Poterba and Summers didn’t classify this as news, and I agree. ‘Worry’ and ‘fear’ are subjective statements about the emotional state of the market that have no specific reference to external events.”

Farmer went on to mention:

“Of the dozen largest price fluctuations [shown below], only four were attributed to real news events, a ratio that they found also roughly applied to the largest 100 moves.”

In other words, as I have suspected to be the case for as long as I have been investing, stock prices are indeed more often than not driven by factors outside of the news. I find this to be an important trait of the stock market to know because if we’re constantly looking for news to explain short-term stock price movements, we’re likely to be wrong often, and this can impair our investment decision-making process.

The twelve largest daily price fluctuations in Cutler, Poterba and Summers’ dataset for What Moves Stock Prices:

  1. Date: 19 October 1987
    • Daily change: -20.5%
    • Explanation given: Worry over dollar decline and trade deficit; Fear of US not supporting dollar
  2. Date: 21 October 1987
    • Daily change: 9.1%
    • Explanation given: Interest rates continue to fall; deficit talks in Washington; bargain hunting
  3. Date: 26 October 1987
    • Daily change: -8.3%
    • Explanation given: Fear of budget deficits; margin calls; reaction to falling foreign stocks
  4. Date: 3 September 1946
    • Daily change: -6.7%
    • Explanation given: “… no basic reason for the assault on prices.”
  5. Date: 28 May 1962
    • Daily change:-6.7%
    • Explanation given: Kennedy forces rollback of steel price hike
  6. Date: 26 September 1955:
    • Daily change: – 6.6%
    • Explanation given: Eisenhower suffers heart attack
  7. Date: 26 June 1950:
    • Daily change: -5.4%
    • Explanation given: Outbreak of Korean War
  8. Date: 20 October 1987
    • Daily change: 5.3%
    • Explanation given: Investors looking for “quality stocks”
  9. Date: 9 September 1946
    • Daily change: -5.2%
    • Explanation given: Labor unrest in maritime and trucking industries
  10. Date: 16 October 1987
    • Daily change: -5.2%
    • Explanation given: Fear of trade deficit; fear of higher interest rates; tension with Iran
  11. Date: 27 May 1970
    • Daily change: 5.0%
    • Explanation given: Rumours of change in economic policy; “… the stock surge happened for no fundamental reason”
  12. Date: 11 September 1986
    • Daily change: -4.8%
    • Explanation given: Foreign governments refuse to lower interest rates; crackdown on triple witching announced

Disclaimer: The Good Investors is the personal investing blog of two simple guys who are passionate about educating Singaporeans about stock market investing. By using this Site, you specifically agree that none of the information provided constitutes financial, investment, or other professional advice. It is only intended to provide education. Speak with a professional before making important decisions about your money, your professional life, or even your personal life. I have no vested interest in any companies mentioned. Holdings are subject to change at any time.