<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:cc="http://cyber.law.harvard.edu/rss/creativeCommonsRssModule.html">
    <channel>
        <title><![CDATA[Stories by Jay Rodge on Medium]]></title>
        <description><![CDATA[Stories by Jay Rodge on Medium]]></description>
        <link>https://medium.com/@jayrodge?source=rss-d4998a3bae22------2</link>
        <image>
            <url>https://cdn-images-1.medium.com/fit/c/150/150/1*IBQcaRTiGLIFbuN2XVErTA.png</url>
            <title>Stories by Jay Rodge on Medium</title>
            <link>https://medium.com/@jayrodge?source=rss-d4998a3bae22------2</link>
        </image>
        <generator>Medium</generator>
        <lastBuildDate>Tue, 07 Apr 2026 14:06:56 GMT</lastBuildDate>
        <atom:link href="https://medium.com/@jayrodge/feed" rel="self" type="application/rss+xml"/>
        <webMaster><![CDATA[yourfriends@medium.com]]></webMaster>
        <atom:link href="http://medium.superfeedr.com" rel="hub"/>
        <item>
            <title><![CDATA[Scaling AI Reasoning: Key GTC 2025 Announcements for LLM Developers]]></title>
            <link>https://medium.com/@jayrodge/scaling-ai-reasoning-key-gtc-2025-announcements-for-llm-developers-f33c49b98b2f?source=rss-d4998a3bae22------2</link>
            <guid isPermaLink="false">https://medium.com/p/f33c49b98b2f</guid>
            <category><![CDATA[reasoning]]></category>
            <category><![CDATA[llm]]></category>
            <category><![CDATA[nvidia]]></category>
            <category><![CDATA[conference]]></category>
            <category><![CDATA[gpu]]></category>
            <dc:creator><![CDATA[Jay Rodge]]></dc:creator>
            <pubDate>Wed, 19 Mar 2025 06:30:22 GMT</pubDate>
            <atom:updated>2025-03-19T06:30:22.143Z</atom:updated>
<content:encoded><![CDATA[<figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*ulkFo3VgnNp2QAIVDL9LeA.png" /></figure><p>As the “Super Bowl of AI,” this year’s GTC highlighted significant advancements in hardware and software specifically designed to address the growing demands of large language models.</p><p>Here’s a concise recap of the announcements most relevant to you as an LLM developer.</p><h3>The Focus on Scale and Reasoning in LLMs</h3><h3>AI Scaling Laws</h3><p>Scaling laws continue to drive exponential demand for compute power. As models grow larger and more complex, the need for efficient hardware and software solutions becomes critical.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*CgbhkDhI9M3Q3nFh" /></figure><p>Jensen highlighted how test-time scaling — applying more compute during inference — enhances reasoning capabilities, enabling models to solve increasingly complex problems.</p><h3>Reasoning in LLMs</h3><p>The keynote emphasized a major shift toward reasoning capabilities in LLMs. To support these reasoning-focused models, here are the key announcements:</p><ul><li><a href="https://developer.nvidia.com/blog/introducing-nvidia-dynamo-a-low-latency-distributed-inference-framework-for-scaling-reasoning-ai-models/"><strong>NVIDIA Dynamo</strong></a>: A new open-source inference serving library designed specifically to accelerate and scale reasoning workloads. 
Dynamo efficiently distributes inference across GPUs, dramatically boosting throughput (up to 30X for DeepSeek-R1 models).</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*rQGiuZ0fFsHrlFEQ" /><figcaption>AI inference-serving software designed to maximize token revenue generation for AI factories deploying reasoning AI models</figcaption></figure><p><a href="https://developer.nvidia.com/blog/build-enterprise-ai-agents-with-advanced-open-nvidia-llama-nemotron-reasoning-models/"><strong>NVIDIA Llama Nemotron Reasoning</strong></a>: NVIDIA’s latest family of open reasoning models, optimized for enterprise use cases. These models deliver best-in-class accuracy across benchmarks like GPQA Diamond and MATH 500, thanks to advanced distillation techniques, supervised fine-tuning, and reinforcement learning alignment.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*2M6LG3aE-1sjnCWQ" /><figcaption>5x higher throughput</figcaption></figure><p>These models come in three sizes:</p><ul><li>Nano (8B): Distilled from Llama 3.1 8B for edge and PC deployment</li><li>Super (49B): Distilled from Llama 3.3 70B for optimal accuracy and throughput on data center GPUs</li><li>Ultra (253B): Distilled from Llama 3.1 405B for maximum agentic accuracy (coming soon).</li></ul><h3>Hardware Innovations for LLM Workloads</h3><h3>Blackwell Ultra GPU</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*UnSJksLucxBUVyLp" /><figcaption>NVIDIA Blackwell Ultra Enables AI Reasoning</figcaption></figure><ul><li>Delivers up to 1.5 ExaFLOPS FP4 performance per GPU, ideal for large-scale LLM inference tasks.</li><li>Features HBM3e memory with up to 288GB per GPU, dramatically improving memory bandwidth and capacity for handling large model parameters.</li><li>Optimized specifically for reasoning workloads, enabling faster inference and higher accuracy at scale.</li></ul><h3>DGX Systems</h3><p>NVIDIA introduced <a href="https://nvidianews.nvidia.com/news/nvidia-announces-dgx-spark-and-dgx-station-personal-ai-computers">two new personal AI supercomputers</a> designed to empower developers directly from their desktops:</p><p><strong>DGX Spark</strong>: Compact desktop AI system featuring the GB10 Superchip with 128GB unified memory, ideal for prototyping and fine-tuning LLMs locally. Reservations for <a href="https://www.nvidia.com/en-us/products/workstations/dgx-spark/">DGX Spark systems</a> open today.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*7Cqrb0tb-I9mmhmT" /><figcaption>DGX Spark is the world’s smallest AI supercomputer</figcaption></figure><p><strong>DGX Station</strong>: High-performance desktop solution powered by the GB300 Grace Blackwell Ultra Superchip, delivering up to 20 PFLOPS FP4 performance and 784GB coherent memory. This system supports intensive local development and rapid iteration of large-scale models.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*Sb4-hgs_mM01Yw1w" /><figcaption>DGX Station is expected to be available from manufacturing partners like ASUS, BOXX, Dell, HP, Lambda and Supermicro later this year.</figcaption></figure><h3>Tools for Building Intelligent Agents</h3><p>To simplify building sophisticated agentic systems, NVIDIA launched two powerful tools:</p><ul><li><a href="https://github.com/NVIDIA/AgentIQ"><strong>AgentIQ</strong></a>: An open-source Python library designed to streamline the development of multi-agent AI systems. It offers reusable components, easy configuration via YAML files, detailed telemetry profiling, and built-in optimization tools for efficient agent workflows.</li><li><a href="https://blogs.nvidia.com/blog/ai-agents-blueprint/"><strong>AI-Q Blueprint</strong></a>: A comprehensive reference architecture leveraging reasoning capabilities to seamlessly connect AI agents with enterprise data and tools. 
It integrates multimodal retrieval (via NeMo Retriever), optimized microservices (via NIM), and agent orchestration (via AgentIQ), providing a robust foundation for enterprise-grade agentic applications.</li></ul><h3>Conclusion</h3><p>The GTC keynote highlighted significant leaps forward in hardware, software frameworks, and tools that directly empower LLM developers.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*adyl9YHCRoF-Pa5Q" /></figure><p>With innovations like Blackwell Ultra GPUs, the Dynamo library, advanced Nemotron reasoning models, and robust tooling such as AgentIQ and the AI-Q Blueprint, <a href="https://www.linkedin.com/company/nvidia/">NVIDIA</a> continues to equip developers with everything needed to build the next generation of intelligent applications.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Accelerating PyTorch Inference with Torch-TensorRT on GPUs]]></title>
            <link>https://medium.com/pytorch/accelerating-pytorch-inference-with-torch-tensorrt-on-gpus-896e06ff1637?source=rss-d4998a3bae22------2</link>
            <guid isPermaLink="false">https://medium.com/p/896e06ff1637</guid>
            <category><![CDATA[machine-learning-ai]]></category>
            <category><![CDATA[ai]]></category>
            <category><![CDATA[stories]]></category>
            <dc:creator><![CDATA[Jay Rodge]]></dc:creator>
            <pubDate>Thu, 16 Jun 2022 14:21:09 GMT</pubDate>
            <atom:updated>2022-06-16T14:21:09.190Z</atom:updated>
<content:encoded><![CDATA[<figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*PiZ_bHavO0aWI-mPMyfASQ.png" /></figure><p>Today, we are pleased to announce that Torch-TensorRT has been brought to PyTorch. The official repository for Torch-TensorRT now sits under the <a href="https://github.com/pytorch/TensorRT">PyTorch GitHub org</a>, and documentation is now hosted on <a href="http://pytorch.org/TensorRT">pytorch.org/TensorRT</a>.</p><p>Torch-TensorRT is now an official part of the PyTorch ecosystem. The PyTorch ecosystem includes projects, tools, models and libraries from a broad community of researchers in academia and industry, application developers, and ML engineers.</p><p>Torch-TensorRT aims to provide PyTorch users with the ability to accelerate inference on NVIDIA GPUs with a single line of code.</p><h4><strong>Torch-TensorRT</strong></h4><p><a href="https://developer.nvidia.com/tensorrt">NVIDIA TensorRT</a> is an SDK for high-performance deep learning inference that delivers low latency and high throughput for inference applications across GPU-accelerated platforms running in data centers, embedded and edge devices.</p><p>Torch-TensorRT is an integration for PyTorch that leverages the inference optimizations of TensorRT on NVIDIA GPUs. 
With just one line of code, it provides a simple API that delivers up to a 4x performance speedup on NVIDIA GPUs.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*BoC0vecomfBplVX4V7bLCA.png" /></figure><p>This integration takes advantage of TensorRT optimizations, such as FP16 and INT8 reduced precision via post-training quantization and quantization-aware training, while offering a fallback to native PyTorch when TensorRT does not support certain model subgraphs.</p><h4><strong>Getting Started</strong></h4><pre>import torch_tensorrt<br><br>trt_module = torch_tensorrt.compile(model,<br>    inputs = [torch_tensorrt.Input((1, 3, 224, 224))], # input shape<br>    enabled_precisions = {torch_tensorrt.dtype.half}   # run with FP16<br>)<br><br># save the TensorRT-embedded TorchScript module<br>torch.jit.save(trt_module, "trt_torchscript_module.ts")<br><br>result = trt_module(input_data) # run inference</pre><p>Learn more about Torch-TensorRT’s features with a detailed walkthrough example <a href="https://developer.nvidia.com/blog/accelerating-inference-up-to-6x-faster-in-pytorch-with-torch-tensorrt/">here</a>.</p><h4>Summary</h4><p>PyTorch is a leading deep learning framework today, with millions of users worldwide. 
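</p><p>To sanity-check the advertised speedup on your own model, a small timing harness helps. This is an illustrative sketch (the <code>benchmark</code> and <code>speedup</code> helpers below are ours, not part of Torch-TensorRT); in practice you would time the original module and the compiled <code>trt_module</code> on the same input:</p>

```python
import time

def benchmark(fn, *args, warmup=10, iters=100):
    """Return the average wall-clock latency (seconds) of fn(*args)."""
    for _ in range(warmup):            # warm-up runs exclude one-time costs
        fn(*args)
    start = time.perf_counter()
    for _ in range(iters):
        fn(*args)
    return (time.perf_counter() - start) / iters

def speedup(baseline_latency, optimized_latency):
    """How many times faster the optimized path is than the baseline."""
    return baseline_latency / optimized_latency

# Placeholder callables for demonstration; in practice pass
# lambda: model(input_data) and lambda: trt_module(input_data).
base = benchmark(lambda: sum(x * x for x in range(10000)))
opt = benchmark(lambda: sum(range(10000)))
print(f"speedup: {speedup(base, opt):.2f}x")
```

<p>Note that when timing on GPU you should call <code>torch.cuda.synchronize()</code> before reading the clock, since CUDA kernel launches are asynchronous.</p><p>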
Torch-TensorRT gives PyTorch users extremely high inference performance on NVIDIA GPUs while maintaining the ease and flexibility of PyTorch, bringing TensorRT into the workflow with a single line of code.</p><p>Download and try the samples from the <a href="https://github.com/pytorch/TensorRT">GitHub repository</a>, and find the full documentation <a href="https://pytorch.org/TensorRT">here</a>.</p><p>We would appreciate your feedback on Torch-TensorRT; please report any issues via <a href="http://github.com/pytorch/TensorRT">GitHub</a> or the <a href="https://forums.developer.nvidia.com/c/ai-data-science/deep-learning/tensorrt/92">TensorRT discussion forum</a>.</p><hr><p><a href="https://medium.com/pytorch/accelerating-pytorch-inference-with-torch-tensorrt-on-gpus-896e06ff1637">Accelerating PyTorch Inference with Torch-TensorRT on GPUs</a> was originally published in <a href="https://medium.com/pytorch">PyTorch</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Product Marketing 101]]></title>
            <link>https://medium.com/@jayrodge/product-marketing-101-7625605db488?source=rss-d4998a3bae22------2</link>
            <guid isPermaLink="false">https://medium.com/p/7625605db488</guid>
            <category><![CDATA[sales]]></category>
            <category><![CDATA[product-design]]></category>
            <category><![CDATA[marketing]]></category>
            <category><![CDATA[productivity]]></category>
            <category><![CDATA[product-management]]></category>
            <dc:creator><![CDATA[Jay Rodge]]></dc:creator>
            <pubDate>Tue, 01 Sep 2020 03:42:59 GMT</pubDate>
            <atom:updated>2020-09-01T03:44:05.597Z</atom:updated>
<content:encoded><![CDATA[<h3>Product Marketing 101 (Part 2)</h3><p>Product Lifecycle Part 2 of 4</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1000/0*6oCYitJ19yMKVLIk" /></figure><p>All successful products go through a Product Lifecycle, which is to say they pass through the following stages:</p><ul><li>Development</li><li>Introduction</li><li>Growth</li><li>Maturity</li><li>Decline</li></ul><p>We often depict this lifecycle along a bell-shaped curve to demonstrate the trajectory that most products take, which looks like this:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*VqxLXhSNmIT5fmwNiDcKlQ.png" /><figcaption>Product Lifecycle</figcaption></figure><p>It’s important to know this lifecycle because the role of the Product Marketing Manager and the goals of product marketing change depending on which phase the product is in.</p><p><strong>Development</strong>: Effort to create the product and prepare it for release. From there, we launch the product into the market.</p><p><strong>Introduction</strong>: Generate awareness, build brand identity, and achieve market penetration.</p><p><strong>Growth</strong>: Increase consumer adoption and build brand preference and market share.</p><p><strong>Maturity</strong>: Maintain market share, defend against the competition, build a reputation, and find opportunities to increase revenue.</p><p><strong>Decline</strong>: The product loses desirability, and companies must decide whether to improve it with new features, discontinue it, or pivot.</p><p>Understanding the lifecycle stage the product is in is important, but knowing what’s ahead is just as important.</p><p>Having a short-term view or tunnel vision on a particular phase may lead to decisions that hurt the next stage of the lifecycle. It’s early in the process to do a full assessment, but take 15 to 20 minutes and evaluate what stage you think the product is in. 
If you’re not actively involved with a product, pick a product that you’ve recently started using and work through the same exercise.</p><p><strong>Development Phase</strong></p><p>No amount of marketing or advertising can make up for a substandard product. Skimp on the development phase and you’ll find yourself struggling in future stages of the product lifecycle.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/577/0*fYylOaYZzCj3R4fg.jpeg" /></figure><p>In the development stage, you’re still figuring out what it is you’re building. It might be an entirely new product, a major improvement, or a new feature. During this phase, the focus is to figure out what the market wants. It&#39;s here that you’re researching the competition, understanding the users, and determining what your product must have in order to be successful at launch. You want to do everything you can to get the product right. You must have sufficient features to satisfy your innovators and your early adopters. You can’t satisfy the entire market at launch, so narrow your approach to resonate with this consumer group.</p><p>Consider Amazon’s attempt at entering the mobile phone market. In 2014, Amazon launched the Amazon Fire phone. The marketing was everywhere. They clearly invested impressive capital to drive awareness of their new device, but the market didn’t respond well to the product at all. The entire product was scrapped less than 13 months after launch. Even huge marketing budgets can’t overcome a product that misses the mark.</p><p>So here are the steps that you want to take during the development stage:</p><p><strong>First</strong>, embed yourself in the product development process. Product marketing must work hand in hand with product development throughout this first stage unless the product is too tech-focused. Find ways to be part of the process: if the product team holds daily standups, start attending them and ask questions. 
Attend every planning meeting and get access to the planning software that’s being used.</p><p><strong>Next</strong>, take ownership of <strong>researching </strong>the target market. It’s really important to understand who you’re targeting and what you’re building; how you message it will depend on who will ultimately be buying this product.</p><p>Now be sure you know the answers to these key questions:</p><p>-Who is the target market? <br>-Who is the decision-maker in the purchasing process? <br>-What is the awareness of the existing products available on the market? <br>-Are customers satisfied with the products available?</p><p>Then, demonstrate how your audience will perceive the new product. There’s no reason to launch into the unknown. Start describing how you think your audience will perceive this new product. Include a summary of their needs, why those needs aren’t being met, and how you plan to meet them. You can do this by releasing a beta, conducting a digital survey, or setting up interviews with existing customers or consumers in your target market.</p><p>And finally, understand the product completely. Go beyond just the features: know what it will cost to produce, what the risks are in development, and how much it will cost to maintain. You also want to know what happens if consumers are unhappy, what features are going to come later, and so on. Even if you’re jumping into product marketing well after development, still go through this exercise as if you’re about to launch; it will reveal important details about your product.</p><h4>Introduction phase</h4><p>When your product is first brought to the market, it’s in the introduction stage. You’ve decided what to build, you’ve built it, and now it’s available to the consumer. But during this stage, demand is not yet proven. 
You’re working to validate that what you built resonates with the early adopters in your target market.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/256/0*WIQn4U3UVCe4-bNj.jpg" /></figure><p>In the introduction phase, sales are likely to be minimal and growth is fairly flat. It’s tempting to put the gas pedal down and start ramping up your marketing efforts. Instead, you want to start the introduction phase by focusing on testing. Once you’ve determined that customers are responding favorably, you push forward with your marketing to generate awareness and grab a foothold in the market.</p><p>Now during this phase, you can get away with higher prices, more selective placements, and very personalized promotions. Your immediate goal is to evaluate the consumer response.</p><p>So be sure you can answer these three questions:</p><p><strong>First</strong>, what is the consumer reaction to the product? <br>Are they responding favorably to the initial exposure? <br>Are there specific features or attributes they like or dislike? <br>Are there specific elements they prefer over the competition?</p><p><strong>Second</strong>, what are the consumer concerns about the product?</p><p><strong>Third</strong>, what are the consumer’s unmet or unstated needs?</p><p>You can conduct surveys, review your customer support interactions, or listen to conversations happening on social media to gather this data. Once you know the market response, you can decide to go back and fine-tune the product, or press on. Once you press on, your goals will change to generating awareness and establishing a foothold in the market.</p><p>Now the introduction phase has a misleading name. Introductions are usually fairly quick. But in the product life cycle, this stage can take a while. 
So work to constantly improve your messaging, adjust your value propositions, and keep the feedback loop open with your customers, so you can fine-tune your marketing and unlock the next stage.</p><h4><strong>Growth Phase</strong></h4><p>When the product has generated awareness and consumer demand has increased, you’re moving into the growth phase. Throughout this phase, you’ll be deploying various marketing strategies to significantly increase your growth curve. The focus is to get consumers to prefer your brand over other options. This will require differentiating your marketing strategy and testing new approaches. How quickly you move from the introduction stage to the growth stage, and how rapidly sales increase, will vary. But you’ll recognize a few key attributes of this stage.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/730/0*4lXSLJrzOVQ3YzU-.jpg" /></figure><p>First, you’ll begin to see an increase in competition. When the demand for your product starts to increase, other companies will look to enter the market. You’ll need to adjust your marketing strategies so that you can defend against the new market entrants.</p><p>Secondly, you can expect the size of the market to increase, which increases the demand for the product and ultimately leads to a sharp increase in sales.</p><p>And lastly, you’ll start to see increased profits.</p><p>Typically, you’ve figured out how to reduce your costs by this stage, and marketing and the business as a whole tend to be more efficient.</p><p>Take Tesla, for example. They started by producing an expensive all-electric sports car aimed at innovators and early adopters. They learned about the process, the business, and the market. Then, they moved on to more mainstream but still high-priced vehicles with their sedan and an SUV. The cost kept their market size small, but this was their introduction phase. 
They tested the market, evaluated the response, and generated a ton of brand awareness in the process. From there, they launched an affordable sedan aimed at a much larger market, and it’s there that growth really took off. Now many other car companies are developing fully electric vehicles.</p><p>This increase in competition helps demonstrate that Tesla is now in the growth stage. So what are your goals for product marketing during growth? Well, first, evaluate existing value propositions. Understand that as the market expands, perceptions shift. Reevaluate your value propositions and your key differentiators. Verify the messaging is resonating with this new audience. You may need to work to improve product quality, add new features, or expand support services to increase market share.</p><p><strong>Next</strong>, listen intently. Launch customer interviews again, and dig deeper into why new customers are coming on board. Sit down and review customer service records to understand pain points. Work with the sales team to determine where they have friction in their sales process, and listen to consumers to evaluate new market segments to enter.</p><p><strong>Third</strong>, strategize. Now is when all the common marketing tactics come out. Pursue opportunities in content marketing, digital, PR, trade shows, and so on. Look to increase your distribution channels and start shifting your marketing messages from product awareness to product preference. You’ve departed from your go-to-market strategy and you’re now deploying a wide array of marketing and advertising efforts.</p><p>And <strong>finally</strong>, measure. Measure the metrics that matter. Understand what your business goals are and which metrics demonstrate your success. A robust analysis will help you readjust your marketing strategies. 
Throughout your growth phase, I encourage you to explore and practice the benefits of Agile marketing.</p><h4><strong>Maturity Phase</strong></h4><p>Eventually, demand levels off, your sales peak, and growth becomes a lot less aggressive. It’s here that the market is nearly saturated. At this point, a product has reached the maturity stage. Your focus now is to defend your turf and prolong your product lifecycle. You’re working to maintain brand preference while at the same time trying to keep your price competitive so the product maintains profitability.</p><p>You’re also going to be making finer and finer differentiations in your product, as longevity tends to cause competing products to get closer and closer to being identical. Plus, you must be ready for competitors to launch new features and try to distance themselves from you. The real goal here is to stop trying to prolong the inevitable. You need to start thinking about ways to disrupt your business model before someone else does.</p><p>Take Blue Apron, for example. This online meal kit delivery service was founded in 2012 and spent quite some time in the introduction phase before entering a growth phase in 2015. Competitors saw this and flooded the market, brands like HelloFresh and Home Chef. And then, Blue Apron hit maturity. They filed for a public offering in 2017, but as of May 2018, they had lost 81% of their market value since that public offering. Growth is flat, the market is crowded, and now they’re forced to defend their turf and pursue differentiation.</p><p>Now, maturity isn’t bad, and it doesn’t always mean a loss in market value. It just comes with its own set of challenges. So then, what are the product marketing goals during this stage? For starters, understand what changes have occurred since the introduction and growth phases. Evaluate where the brand started and where it is now.</p><p>Really understand the <strong>consumer </strong>mindset.</p><p>-How did they become aware of the product? 
<br>-Why did they consider it? Why did they purchase it? <br>-Why did they not purchase?</p><p>Evaluate all the details. What are consumers saying about the competition and their products? Are they talking about any perceived differentiation or new features? Use this to arm your product team with ideas for new features and to modify your messaging and marketing angles.</p><p>Next, consider new market segments. It may be possible to redefine your target market or even enter new market segments altogether. Perhaps Blue Apron looks at developing a college line. It may even be necessary to pivot and find different ways other markets can use the product.</p><p>Third, attack the competition; know that you’re at war. Apple’s iPhone is a mature product, and it never stops chasing after the competition. And the competition never stops chasing after the iPhone.</p><p>Fourth, seek bigger opportunities. Have key brand advocates and influencers emerged since the product entered the maturity phase? Can they be engaged moving forward? <br>Are there new products in the market that are in their growth stage where alignment or a strategic partnership becomes valuable, for instance?</p><p>Finally, leverage your loyal customers. Use those customers that are loyal evangelists to your advantage. Survey them, offer them incentives, and consider ways to get them to spread the word to late adopters and laggards, who you know to be skeptical and difficult to convert.</p><p>The maturity phase is a great place to be, but don’t get complacent and assume that you’re standing on bedrock. It’s a different marketing game, so stay on top of it.</p><h4><strong>Decline Phase</strong></h4><p>When market maturity tapers off, products enter the final phase: decline. During the end stages of a product, you’ll see declining sales and profits. 
Consumers get bored, technology evolves, companies make mistakes, and people’s preferences change.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*0LkCJSzGfyNVsOEU.jpg" /></figure><p>Consider a few popular technology products that have changed over the last few years. DVD subscriptions with Netflix shifted to online streaming. Apple’s iPod line is essentially now a simplified iPhone. Digital cameras are less popular, as phone cameras are sufficient. Standalone GPS products are being replaced by Google Maps on mobile phones.</p><p>Now, all of those products still have a foothold somewhere. They’ve just shifted to new markets and reduced distribution, cut prices, or adjusted the features. They’re in decline. It’s not over yet. But very few products make it out of this decline stage. They can hold on for quite some time, but they hit the bottom eventually.</p><p>Now, this isn’t to say the business is doomed, just that the product offered as-is by the business is likely on the way out. At this stage, your primary goal is to pick a strategy that makes the most sense for your product. The most popular strategies are to reduce your marketing expense on the product and thin the teams out, implement price cuts to convince late adopters and laggards to buy the product, find another use for the product, maintain the product and wait for the market to shift back, sell the product off to another company, or discontinue it altogether.</p><p>Product marketing tends to take a backseat once a product is in decline. There tends to be limited support for investing in marketing spend. Now, there’s little for marketing to do in a true decline, but products can flirt with the decline stage or even enter decline prematurely, and this can happen abruptly. A product in the midst of growth is all of a sudden fast-tracked to decline, and this typically happens for two reasons. 
The product saw too high a rate of refresh, or the product failed to meet the necessary rate of refresh.</p><p>In product marketing, you must keep a pulse on the consumer. It’s not strictly about the marketing campaigns. It’s about maintaining product-market fit. If you’re updating a product too frequently, changing the core features that consumers liked, you may see customers depart. Think about products that went through unnecessary redesigns, leaving consumers dissatisfied and confused.</p><p>Alternatively, some products fail to change. In some markets, consumers expect seasonal refreshes. They expect new features. The iPhone must always evolve. The market expects it. If Apple stops innovating, the consumer will pursue a brand or product that does. If you’re in decline, embrace it. Either sunset the product or find a way to reinvent it so that it moves back to an earlier stage of the lifecycle.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/400/0*h3usEe6WZtrIxDP1.jpg" /></figure><p>By creatively repositioning your product, you can change the way consumers evaluate it, potentially rescuing it from decline and pushing it back into growth.</p><p>This was Part 2 of the Product Marketing series; I’ll be posting Part 3 next week (09/08).</p><p>Part 1: What is Product Marketing? (<a href="https://medium.com/@jayrodge/product-marketing-101-cc99286e0bdc">link</a>)</p><p>Part 2: Product Lifecycle (<a href="https://medium.com/@jayrodge/product-marketing-101-7625605db488">link</a>)</p><p>Part 3: Product Market Fit (Coming on 09/08)</p><p>Part 4: Go-To-Market Plan (Coming on 09/14)</p><p>Connect with me on LinkedIn: <a href="https://www.linkedin.com/in/jayrodge">here</a></p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Product Marketing 101]]></title>
            <link>https://medium.com/@jayrodge/product-marketing-101-cc99286e0bdc?source=rss-d4998a3bae22------2</link>
            <guid isPermaLink="false">https://medium.com/p/cc99286e0bdc</guid>
            <category><![CDATA[product-management]]></category>
            <category><![CDATA[management]]></category>
            <category><![CDATA[product]]></category>
            <category><![CDATA[technology]]></category>
            <category><![CDATA[marketing]]></category>
            <dc:creator><![CDATA[Jay Rodge]]></dc:creator>
            <pubDate>Mon, 24 Aug 2020 02:52:40 GMT</pubDate>
            <atom:updated>2020-09-01T03:52:48.951Z</atom:updated>
            <content:encoded><![CDATA[<h3>Product Marketing 101 (Part 1)</h3><p>What is Product Marketing (Part 1 of 4)</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*IFuj1oUcISpV5IXtyMvg1w.png" /></figure><p>One of the biggest challenges in marketing today is becoming too focused on tactics: buying ads, designing emails, creating blog content, and so on. We often forget that the most essential aspect of marketing is understanding what the customer wants and how to align products with their needs. This is handled through product marketing, and it’s one of the most important areas of marketing at any organization.</p><p><strong>What is Product Marketing?</strong></p><p>Product marketing is ultimately the sum of all the efforts necessary to research, message, position, promote, and support a new product so that it successfully resonates with the target audience.</p><p>Product marketing requires participating in discovery, delivery, and ultimately, a go-to-market strategy. That’s all fairly broad, so here are some specific tasks you might encounter:</p><ul><li>Coordinating with the product teams to understand roadmaps and deliverables</li><li>Conducting market research to understand customer needs and wants</li><li>Forecasting results</li><li>Competitive intelligence to understand the market</li><li>Go-To-Market campaign strategies</li></ul><p>Furthermore, being successful in product marketing means you’re constantly hedging against risk. Ideally, the work starts well before the product is developed.</p><p>You have to understand the market, what the customer really wants, and how your product will deliver.
All too often, companies make the fatal mistake of handing a marketer a finished product and saying, “here, go market this.”</p><p>To be successful, you need to be armed with not only the knowledge of how product marketing works but also the confidence to put on the brakes and push the company to shift directions if you sense disaster ahead.</p><p><strong>Product Management vs Product Marketing</strong></p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*oEzj0ls0i-qyzSyNQCQhZg.png" /></figure><p>Put simply, the Product Manager’s job is to get the product to the shelf, and the Product Marketing Manager’s job is to get it off the shelf. The Product Manager develops the product; the Product Marketer packages it attractively and designs campaigns to motivate consumers to purchase it.</p><p>Despite their different job roles, product managers and product marketers have a few things in <strong>common</strong>:</p><ul><li>They both invest tremendous effort in knowing who their customer is.</li><li>They collaborate with teams across the entire organization, and they must work together to see meaningful results.</li></ul><p>The <strong>Product Marketing Manager</strong> typically takes on the following responsibilities:</p><ul><li>Researching the market and segmenting the target customers.</li><li>Positioning and messaging the product and its features.</li><li>Understanding the competitive landscape.</li><li>Securing product and message market fit.</li><li>Driving demand and adoption of the product.</li></ul><p>The <strong>Product Manager</strong>’s responsibilities focus on:</p><ul><li>Being the voice of the customer and championing what they want internally.</li><li>Deciding what to build and organizing that process.</li><li>Understanding the technology and managing scale.</li><li>Shipping the right product.</li><li>Prioritizing what to build.</li></ul><p>Both roles must work hand-in-hand, but it’s not always easy.
Product managers are often using a different set of success metrics to determine what they’re prioritizing. They may be focused on positive user reviews, whereas product marketing might be measuring success by consumer activations or overall brand awareness. To prevent friction, clarify goals with product management. Agree to align your goals with the company’s major objective. Take time to meet weekly, and be transparent and thorough in your communication.</p><p>This was Part 1 of the Product Marketing series; I’ll be posting Part 2 next week (08/31).</p><p>Part 1: What is Product Marketing? (<a href="https://medium.com/@jayrodge/product-marketing-101-cc99286e0bdc">link</a>)</p><p>Part 2: Product Lifecycle (<a href="https://medium.com/@jayrodge/product-marketing-101-7625605db488">link</a>)</p><p>Part 3: Product Market Fit (Coming on 09/08)</p><p>Part 4: Go-To-Market Plan (Coming on 09/14)</p><p>Connect with me on LinkedIn: <a href="https://www.linkedin.com/in/jayrodge">here</a></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=cc99286e0bdc" width="1" height="1" alt="">]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Machine Learning Engineer vs Data Scientist]]></title>
            <link>https://medium.com/@jayrodge/machine-learning-engineer-vs-data-scientist-befa63bc1ce4?source=rss-d4998a3bae22------2</link>
            <guid isPermaLink="false">https://medium.com/p/befa63bc1ce4</guid>
            <category><![CDATA[machine-learning-engineer]]></category>
            <category><![CDATA[engineering]]></category>
            <category><![CDATA[hiring]]></category>
            <category><![CDATA[data-scientist]]></category>
            <category><![CDATA[requirements]]></category>
            <dc:creator><![CDATA[Jay Rodge]]></dc:creator>
            <pubDate>Fri, 07 Feb 2020 17:33:07 GMT</pubDate>
            <atom:updated>2020-04-20T00:12:22.705Z</atom:updated>
            <content:encoded><![CDATA[<iframe src="https://cdn.embedly.com/widgets/media.html?src=https%3A%2F%2Fplay.ht%2Fembed%2F%3Farticle_url%3Dhttps%3A%2F%2Fmedium.com%2F_p%2Fmachine-learning-engineer-vs-data-scientist-befa63bc1ce4&amp;display_name=Play&amp;url=https%3A%2F%2Fplay.ht%2Farticles%2Fbefa63bc1ce4&amp;image=https%3A%2F%2F2%2ABqw-oC-z51DSGfXWNNXxww.jpeg&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;type=text%2Fhtml&amp;schema=play" width="700" height="185" frameborder="0" scrolling="no"><a href="https://medium.com/media/efa2b2a72c9414d666557f6995461433/href">https://medium.com/media/efa2b2a72c9414d666557f6995461433/href</a></iframe><p>How are these roles different?</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/813/1*Cz8nxCxOWnAE2JY1fmpg2A.jpeg" /></figure><p>The role of a Machine Learning Engineer is relatively new, and it confuses many people. This article will give an overview of each role and how they differ.</p><p>Let&#39;s have a look at what Data Science and Machine Learning are, before talking about the roles.</p><p><strong>Data Science</strong></p><p>Data science analyzes data and uses the results to draw causal inferences. The goal of data science is to help companies understand their current state, find reasons for something that happened in the past, and come up with the best solutions for the future.</p><p><strong>Machine Learning</strong></p><p>In simple words, machine learning uses data-driven algorithms so that a machine (a software application) can learn and make data-based predictions. The more data the machine is “fed”, the better the predictions. So predictions will be more accurate if the data is processed well.</p><h3><strong>The Difference</strong></h3><p>What better way than to ask someone who has experience in both roles?
So for this, I reached out to <a href="https://www.brandonschabell.com/">Brandon Schabell</a>, who has experience working as both a Senior Data Scientist and a Senior Machine Learning Engineer. Here’s how he explains it:</p><p>The definitions of each role are definitely going to vary from company to company. In fact, at my current company (GoHealth), they are literally the same position; I just wanted a different title for different reasons.</p><p>There are a couple of distinct areas within data science where people could spend the majority of their time. There are companies where data scientists exclusively do machine learning and modeling, but there are plenty of others where a data scientist’s role is really doing data analysis. It could be advanced, even using machine learning at times, but the end product there is often business insight rather than a product.</p><p>I think at a lot of companies, a machine learning engineer could be pretty accurately defined as the type of data scientist that focuses on machine learning. There’s often a larger software engineering component to a machine learning engineer&#39;s job, however. A machine learning engineer may or may not be involved in the development of a model, but they are almost always the ones to optimize the model and put it into production. A lot of financial institutions will have a stricter differentiation if you look at quantitative researchers (quants) vs quantitative developers (which is similar to data scientist vs machine learning engineer in any other field).</p><p>I quite enjoy both model building and some of the new challenges of putting complex models into stable production systems, so I pushed to have the ML engineer title, as I felt it differentiated me, at least on paper, from someone who is only focused on model building.
It also made applying for jobs easier: I only really focused on ML engineer positions, as a lot of data scientist positions were too ambiguous for my liking.</p><p>For the majority of companies, to be a Machine Learning Engineer, knowing the ML techniques for your industry (CV, NLP, recommendation systems, predictive models, etc.) is important, but you’ll also need good software engineering knowledge. That typically (at least in my experience) means Python, AWS, SQL, Docker, good testing methodologies, maybe some Java, and CI/CD basics. I was a software engineer for a couple of years before moving into a data scientist role, and that was incredibly valuable for me in becoming an ML engineer.</p><h3><strong>Conclusion</strong></h3><p>While there is some overlap (which is why some data scientists with backgrounds in software engineering move into machine learning engineering roles), data scientists focus on data analysis, business insights, and model prototyping, while machine learning engineers focus on coding and deploying complex, large-scale machine learning products.</p><p>If you found this helpful, please share it on LinkedIn, Twitter, Facebook or any of your favorite forums.</p><p>Connect with me on <a href="https://linkedin.com/in/jayrodge">LinkedIn</a>.</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=befa63bc1ce4" width="1" height="1" alt="">]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Magic of Thinking Big]]></title>
            <link>https://medium.com/@jayrodge/magic-of-thinking-big-bb2a1989981e?source=rss-d4998a3bae22------2</link>
            <guid isPermaLink="false">https://medium.com/p/bb2a1989981e</guid>
            <category><![CDATA[positivity]]></category>
            <category><![CDATA[positive-thinking]]></category>
            <category><![CDATA[thoughts]]></category>
            <category><![CDATA[thinking]]></category>
            <category><![CDATA[life-lessons]]></category>
            <dc:creator><![CDATA[Jay Rodge]]></dc:creator>
            <pubDate>Wed, 13 Mar 2019 15:29:59 GMT</pubDate>
            <atom:updated>2020-03-05T15:14:51.692Z</atom:updated>
            <content:encoded><![CDATA[<h3>The Magic of Thinking Big</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*iFr9R3auWahUQYi1.jpg" /></figure><iframe src="https://cdn.embedly.com/widgets/media.html?url=https%3A%2F%2Fplay.ht%2Farticles%2Fbb2a1989981e&amp;src=https%3A%2F%2Fplay.ht%2Fembed%2F%3Farticle_url%3Dhttps%3A%2F%2Fmedium.com%2F_p%2Fmagic-of-thinking-big-bb2a1989981e&amp;type=text%2Fhtml&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;schema=play" width="700" height="185" frameborder="0" scrolling="no"><a href="https://medium.com/media/d75daefe377db489bb1330d0f17d934a/href">https://medium.com/media/d75daefe377db489bb1330d0f17d934a/href</a></iframe><p>I recently finished the book “<strong>The Magic of Thinking Big</strong>” by <strong>David Schwartz</strong>, and it’s one of the best books I have ever read. Here are my thoughts about the book.</p><p>According to David Schwartz, whether your life goals are unbelievably large and intimidating or relatively small and achievable, you may have thought more than once, “Where do I start?”</p><p>The author suggests that you begin by creating an attitude in which you feel 100% able to achieve what you intend to do; in other words, you should act like one of the best.</p><p>Why does this work? Because once you begin to believe in yourself sufficiently, your brain creates the creativity required to achieve your objective. One McKinsey study cited in the book states that the drive to move forward is what management and social leaders seek most when working with people.</p><p>The author emphasizes working on your creative thinking abilities constantly. Your brain can evaluate and adapt to every situation, as it remains flexible.</p><p>He also suggests being in “Always Learning” mode. For example, if you work at a car dealership, it may seem useless to learn Photoshop, but if it is fun, do it anyway. Take the “Who knows what it’s good for?” mentality.
You can easily use your new graphic design skills to create a bunch of great Facebook ads that can help you sell a lot more cars.</p><p>The second strategy he suggests is to shut down the negative voices in your head. With the news reporting mainly on horrifying events and people complaining to everyone around them, negative thinking is the norm. You will find that naysayers are nearly always unsuccessful or average.</p><p>If you have imposter syndrome or don’t believe you are capable enough, this can be eliminated by writing a pep talk that reads like a commercial in which you try to sell yourself. Focus on what makes you different, e.g. you’re funny and always make people laugh at work. Read it aloud or quietly once a day when you feel a little down.</p><p>No one is born confident, but everybody can excel at it. “Fake it until you make it” is true in this case, because you can control the way you feel. So sit down in the first row, get in touch with people, and walk faster than anyone else.</p><p>Although the individual ideas discussed in the book are all well known today, I liked how the book tied them all together, especially given how old the book is. I could relate to its quotes and ideas; for example, I’m a fast walker, and I’ve noticed that I feel more confident when I rush past all the slow people dragging their feet :).</p><p>This summary covers a very small subset of the book, which has a lot more information than I was able to cover. I therefore recommend you give this book a read and experience the changes you could bring to your work or studies by applying some small changes.</p><p>To know what I’m reading and what I’ve read, let’s connect on <a href="https://www.goodreads.com/user/show/81702738-jay-rodge">Goodreads</a>!</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=bb2a1989981e" width="1" height="1" alt="">]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Productivity Hack]]></title>
            <link>https://medium.com/@jayrodge/productivity-hack-10d7243c5?source=rss-d4998a3bae22------2</link>
            <guid isPermaLink="false">https://medium.com/p/10d7243c5</guid>
            <category><![CDATA[productivity]]></category>
            <category><![CDATA[hacks]]></category>
            <category><![CDATA[control]]></category>
            <category><![CDATA[task-management]]></category>
            <category><![CDATA[tasks]]></category>
            <dc:creator><![CDATA[Jay Rodge]]></dc:creator>
            <pubDate>Wed, 13 Mar 2019 15:24:57 GMT</pubDate>
            <atom:updated>2019-03-13T15:24:57.725Z</atom:updated>
            <content:encoded><![CDATA[<figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/0*gZ8FbjAp6r6dUxLQ" /></figure><p>I have been reading a book called “<a href="https://www.amazon.com/Productivity-Project-Accomplishing-Managing-Attention/dp/1101904054/ref=asc_df_1101904054/?tag=hyprod-20&amp;linkCode=df0&amp;hvadid=312045580796&amp;hvpos=1o1&amp;hvnetw=g&amp;hvrand=3761366617237257287&amp;hvpone=&amp;hvptwo=&amp;hvqmt=&amp;hvdev=c&amp;hvdvcmdl=&amp;hvlocint=&amp;hvlocphy=9021721&amp;hvtargid=pla-401450758733&amp;psc=1&amp;tag=&amp;ref=&amp;adgrpid=60223809337&amp;hvpone=&amp;hvptwo=&amp;hvadid=312045580796&amp;hvpos=1o1&amp;hvnetw=g&amp;hvrand=3761366617237257287&amp;hvqmt=&amp;hvdev=c&amp;hvdvcmdl=&amp;hvlocint=&amp;hvlocphy=9021721&amp;hvtargid=pla-401450758733">The Productivity Project</a>” by Chris Bailey.</p><p>This book is about how to maximize your productivity, and the author, Chris Bailey, has done a lot of experiments to demonstrate the results.</p><p>The one big and simple tip he suggests is to keep your to-do list as short as possible; the number of tasks he recommends is <strong>3</strong>.</p><p>Creating a huge to-do list doesn’t make sense, as the chances of checking off all the tasks would be lower, which could result in a feeling of failure at the end of the day.</p><p>On the other hand, keeping only one task can eat up all your time, and after completing it, you may scroll mindlessly through your social network feeds.</p><p>Imagine being at the end of the day and asking, “Which important tasks, if completed today, will give me a sense of relief and a good night’s sleep?” Then add those tasks to your to-do list.</p><p>I have been following this routine for a week now, and the results are great; the sense of accomplishment you get after completing those tasks is just awesome!</p><p>Following this simple hack/habit over time will increase your productivity by a huge margin.</p><img
src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=10d7243c5" width="1" height="1" alt="">]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Text/Document Classification using PyText]]></title>
            <link>https://medium.com/hackernoon/text-document-classification-using-pytext-ca7e1c380d5f?source=rss-d4998a3bae22------2</link>
            <guid isPermaLink="false">https://medium.com/p/ca7e1c380d5f</guid>
            <category><![CDATA[machine-learning]]></category>
            <category><![CDATA[deep-learning]]></category>
            <category><![CDATA[pytext]]></category>
            <category><![CDATA[facebook]]></category>
            <category><![CDATA[artificial-intelligence]]></category>
            <dc:creator><![CDATA[Jay Rodge]]></dc:creator>
            <pubDate>Sat, 09 Mar 2019 18:46:05 GMT</pubDate>
            <atom:updated>2019-08-06T14:51:02.751Z</atom:updated>
            <content:encoded><![CDATA[<p>Hands-on with Facebook’s newly open-sourced NLP library PyText, based on PyTorch</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*oLcN6Vlpa-PrxnRYJGnXDQ.png" /></figure><p>After releasing PyTorch 1.0, <a href="https://medium.com/u/25aae929dbb1">Facebook Research</a> recently open-sourced PyText, its natural language modeling framework based on PyTorch. It tries to bridge the gap between experimentation and rapid deployment/production, which was difficult with existing libraries.</p><p>PyText aims to achieve the following things:</p><ul><li>Make experimentation easy and fast</li><li>Reduce extra work when using pre-built models on new data</li><li>Define a clear path for researchers and engineers to build, test and deploy their models quickly</li><li>Ensure high performance</li></ul><p>PyText has a lot of support for rapid prototyping and is faster than other Natural Language Processing (NLP) libraries available. Here’s a comparison of PyText with other NLP libraries:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*PM1zDyVfL0LY0ZkUEETiQA.png" /><figcaption><a href="https://arxiv.org/abs/1810.07942">https://arxiv.org/abs/1810.07942</a></figcaption></figure><p>Facebook now uses PyText in Portal, their video calling service, and in the M suggestions feature in Messenger. The M suggestions feature generates more than a billion daily predictions, which shows PyText can operate at production scale while keeping latency low.</p><p>PyText is built on PyTorch, and it connects to ONNX and Caffe2.
With PyText, AI researchers and engineers can convert PyTorch models to ONNX and then export them as Caffe2 for production deployment at scale.</p><p>PyText relies on the components displayed in the figure below:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*O_hO7ol2HOVHnrhwFZ_WIA.png" /><figcaption><a href="https://code.fb.com/wp-content/uploads/2018/12/06_PyText_Flowchart_hero.png">https://code.fb.com/wp-content/uploads/2018/12/06_PyText_Flowchart_hero.png</a></figcaption></figure><p><strong>Task:</strong> combines the various components required for a training or inference task into a pipeline. It can be configured as a JSON file that defines the parameters of all the child components. We’ll discuss a sample config for a document classification task later in the post.</p><p><strong>Data Handler</strong>: processes raw input data and prepares batches of tensors to feed to the model.</p><p><strong>Model</strong>: defines the neural network architecture.</p><p><strong>Optimizer</strong>: encapsulates model parameter optimization using the loss from the forward pass of the model.</p><p><strong>Metric Reporter</strong>: implements the relevant metric computation and reporting for the models.</p><p><strong>Trainer</strong>: uses the data handler, model, loss, and optimizer to train a model and perform model selection by validating against a holdout set.</p><p><strong>Predictor</strong>: uses the data handler and model for inference on a given test dataset.</p><p><strong>Exporter</strong>: exports a trained PyTorch model to a Caffe2 graph using ONNX.</p><p>Let’s start by building a sentiment classifier using PyText; it’s simple!</p><p>To install PyText on your machine, enter the following on your command line:</p><pre>pip install pytext-nlp</pre><p>Before getting started, let’s get the data right. Here, we will be using an Amazon reviews dataset, which has positive and negative reviews for various products and 10,000 total examples.
<br>PyText needs a .tsv (tab-separated values) file in the following fashion:</p><p>___label___ &#39;This is a Text&#39;</p><p>Here’s how our dataset (.tsv format) looks:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*oneYNYdYtHGWOk9aPe6ozw.png" /><figcaption>Dataset</figcaption></figure><p>To define a model, PyText uses a configuration file (Task) in .json format, where you can define your model.</p><p>Here’s a configuration file, where we provide the training, validation, and testing data, as well as other details like the number of epochs, batch size, and optimizer.</p><pre>{<br>  &quot;task&quot;: {<br>    &quot;DocClassificationTask&quot;: {<br>      &quot;data_handler&quot;: {<br>        &quot;train_path&quot;: &quot;data/train.tsv&quot;,<br>        &quot;eval_path&quot;: &quot;data/eval.tsv&quot;,<br>        &quot;test_path&quot;: &quot;data/test.tsv&quot;,<br>        &quot;train_batch_size&quot;: 128,<br>        &quot;eval_batch_size&quot;: 128,<br>        &quot;test_batch_size&quot;: 128<br>      },<br>      &quot;trainer&quot;: {<br>        &quot;epochs&quot;: 20<br>      },<br>      &quot;optimizer&quot;: {<br>        &quot;lr&quot;: 0.001,<br>        &quot;type&quot;: &quot;adam&quot;,<br>        &quot;weight_decay&quot;: 0.000004<br>      }<br>    }<br>  }<br>}</pre><p>Now, to train the model, just type on the command line:</p><pre>pytext train &lt; config.json</pre><p>And boom, it should be training. By default, it uses a Bidirectional LSTM model, and with 15 epochs, the model achieves around 83% accuracy, which is good considering we didn’t preprocess the text data.</p><p>PyText exports the model as a Caffe2 object; to save the trained model:</p><pre>pytext export --output-path model.c2 &lt; config.json</pre><p>For predicting, we use a PyText predictor object, which requires the saved model and the configuration file (.json).
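</p><p>To make the data handling above concrete, here is a minimal, PyText-independent Python sketch: it writes reviews in the tab-separated label-and-text layout expected for train.tsv, and shows how a final label can be chosen from a dictionary of per-label scores. The file name, reviews, and score values are made-up examples, not output from a real model.</p>

```python
# Hypothetical labeled reviews: (label, raw text) pairs.
reviews = [
    ("positive", "Great speakers, crisp sound"),
    ("negative", "Battery died after two days"),
]

# Write one review per line: the label, a tab, then the text,
# matching the .tsv layout described above.
with open("train.tsv", "w") as f:
    for label, text in reviews:
        f.write(f"{label}\t{text}\n")

# After inference, a classifier produces one score per label;
# the label with the highest score is the predicted sentiment.
scores = {"positive": 0.83, "negative": 0.17}  # made-up scores
predicted = max(scores, key=scores.get)
print(predicted)  # positive
```

<p>Keeping this step independent of PyText makes it easy to inspect the .tsv file before handing it to the training pipeline.</p><p>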
Here’s a small Python script to predict the sentiment of a given sentence/text:</p><iframe src="" width="0" height="0" frameborder="0" scrolling="no"><a href="https://medium.com/media/6699d630b7cc1121d04812695803e475/href">https://medium.com/media/6699d630b7cc1121d04812695803e475/href</a></iframe><p>The above four lines are what it takes to predict the sentiment of a review using the previously trained model.</p><p>The predictor object predicts the sentiment and returns the probability of each label; the label with the highest probability can be chosen as the answer.</p><h3>Conclusion</h3><p>This was a basic guide for learning about PyText and getting started with a basic classifier. I will be writing much more about PyText in the coming weeks, so make sure you follow me to know more about PyText and its applications.</p><p>GitHub Repo: <a href="https://github.com/jayrodge/PyText-Classifier">https://github.com/jayrodge/PyText-Classifier</a></p><p>If you found this helpful, please share it on LinkedIn, Twitter, Facebook or any of your favorite forums.</p><p>Connect with me on <a href="https://linkedin.com/in/jayrodge">LinkedIn</a>, <a href="https://about.me/jayrodge">about.me</a></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=ca7e1c380d5f" width="1" height="1" alt=""><hr><p><a href="https://medium.com/hackernoon/text-document-classification-using-pytext-ca7e1c380d5f">Text/Document Classification using PyText</a> was originally published in <a href="https://medium.com/hackernoon">HackerNoon.com</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Beginners Guide for Data Science]]></title>
            <link>https://medium.com/hackernoon/beginners-guide-for-data-science-388a2ceab93d?source=rss-d4998a3bae22------2</link>
            <guid isPermaLink="false">https://medium.com/p/388a2ceab93d</guid>
            <category><![CDATA[data-science]]></category>
            <category><![CDATA[deep-learning]]></category>
            <category><![CDATA[beginners-guide]]></category>
            <category><![CDATA[artificial-intelligence]]></category>
            <category><![CDATA[machine-learning]]></category>
            <dc:creator><![CDATA[Jay Rodge]]></dc:creator>
            <pubDate>Sat, 05 Jan 2019 17:24:31 GMT</pubDate>
            <atom:updated>2020-02-03T18:58:54.073Z</atom:updated>
            <content:encoded><![CDATA[<h3><strong>Beginner Guide to Data Science</strong></h3><h4>Curated list of Resources for Getting Started in Data Science and Deep Learning in 2020</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*ZHTSc4n0UgOqHnuAk7M6CQ.jpeg" /><figcaption>Photo by <a href="https://unsplash.com/photos/CHt4BMi0-Is?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText">Henri L.</a> on <a href="https://unsplash.com/search/photos/data-analytics?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText">Unsplash</a></figcaption></figure><iframe src="https://cdn.embedly.com/widgets/media.html?src=https%3A%2F%2Fplay.ht%2Fembed%2F%3Farticle_url%3Dhttps%3A%2F%2Fmedium.com%2F_p%2Fbeginners-guide-for-data-science-388a2ceab93d&amp;display_name=Play&amp;url=https%3A%2F%2Fplay.ht%2Farticles%2F388a2ceab93d&amp;image=https%3A%2F%2Fd262ilb51hltx0.cloudfront.net%2Ffit%2Fc%2F300%2F300%2F1%2AuFJpxpmmqUUyDdz5cnAi6g.png&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;type=text%2Fhtml&amp;schema=play" width="700" height="185" frameborder="0" scrolling="no"><a href="https://medium.com/media/b7a89f07b2da68c6e1054374ac30542a/href">https://medium.com/media/b7a89f07b2da68c6e1054374ac30542a/href</a></iframe><p>Data Science is being adopted by almost all companies right now, whether the business is machinery or automobiles.</p><p>According to <a href="https://www.google.com/url?sa=t&amp;rct=j&amp;q=&amp;esrc=s&amp;source=web&amp;cd=6&amp;cad=rja&amp;uact=8&amp;ved=2ahUKEwid2JPPlcXfAhUo9YMKHaIDC6EQFjAFegQICRAB&amp;url=https%3A%2F%2Fwww.forbes.com%2Fsites%2Flouiscolumbus%2F2018%2F01%2F29%2Fdata-scientist-is-the-best-job-in-america-according-glassdoors-2018-rankings%2F&amp;usg=AOvVaw2RDrP8iTYAsmd0V-ITRS0m">Glassdoor</a>, Data Scientist is the best job in America in 2018, with a median base salary of $110,000. However, there is also a huge skill gap in Data Science.</p><p>Becoming a Data Scientist is not that hard given the right amount of time and effort while learning. However, I often find people trying out different courses and resources but still not being able to learn.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*XWCrOtVijQ8gqcEWKuWMBA.jpeg" /><figcaption>Photo by <a href="https://unsplash.com/photos/1K9T5YiZ2WU?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText">Tim Gouw</a> on <a href="https://unsplash.com/search/photos/machine-learning?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText">Unsplash</a></figcaption></figure><p>I have been doing Data Science for 2 years, and I have tried several courses/resources, so here’s a list of recommendations for getting started in:</p><h3>Data Science</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/400/1*4pSHM94n-yOm4ZJ0HZlRXA.jpeg" /></figure><p>There are so many paid as well as free courses/MOOCs for getting started with Data Science that choosing one becomes difficult.</p><p><strong>1.
Applied Data Science with Python</strong></p><p>One of the best resources for getting started in Data Science is “<strong>Applied Data Science with Python</strong>” from Coursera.</p><p><a href="https://www.coursera.org/specializations/data-science-python">Applied Data Science with Python</a></p><p>This is a specialization containing <strong>4 courses</strong>, which starts with Python basics and then the statistics required for Data Science.</p><p>It then covers various visualization techniques using Python libraries like matplotlib, the fundamentals of Machine Learning, and, in the final course, the basics of Natural Language Processing.</p><p>This specialization is free to access if you choose the ‘<em>Audit this course</em>’ option. However, to get a certificate you would have to apply for <em>Financial Aid</em> or pay a $50 subscription fee to Coursera.</p><h3>Machine Learning</h3><p>The ‘<em>Applied Data Science with Python</em>’ specialization from <em>Coursera</em> covers Machine Learning; however, if you want to dive deep into Machine Learning algorithms and the mathematics behind them, there’s another great free resource called <a href="https://course.fast.ai/ml.html">fast.ai</a></p><p><a href="https://course.fast.ai/ml.html">Deep Learning For Coders-36 hours of lessons for free</a></p><p>This course is taught by AI researcher (and ex-President of Kaggle) <a href="https://en.wikipedia.org/wiki/Jeremy_Howard_(entrepreneur)"><strong>Jeremy Howard</strong></a> and is part of the <a href="https://www.usfca.edu/arts-sciences/graduate-programs/data-science">Master of Science in Data Science</a> program at the University of San Francisco.</p><h3><strong>Deep Learning</strong></h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/760/1*nSQ16uPDxDJ__EjwPzha1g.jpeg" /></figure><p>With Deep Learning, too, there are many courses available that teach you to apply deep learning algorithms and get state-of-the-art results within a few lines of
code.</p><p>Applying these algorithms and getting results feels great, but one should also understand how they work instead of treating these algorithms as a black box.</p><ol><li><strong>Deep Learning Specialization</strong></li></ol><p>The specialization is taught by the great <strong>Andrew Ng.</strong></p><p><a href="https://www.coursera.org/specializations/deep-learning">Deep Learning</a></p><p>This course is targeted towards beginners and requires only knowledge of basic Python, linear algebra, and calculus.</p><p>The algorithms are taught from scratch, and it’s a great resource for getting started with Deep Learning.</p><p>There’s a great review of this course by <a href="https://medium.com/u/dbc019e228f5">Daniel Bourke</a> on his YouTube channel; here’s the video if you’d like to watch it:</p><iframe src="https://cdn.embedly.com/widgets/media.html?src=https%3A%2F%2Fwww.youtube.com%2Fembed%2FICMtmjRg0-Y%3Fstart%3D120%26feature%3Doembed%26start%3D120&amp;url=http%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3DICMtmjRg0-Y&amp;image=https%3A%2F%2Fi.ytimg.com%2Fvi%2FICMtmjRg0-Y%2Fhqdefault.jpg&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;type=text%2Fhtml&amp;schema=youtube" width="854" height="480" frameborder="0" scrolling="no"><a href="https://medium.com/media/a20168e38f38781ac4f72933906ebf2e/href">https://medium.com/media/a20168e38f38781ac4f72933906ebf2e/href</a></iframe><p><strong>2. fast.ai</strong></p><p><a href="https://course.fast.ai/index.html">Practical Deep Learning for Coders, v3 | fast.ai course v3</a></p><p>This is taught by <strong>Jeremy Howard</strong> (ex-President of Kaggle).</p><p>This is one of the richest and most comprehensive courses for Deep Learning, covering all aspects of the algorithms.</p><p>This course is taught with the help of the <strong>fastai</strong> library, which is a PyTorch wrapper, and it has a great community that will help you at every roadblock you face!</p><p><strong>3. 
Intro to Deep Learning with PyTorch</strong></p><p><a href="https://www.udacity.com/course/deep-learning-pytorch--ud188">Introduction to PyTorch | Deep Learning | Free Courses | Udacity</a></p><p>Recently, Facebook released the stable version of PyTorch 1.0, which can fully utilize your GPU for training models and works on tensors as its basic data structure.</p><p>Udacity, in partnership with Facebook, launched this free Deep Learning course. It starts from the basics of neural networks and then moves on to implementing various deep learning algorithms using PyTorch.</p><h3><strong>Miscellaneous Data Science Resources:</strong></h3><p>Apart from doing MOOCs, you can stay updated with the latest trends using the following links:</p><p><a href="https://hackernoon.com/data-science/home">Data Science in Tech - Hacker Noon</a></p><p><strong>Youtube</strong>:</p><ul><li><a href="https://www.youtube.com/channel/UCWN3xxRkmTPmbKwht9FuE5A">Siraj Raval</a></li><li><a href="https://www.youtube.com/lexfridman">Lex Fridman</a></li><li><a href="https://www.youtube.com/channel/UCYO_jab_esuFRV4b17AJtAw">3Blue1Brown</a></li><li><a href="https://www.youtube.com/channel/UCr8O8l5cCX85Oem1d18EezQ">Daniel Bourke</a></li></ul><p><strong>Publications:</strong></p><ul><li><a href="https://towardsdatascience.com/">Towards Data Science</a></li><li><a href="https://becominghuman.ai">Becoming Human: Artificial Intelligence Magazine</a></li><li><a href="https://machinelearnings.co">Machine Learnings</a></li></ul><h4>Medium Publications</h4><ul><li><a href="https://medium.com/mlreview">ML Review</a></li><li><a href="https://medium.com/analytics-vidhya">Analytics Vidhya</a></li><li><a href="https://medium.com/applied-data-science">Applied Data Science</a></li></ul><p><strong>Notes:</strong></p><p><a href="https://chrisalbon.com/">Chris Albon</a></p><p>Cheatsheets by <a href="https://medium.com/u/e8ec6fa4d7d4">Favio Vázquez</a>:</p><p><a 
href="https://github.com/FavioVazquez/ds-cheatsheets">GitHub - FavioVazquez/ds-cheatsheets: List of Data Science Cheatsheets to rule the world</a></p><p>Again, these are just recommendations; you can use any resources to get started in this field. And don’t just complete these MOOCs: develop some <strong>real-world projects</strong> to test your skills, using the datasets available on <a href="http://kaggle.com">Kaggle</a> or ones you create yourself, because implementing them will give you an idea of how the tools and libraries work.</p><blockquote>Happy Learning!</blockquote><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=388a2ceab93d" width="1" height="1" alt=""><hr><p><a href="https://medium.com/hackernoon/beginners-guide-for-data-science-388a2ceab93d">Beginners Guide for Data Science</a> was originally published in <a href="https://medium.com/hackernoon">HackerNoon.com</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Binary Face Classifier using PyTorch]]></title>
            <link>https://medium.com/hackernoon/binary-face-classifier-using-pytorch-2d835ccb7816?source=rss-d4998a3bae22------2</link>
            <guid isPermaLink="false">https://medium.com/p/2d835ccb7816</guid>
            <category><![CDATA[classification]]></category>
            <category><![CDATA[pytorch]]></category>
            <category><![CDATA[image-classification]]></category>
            <category><![CDATA[machine-learning]]></category>
            <category><![CDATA[deep-learning]]></category>
            <dc:creator><![CDATA[Jay Rodge]]></dc:creator>
            <pubDate>Mon, 24 Dec 2018 19:39:28 GMT</pubDate>
            <atom:updated>2019-03-14T23:01:36.773Z</atom:updated>
<content:encoded><![CDATA[<h3>Binary Image Classifier using PyTorch</h3><h3>Image classification using PyTorch for dummies</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*hBnlM0161Zyx-akt3uEb8w.png" /><figcaption><a href="https://cdn-images-1.medium.com/max/2000/1*LLVL8xUiUOBE8WHgzAuY-Q.png">Source</a></figcaption></figure><iframe src="https://cdn.embedly.com/widgets/media.html?url=https%3A%2F%2Fplay.ht%2Farticles%2F2d835ccb7816&amp;src=https%3A%2F%2Fplay.ht%2Fembed%2F%3Farticle_url%3Dhttps%3A%2F%2Fmedium.com%2F_p%2Fbinary-face-classifier-using-pytorch-2d835ccb7816&amp;type=text%2Fhtml&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;schema=play" width="700" height="185" frameborder="0" scrolling="no"><a href="https://medium.com/media/0dbfc43f90165e1c7d0527c7709b17f1/href">https://medium.com/media/0dbfc43f90165e1c7d0527c7709b17f1/href</a></iframe><p>Facebook recently released PyTorch 1.0, the first stable version of its deep learning library, which can be used in production-level code.</p><p>I’m a part of Udacity’s PyTorch Scholarship Challenge program and have learned a lot about PyTorch and its functionality. Coming from Keras, PyTorch seems a little different and requires some time to get used to.</p><p>In this article, I’ll guide you through building a binary image classifier from scratch using a Convolutional Neural Network in PyTorch.</p><p>The whole process is divided into the following steps:</p><p>1. Load the data<br>2. Define a Convolutional Neural Network<br>3. Train the model<br>4. Evaluate the performance of the trained model on the test set</p><h3><strong>1. Load the data</strong></h3><p>When it comes to loading and preprocessing data, PyTorch is much simpler than many other libraries. 
In addition, torchvision provides a transforms module with which you can perform all your pre-processing tasks at once, as we’ll see in a moment.</p><p>For the dataset, I couldn’t find one with faces labelled as positives, so I made my own dataset manually, using images from the <a href="http://vis-www.cs.umass.edu/lfw/">LFW Face Dataset </a>as positives and adding some random images as negatives, which include images of vehicles, animals, furniture, etc.<br>If you want, you can download the dataset from here: <a href="https://drive.google.com/file/d/1nt-Orxqh-5b1XwBcU3CHz4i0vEOGwt0J/view?usp=sharing">link</a></p><p>The data needs to be split into train, validation, and test sets before training. The train set will be used to train the model, the validation set to validate the model after each epoch, and the test set to evaluate the model once it is trained.</p><p>First, we need to get the dataset into the environment, which can be done by:<br>(<em>Note: ‘face’ is the name of the directory which contains the positive and negative examples of faces</em>)</p><pre>train_data = datasets.ImageFolder(&#39;face&#39;,transform=transform)</pre><p>We’ll also need to define a transform object to perform the preprocessing steps, specifying in the object what types of processing we need. In the following code, I have defined a transform object which resizes the image, performs a random horizontal flip and a random rotation, converts the image into a PyTorch tensor (since the library only deals with tensors, PyTorch’s analogue of NumPy arrays), and finally normalizes the image.</p><pre>transform = transforms.Compose([<br>    transforms.Resize((224, 224)),  # fixed input size; the CNN below assumes 224x224 images<br>    transforms.RandomHorizontalFlip(),<br>    transforms.RandomRotation(20),<br>    transforms.ToTensor(),<br>    transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))<br>    ])</pre><p>Once we are done loading the dataset and defining the transform object, we can split the dataset into train, validation, and test sets as discussed before. 
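The index bookkeeping behind this two-stage split (the test set is carved out first, then a validation set is taken from the remaining training indices) can be sketched in plain NumPy. This is a standalone illustration with made-up sizes and fractions, not code from the original notebook:

```python
import numpy as np

def split_indices(num_data, test_size=0.2, valid_size=0.2, seed=42):
    """Return disjoint train/valid/test index arrays covering range(num_data)."""
    rng = np.random.default_rng(seed)
    indices = rng.permutation(num_data)

    # carve out the test set first
    split_tt = int(np.floor(test_size * num_data))
    test_idx, train_idx = indices[:split_tt], indices[split_tt:]

    # then take the validation set from what remains
    split_tv = int(np.floor(valid_size * len(train_idx)))
    valid_idx, train_idx = train_idx[:split_tv], train_idx[split_tv:]
    return train_idx, valid_idx, test_idx

train_idx, valid_idx, test_idx = split_indices(1000)
print(len(train_idx), len(valid_idx), len(test_idx))  # 640 160 200
```

Because the validation indices are taken from the shuffled remainder rather than from a fresh `range()`, the three sets are guaranteed to be disjoint.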
For carrying out the splits:</p><pre>import numpy as np<br>from torch.utils.data.sampler import SubsetRandomSampler<br><br>test_size = 0.2   # fraction of the data held out for testing (illustrative value)<br>valid_size = 0.2  # fraction of the remaining data used for validation (illustrative value)<br>batch_size = 20   # illustrative value<br><br>#For test<br>num_data = len(train_data)<br>indices_data = list(range(num_data))<br>np.random.shuffle(indices_data)<br>split_tt = int(np.floor(test_size * num_data))<br>train_idx, test_idx = indices_data[split_tt:], indices_data[:split_tt]</pre><pre>#For valid: split the remaining train indices themselves,<br># so the validation set cannot overlap the test set<br>num_train = len(train_idx)<br>np.random.shuffle(train_idx)<br>split_tv = int(np.floor(valid_size * num_train))<br>train_idx, valid_idx = train_idx[split_tv:], train_idx[:split_tv]</pre><pre># define samplers for obtaining training and validation batches<br>train_sampler = SubsetRandomSampler(train_idx)<br>test_sampler = SubsetRandomSampler(test_idx)<br>valid_sampler = SubsetRandomSampler(valid_idx)</pre><pre># Loaders contain the data in tuple format <br># (image as a tensor, label)<br>train_loader = torch.utils.data.DataLoader(train_data, batch_size=batch_size, sampler=train_sampler, num_workers=1)</pre><pre>valid_loader = torch.utils.data.DataLoader(train_data, batch_size=batch_size, sampler=valid_sampler, num_workers=1)</pre><pre>test_loader = torch.utils.data.DataLoader(train_data, sampler = test_sampler, batch_size=batch_size,num_workers=1)</pre><pre># variable representing classes of the images<br>classes = [0,1]</pre><p>The train_loader, valid_loader and test_loader will be used to pass the input to the model.</p><p>Here are some random images from the dataset after applying the transformations, which include resizing, random rotation, and normalization:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/572/1*xydSVEV9sR4N2Idrhr6T7w.png" /></figure><h3><strong>2. 
Initialising the Convolutional Neural Network (CNN)</strong></h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/300/1*pNxGdMXltxDBi3-j4BTPOw.jpeg" /></figure><p>A convolutional layer in PyTorch is defined in the following way:</p><pre>torch.nn.Conv2d(in_channels, out_channels, kernel_size, stride, padding)</pre><p>The number of input channels is generally 3 for RGB images and 1 for grayscale. The number of output channels (filters) is specified by the user; the filters generally extract low-level features, and the kernel size is the size of the filter that is convolved over the whole image.</p><p>To calculate the spatial dimension of a convolutional layer’s output, the following formula is used:<br><em>output_size = (input_size - kernel_size + 2*padding)/stride + 1</em></p><p>Now it’s time to initialise the model:</p><pre>class Net(nn.Module):<br>    def __init__(self):<br>        super(Net, self).__init__()<br>        # convolutional layer<br>        self.conv1 = nn.Conv2d(3, 16, 5)<br>        # max pooling layer<br>        self.pool = nn.MaxPool2d(2, 2)<br>        self.conv2 = nn.Conv2d(16, 32, 5)<br>        self.dropout = nn.Dropout(0.2)<br>        self.fc1 = nn.Linear(32*53*53, 256)<br>        self.fc2 = nn.Linear(256, 84)<br>        self.fc3 = nn.Linear(84, 2)<br>        self.softmax = nn.LogSoftmax(dim=1)<br>        <br>    def forward(self, x):<br>        # add sequence of convolutional and max pooling layers<br>        x = self.pool(F.relu(self.conv1(x)))<br>        x = self.pool(F.relu(self.conv2(x)))<br>        x = self.dropout(x)<br>        x = x.view(-1, 32 * 53 * 53)<br>        x = F.relu(self.fc1(x))<br>        x = self.dropout(F.relu(self.fc2(x)))<br>        x = self.softmax(self.fc3(x))<br>        return x</pre><pre># create a complete CNN<br>model = Net()</pre><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*x7Ecf5kbqm7GiGnvKHXoKA.png" /><figcaption>Model Architecture</figcaption></figure><p>We’ll also need to initialise our loss 
function and an optimizer. The loss function helps us calculate the loss by comparing the prediction with the original label, and the optimizer minimizes the loss by updating the parameters of the model after every batch. They can be initialised by:</p><pre># Loss function<br># (the model ends in LogSoftmax, so NLLLoss is the matching criterion)<br>criterion = torch.nn.NLLLoss()</pre><pre># Optimizer<br>optimizer = torch.optim.SGD(model.parameters(), lr = 0.003, momentum= 0.9)</pre><h3><strong>3. Train the Model</strong></h3><p>It’s time to train the model!</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/889/1*3tX5wuhfLPeinWsdEemi3Q.jpeg" /></figure><p>Training a model requires us to follow these steps:</p><ol><li><strong>Clear the gradients of all optimized variables:<br></strong>There could be gradients left over from the previous batch, therefore it’s necessary to clear them before every batch</li><li><strong>Forward pass</strong>: <br>This step computes the predicted outputs by passing the inputs to the convolutional neural network model</li><li><strong>Calculate the loss:<br></strong>The loss function compares the predictions with the labels of the current batch; the result is what the optimizer minimizes.</li><li><strong>Backward pass:</strong> <br>This step computes the gradient of the loss with respect to the model parameters</li><li><strong>Optimization</strong><br>This performs a single optimization step / parameter update for the model</li><li><strong>Update the average training loss</strong></li></ol><p>Following is the code for training the model (<em>it’s for a single epoch</em>)</p><iframe src="" width="0" height="0" frameborder="0" scrolling="no"><a href="https://medium.com/media/6a211dd7b2248b1f23adb9ba7bc6f8d9/href">https://medium.com/media/6a211dd7b2248b1f23adb9ba7bc6f8d9/href</a></iframe><h3><strong>4. 
Model Evaluation</strong></h3><p>To evaluate the model, it must be switched from training mode to evaluation mode by calling model.eval()</p><pre>model.eval()<br><br># track the test loss and per-class accuracy<br>test_loss = 0.0<br>class_correct = [0., 0.]<br>class_total = [0., 0.]<br><br># iterate over test data<br>for data, target in test_loader:<br>    # move tensors to GPU if CUDA is available<br>    if train_on_gpu:<br>        data, target = data.cuda(), target.cuda()<br>    # forward pass<br>    output = model(data)<br>    # calculate the batch loss<br>    loss = criterion(output, target)<br>    # update test loss <br>    test_loss += loss.item()*data.size(0)<br>    # convert output probabilities to predicted class<br>    _, pred = torch.max(output, 1)    <br>    # compare predictions to true label<br>    correct_tensor = pred.eq(target.data.view_as(pred))<br>    correct = np.squeeze(correct_tensor.numpy()) if not train_on_gpu else np.squeeze(correct_tensor.cpu().numpy())<br>    # calculate test accuracy for each object class<br>    for i in range(len(target)):  # len(target) handles a smaller final batch<br>        label = target.data[i]<br>        class_correct[label] += correct[i].item()<br>        class_total[label] += 1</pre><pre># average test loss<br>test_loss = test_loss/len(test_loader.dataset)<br>print(&#39;Test Loss: {:.6f}\n&#39;.format(test_loss))</pre><pre>for i in range(2):<br>    if class_total[i] &gt; 0:<br>        print(&#39;Test Accuracy of %5s: %2d%% (%2d/%2d)&#39; % (<br>            classes[i], 100 * class_correct[i] / class_total[i],<br>            np.sum(class_correct[i]), np.sum(class_total[i])))<br>    else:<br>        print(&#39;Test Accuracy of %5s: N/A (no training examples)&#39; % (classes[i]))</pre><pre>print(&#39;\nTest Accuracy (Overall): %2d%% (%2d/%2d)&#39; % (<br>    100. 
* np.sum(class_correct) / np.sum(class_total),<br>    np.sum(class_correct), np.sum(class_total)))</pre><p>After evaluation, we get the following result:</p><pre>Test Loss: 0.006558</pre><pre>Test Accuracy of     0: 99% (805/807) <br>Test Accuracy of     1: 98% (910/921)</pre><pre>Test Accuracy (Overall): 99% (1715/1728)</pre><p>We got this result using only 2 convolutional layers; researchers use much deeper networks, which can extract far more detailed features.</p><p>Since this model has learned to extract facial features, it can be further used for facial recognition: you could train this face classifier on your own images and create a facial recognition system using transfer learning.</p><p>Also, editing a few lines of this code would produce another image classifier, given the right amount of data and labels. The possibilities are limitless; you just need to practice and apply it to any problem you want!</p><blockquote>Happy Learning!</blockquote><p>GitHub Repo: <a href="https://github.com/jayrodge/Binary-Image-Classifier-PyTorch">https://github.com/jayrodge/Binary-Image-Classifier-PyTorch</a></p><p>Let’s connect on <a href="https://linkedin.com/in/jayrodge">LinkedIn!</a></p><p>Learn more <a href="https://about.me/jayrodge">about me</a>.</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=2d835ccb7816" width="1" height="1" alt=""><hr><p><a href="https://medium.com/hackernoon/binary-face-classifier-using-pytorch-2d835ccb7816">Binary Face Classifier using PyTorch</a> was originally published in <a href="https://medium.com/hackernoon">HackerNoon.com</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
    </channel>
</rss>