Stories by Neva Erdogan on Medium

Why Is ChatGPT So Polite If It Learned From Reddit?

Neva Erdogan — Wed, 11 Feb 2026 20:28:46 GMT

If large language models (LLMs) are trained on massive portions of the internet; Reddit threads, comment sections, forums, blogs, documentation, and books, then why don’t they behave like the internet?

Why aren’t they sarcastic, chaotic, impulsive, or outright toxic?

The short answer is exactly what you hinted at:

The internet made LLMs smart. Alignment made them usable.

But the real story is much more technical and much more interesting.

The Two-Stage Myth And the Real Pipeline

People often simplify LLM training into two steps:

Train on the internet
Apply RLHF

This is directionally correct, but technically incomplete.

The real pipeline looks closer to this:

Pretraining → Supervised Fine-Tuning → Reward Modeling → Reinforcement Learning → Evaluation & Iteration

Each stage transforms a raw text predictor into something that behaves like an assistant.

Stage 1 — Pretraining: Teaching a Neural Network Language

At their core, large language models are probability machines.

They don’t “know” facts.
They don’t “understand” meaning.

They learn statistical patterns.

LLMs are trained on enormous text datasets and learn to predict the next token in a sequence.

In other words:

They are not answering questions — they are continuing text.

This distinction is critical.

From a machine learning perspective, pretraining is typically:

Self-supervised learning
Transformer-based architecture
Gradient descent optimization
Massive distributed training

The model approximates the probability distribution of language itself.

The Problem With Raw Pretrained Models

A pretrained model can:

generate grammatical text
imitate styles
write essays
produce code

But it is not inherently helpful.

Without alignment:

It may ignore user intent
Produce irrelevant answers
Mirror biases in training data
Generate unsafe content

Pretrained models can output text that is “technically correct but irrelevant, incoherent, or even unsafe.”

Why?

Because language modeling optimizes for likelihood, not quality.

Truthfulness, usefulness, and safety are not easily encoded as mathematical loss functions.

So researchers needed a way to teach models something abstract:

What does a good answer look like?

Enter alignment.

Stage 2 — Supervised Fine-Tuning (The Hidden Middle Layer)

Before RLHF even begins, most models undergo supervised instruction tuning. Humans write example prompt–response pairs.
The model learns to imitate them.

This step:

teaches formatting
improves instruction following
shapes conversational behavior

Both the reward model and RL policy are usually initialized from this pretrained base.

Think of it as teaching the AI basic manners before judging its performance.

Stage 3 — Reward Modeling: Turning Human Taste Into Math

Now comes the clever trick.

Instead of hand-writing rules for “good behavior,” researchers train a reward model.

Here’s how it works:

Humans compare outputs and rank them. The reward model learns:

Which response would a human prefer?

RLHF leverages these preference signals to align models with human expectations.

This converts subjective human judgment into a differentiable optimization target. A massive conceptual breakthrough.

Because now:

Politeness becomes a gradient.

Stage 4 — Reinforcement Learning (The Alignment Engine)

Reinforcement learning trains the model to maximize reward.

Two models interact:

Policy model → generates responses
Reward model → scores them
Value Model → Estimates expected future rewards for variance reduction

The policy updates itself to produce higher-scoring outputs.

Most modern RLHF pipelines use algorithms like Proximal Policy Optimization (PPO) to stabilize training and prevent the model from changing too drastically.

Why stability matters:

If optimization is unconstrained, models may “game” the reward function and produce nonsense.

PPO clips updates to keep the new policy close to the original.

This is classic reinforcement learning, but applied to language.

RLHF Pipeline Diagram

Why RLHF Exists: Scale Breaks Rule-Based Safety

Modern LLMs have billions of parameters. You cannot hard-code behavior at that scale.

Rule-based filtering cannot capture the nuance of human preference.

RLHF introduces a human-guided reward signal that makes outputs more consistent with real-world expectations.

It essentially transforms a static text generator into an interactive system.

Does RLHF Actually Work?

Empirically ; yes.

Examples:

RLHF improved instruction-following by 17%
Reduced toxic outputs by 48% in one ChatGPT lineage comparison

It can even outperform much larger models without alignment:

Human evaluators preferred a 1.3B parameter RLHF model over outputs from a 175B parameter raw model.

Alignment can matter more than scale. That is a huge shift in AI research.

Mixing Objectives — Avoiding Catastrophic Forgetting

There’s another subtle technical detail many people miss.

During RLHF, models still train partly on original language data to prevent “forgetting” general knowledge.

Otherwise a customer-service tuned model might literally forget geography.

This hybrid objective balances:

human alignment
linguistic competence

An elegant engineering compromise.

The Core Insight: Intelligence ≠ Alignment

Pretraining creates capability.

RLHF creates behavior.

Without alignment, LLMs optimize for engagement patterns found online, not for truth or usefulness.

With alignment, they optimize for human preference.

This is why modern AI feels cooperative rather than chaotic.

But RLHF Isn’t Perfect

There are real technical challenges.

1. Human Bias Transfers Into Models

Human judgments are subjective.

Biases in ratings can propagate into the reward model.

Alignment is not neutrality. It is curated behavior.

2. Reward Models Can Misinterpret Preferences

Incorrect or ambiguous preference data can degrade performance.

Generalization outside the training distribution is also difficult.

Classic ML problem. New domain.

3. Helpfulness vs Harmlessness Is a Tradeoff

Balancing safety with usefulness is an optimization challenge.

Some research frames this as maximizing reward while satisfying cost constraints using methods like the Lagrangian approach.

Alignment is literally a constrained optimization problem.

Beyond RLHF — The Next Wave

The field is already evolving.

Researchers are exploring:

AI-generated critiques
Natural language feedback
Automated reward systems

One study showed iterative critique could boost response win rates to 65.9%.

There is also movement toward reducing human labeling costs via reinforcement-based automation.

The future may involve AI aligning AI.

The Big Misconception: “It Was Trained on Reddit”

Yes : internet-scale corpora are used in pretraining.

But raw data does not define final behavior.

Post-training reshapes the model.

RLHF steers responses toward:

relevance
safety
helpfulness
instruction-following

Think of pretraining as raising a child in the noise of the world. Alignment is education.

The Deeper Philosophical Takeaway

Modern AI development has quietly shifted focus from:

“How do we make models smarter?”

to:

“How do we make models behave?”

Capability scaling was the first revolution.

Alignment is the second.

And arguably the harder one.

Because intelligence is mathematical. Values are not.

Final Thought

So why is ChatGPT polite despite learning from the internet?

Because modern AI is not just trained.

It is curated.

Not just optimized for probability,
but optimized for preference.

The internet gave language models their voice.

RLHF taught them how to speak to humans.

Resources:

Why Your Spotify Discover Weekly Actually Slays (and how it’s not magic)

Neva Erdogan — Tue, 27 Jan 2026 23:13:50 GMT

The Technical Deep Dive

Ever wondered why your streaming app knows you’re in your “sad girl autumn” era before you even do? It’s not a glitch in the simulation, bestie. It’s Recommendation Systems doing the most behind the scenes.

1. Content-Based Filtering: The Solo Stacker

This is the “if you like this, you’ll like that” energy, but powered by heavy NLP. To understand a piece of content, the algo doesn’t just “look” at it; it vectorizes it. We use Count Vectors to map word frequencies, but the real star is TF-IDF (Term Frequency-Inverse Document Frequency).

It calculates a weight for each token:

This ensures that common words (like “the” or “song”) don’t drown out the unique signals (like “hyperpop” or “lo-fi”). Once we have these high-dimensional vectors, we calculate the Cosine Similarity. By finding the dot product of two normalized vectors, the system measures the “vibe distance.” If the score is close to 1, you’re getting that glass-skin content served on a silver platter. No thoughts, just optimized metadata.

2. Collaborative Filtering: The “Mutuals” Method

Then there’s Collaborative Filtering, which is basically the digital version of “I’ll have what she’s having.” We’re moving from item features to a massive User-Item Matrix.

Item-Based: Instead of looking at you, we look at the items’ track records. If a huge cluster of users gave 5 stars to both “Oversized Hoodies” and “Baggy Jeans,” the system calculates the similarity between these two item-vector columns. It’s a more stable approach because item ratings don’t change as fast as human moods.
User-Based: This is finding your “taste twins.” The system searches for users whose rating vectors have the highest correlation with yours. If you and a random person in Berlin both stan the same 5 niche indie artists, the algo performs a weighted average of their other favorites to predict your next obsession. It’s giving soulmate behavior, but it’s actually just Pearson Correlation Coefficient at work.

Your feed isn’t random; it’s an ensemble of distance metrics and sparse matrix operations designed to keep your retention rate at an all-time high. Period. 💅

POV: You’re Finally Learning Data Literacy Because Numbers Don’t Lie (But People Do)

Neva Erdogan — Fri, 23 Jan 2026 19:20:20 GMT

👀POV: You’re Finally Learning Data Literacy Because Numbers Don’t Lie (But People Do)

We are living in an era where everyone is obsessed with "data-driven" decisions, but half the time, people are just throwing charts around to justify whatever they already wanted to do. If you don’t know how to read the room (or the data), you are going to get played.
So I went deep into the technical side of things to gatekeep the truth from the noise. Here is the technical breakdown of data literacy, translated for those of us who want the tea without the academic boredom.

The Main Characters: Population vs. Sample
Think of the Population as the entire fandom. It is every single possible data point in existence for your study. But obviously, you cannot interview every single person on the planet. That is where the Sample comes in. The sample is the specific group chat you actually have access to. If your sample is messy, your insights are going to be delulu. You need a sample that actually represents the population, or else you are just projecting.

The Vibe Check: Central Tendency
When we look at a dataset, the first thing we want to know is "what is the general vibe here?" That is Central Tendency. But we have three different ways to measure it, and they all spill different tea.
Mean (Arithmetic Average): This is the basic average. It is sensitive, though. One massive outlier (like a billionaire walking into a room of students) can skew the whole number and ruin the vibe.
Median: This is the unbothered middle child. It literally sits in the center of the data when sorted. It does not care about that one billionaire outlier. It is usually more honest than the mean.
Mode: The specific value that shows up the most. It is the trendsetter.

The Drama: Dispersion and Spread
Knowing the average is cute, but it doesn’t tell you about the chaos. We need to know how spread out the data is.
Range & Quartiles: Range is just the distance between the best and the worst. Quartiles break the data into four parts so you can see where the top 1% (or top 25%) are actually sitting.
Variance & Standard Deviation: This is where it gets technical. Variance measures the average squared deviation from the mean. Basically, how far is everyone drifting from the center? Standard Deviation is just the square root of variance, bringing it back to the original units. If your standard deviation is high, your data is chaotic and unpredictable. If it is low, everyone is acting the same.

The Aesthetic: Shape of the Data
Not all data looks like a perfect bell curve. Sometimes it has issues.
Skewness : This tells you if the data is leaning too hard to one side. If it is right-skewed, the tail drags out to the right (positive), meaning most people are low, but a few high rollers are stretching the graph.
Kurtosis : This measures the "peak" intensity. High kurtosis (Leptokurtic) means everything is clustered around the center with heavy tails (lots of outliers). Low kurtosis (Platykurtic) means the curve is flat and chill.

The Conclusion
Data literacy isn’t just about making pretty charts using visualization tools. It is about understanding the underlying statistical models, recognizing the variable types (nominal, ordinal, interval, ratio), and knowing when a correlation is actually a coincidence.

Next time someone shows you a "statistic," check their standard deviation and ask about their skewness before you believe the hype.

From Behavioral Segmentation to Value Prediction: Two Customer Analytics Projects

Neva Erdogan — Fri, 31 Oct 2025 21:28:13 GMT

Two projects, 20,000 customers, and a lot of “wait, we can actually predict this?” moments

I recently finished two customer analytics projects that completely changed how I think about customer value. Not in a “wow this is revolutionary” way, but in a “why isn’t everyone doing this already?” way.

The setup: From my dataset; 20,000 OmniChannel customers (people who shop both online and offline) from 2020–2021. The mission: figure out who’s actually valuable and who’s just… there.

Spoiler: Most customers are just there. Here’s how I proved it with data.

Project 1: RFM Analysis (The One Where I Sorted 20,000 People)

What Is RFM?

RFM stands for:

Recency: How recently did they purchase?
Frequency: How often do they purchase?
Monetary: How much do they spend?

It’s literally just scoring customers on these three metrics. No machine learning, no neural networks, just smart scoring.

The Method

I scored each customer 1–5 on each metric using quartiles:

Recency (inverted, lower is better):

Score 5: Bought within last few weeks
Score 1: Hasn’t bought in months

Frequency:

Score 5: Tons of purchases
Score 1: One or two purchases total

Monetary:

Score 5: High spender
Score 1: Minimal spending

Then I combined Recency + Frequency into an RF Score. Someone with score “55” is golden (recent + frequent). Someone with “11” is basically gone.

# The actual scoring code
rfm["recency_score"] = pd.qcut(rfm['recency'], 5, labels=[5, 4, 3, 2, 1])
rfm["frequency_score"] = pd.qcut(rfm['frequency'].rank(method="first"), 5, labels=[1, 2, 3, 4, 5])
rfm["monetary_score"] = pd.qcut(rfm['monetary'], 5, labels=[1, 2, 3, 4, 5])

rfm["RF_SCORE"] = (rfm['recency_score'].astype(str) + 
                    rfm['frequency_score'].astype(str))

The 10 Customer Types That Emerged

Using regex mapping on RF scores, I identified 10 distinct segments:

Champions (RF: 54, 55)

Recent buyers + high frequency
Your brand ambassadors
What I did: Targeted them for premium product launches

Loyal Customers (RF: 34, 35, 44, 45)

Consistent purchase behavior
Solid revenue base
What I did: Loyalty program candidates

Potential Loyalists (RF: 42, 43, 52, 53)

Recent customers with potential
Need nurturing
What I did: Onboarding campaigns

New Customers (RF: 51)

Just showed up, low frequency
Critical window to convert
What I did: Welcome series, incentives

Promising (RF: 41)

Recent but need frequency boost
What I did: Repeat purchase campaigns

Need Attention (RF: 33)

Average across the board
What I did: Re-engagement tactics

About to Sleep (RF: 31, 32)

Declining engagement
What I did: Wake-up campaigns

At Risk (RF: 13, 14, 23, 24)

Used to be good, now declining
What I did: Win-back offers

Can’t Lose Them (RF: 15)

High spenders who stopped buying
Code red territory
What I did: Aggressive retention

Hibernating (RF: 11, 12, 21, 22)

Basically inactive
What I did: Minimal effort or let churn

Real Business Applications

Case 1: New Women’s Shoe Brand Launch

Target: Premium brand, above average price point
Criteria: Champions + Loyal Customers + interested in women’s category
Method: Filtered RFM segments + category interest from last 12 months
Output: Customer ID list for targeted campaign

target_customers_df = merged_df[
    (merged_df['segment'].isin(['champions', 'loyal_customers'])) &
    (merged_df['interested_in_categories_12'].str.contains("KADIN"))
]

Case 2: 40% Discount on Men’s & Kids’ Products

Target: Win back at-risk customers + activate new ones
Criteria: cant_loose + at_risk + about_to_sleep + new_customers
Method: Same filtering with men’s/kids category interest
Output: Different customer ID list for discount campaign

discount_target_df = merged_df[
    (merged_df['segment'].isin(["cant_loose", "at_risk", "about_to_sleep", "new_customers"])) &
    (merged_df['interested_in_categories_12'].str.contains("ERKEK|COCUK"))
]

The point: Instead of blasting everyone with everything, I created precision-targeted lists. No wasted budget.

Project 2: CLTV Prediction (The One Where I Predicted The Future)

Why CLTV Matters

RFM tells you who customers are right now. CLTV (Customer Lifetime Value) predicts who they’ll be in 6 months.

Knowing future value lets you:

Allocate marketing budget intelligently
Identify VIP program candidates
Spot high-value customers before they churn
Stop over-investing in low-value customers

The Data Structure

Before modeling, I created weekly metrics from the raw data:

cltv_df["recency_cltv_weekly"] = round(((df["last_order_date"] - df["first_order_date"]).dt.days) / 7)
cltv_df["T_weekly"] = round(((analysis_date - df["first_order_date"]).dt.days)/7)
cltv_df["frequency"] = df["order_num_total"]
cltv_df["monetary_cltv_avg"] = df["customer_value_total"] / df["order_num_total"]

recency_cltv_weekly: How many weeks between first and last purchase (customer lifecycle) T_weekly: How many weeks old is the customer (tenure) frequency: Total number of purchases monetary_cltv_avg: Average spending per transaction

Model 1: BG-NBD (Beta Geometric/Negative Binomial Distribution)

What it predicts: Number of future transactions

The assumptions:

Each customer has a personal purchase rate (some buy weekly, some monthly)
After each purchase, there’s a probability the customer churns
These rates vary across customers (heterogeneity)

Why BG-NBD specifically:

Handles “buy til you die” behavior
Models both purchase frequency AND dropout probability
Computationally efficient for large datasets

bgf = BetaGeoFitter(penalizer_coef=0.001)
bgf.fit(cltv_df['frequency'],
        cltv_df['recency_cltv_weekly'],
        cltv_df['T_weekly'])

# Predict next 3 and 6 months
cltv_df["exp_sales_3_month"] = bgf.predict(4*3, ...)  # 4 weeks * 3 months
cltv_df["exp_sales_6_month"] = bgf.predict(4*6, ...)  # 4 weeks * 6 months

The penalizer_coef=0.001 prevents overfitting. Lower value = less regularization since the dataset is large.

Model 2: Gamma-Gamma

What it predicts: Average monetary value per transaction

Key assumption:

Monetary value varies randomly around each customer’s average
This variation is independent of purchase frequency
(A frequent buyer might spend little per transaction; an infrequent buyer might spend a lot)

Why Gamma-Gamma:

Designed for positive continuous values (transaction amounts)
Captures customer heterogeneity in spending
Works only with customers who have frequency > 1

ggf = GammaGammaFitter(penalizer_coef=0.01)
ggf.fit(cltv_df['frequency'], cltv_df['monetary_cltv_avg'])

cltv_df["exp_average_value"] = ggf.conditional_expected_average_profit(
    cltv_df['frequency'],
    cltv_df['monetary_cltv_avg']
)

The CLTV Calculation

Combine both models:

cltv = ggf.customer_lifetime_value(
    bgf,
    cltv_df['frequency'],
    cltv_df['recency_cltv_weekly'],
    cltv_df['T_weekly'],
    cltv_df['monetary_cltv_avg'],
    time=6,        # 6 month prediction
    freq="W",      # Weekly frequency
    discount_rate=0.01  # 1% monthly discount
)

Formula essentially: CLTV = (Expected number of transactions in 6 months) × (Expected average transaction value) × (Discount factor)

The discount_rate=0.01 accounts for time value of money (future revenue is worth slightly less than present revenue).

The 4-Tier Segmentation

I segmented customers into quartiles:

cltv_df["cltv_segment"] = pd.qcut(cltv_df["cltv"], 4, labels=["D", "C", "B", "A"])

Segment A: Top 25% by predicted 6-month value
Segment B: Next 25%
Segment C: Next 25%
Segment D: Bottom 25%

Real Application: VIP Program Selection

Instead of guessing who deserves VIP treatment, I used data:

Criteria:

Must be in Segment A (top predicted value)
Frequency above median (actually shops regularly)
Monetary value > 75th percentile (from my data: 182.45)

vip_customers = cltv_df[
    (cltv_df['cltv_segment'] == 'A') &
    (cltv_df['frequency'] > cltv_df['frequency'].median()) &
    (cltv_df['monetary_cltv_avg'] > 182.4500)
]

This created a defensible VIP list. When someone asks “why is this customer VIP?” I have three quantifiable reasons.

Budget Allocation Strategy

Instead of spreading budget equally:

Segment A: 50% of retention budget
Segment B: 30%
Segment C: 15%
Segment D: 5%

Proportional to predicted value. Segment A customers get 10x the attention of Segment D customers because the models say they’re worth it.

Technical Implementation

Core stack:

import pandas as pd
import datetime as dt
from lifetimes import BetaGeoFitter, GammaGammaFitter

Data prep challenges:

Outlier handling (used IQR method at 1st/99th percentiles)
Date formatting (converted all date columns to datetime)
Creating omnichannel metrics (online + offline totals)
Handling customers with frequency = 1 (excluded from Gamma-Gamma)

Model calibration:

BG-NBD penalizer: 0.001 (minimal regularization, large dataset)
Gamma-Gamma penalizer: 0.01 (slight regularization)
Both chosen through experimentation

What I Actually Learned

RFM is underrated. Everyone talks about fancy ML models. But RFM with 10 segments gives you immediately actionable customer groups in like 30 minutes. Sometimes simple wins.

Probabilistic models > guessing. Before: “This customer seems valuable?” After: “This customer has 78% probability of making 3 purchases in 6 months with expected value of $247.”

The lifetimes library is a gift. Implementing BG-NBD and Gamma-Gamma from scratch would take weeks. The library makes it 10 lines of code.

Data prep is 60% of the work. Outliers, date formatting, creating the right metrics; that’s where I spent most time. The modeling itself was fast.

Business context matters. I tried 4-segment vs 7-segment splits. Four was clearer for stakeholders. More segments = more precision but harder to action.

The Output

From RFM:

10 customer segments with clear behavioral patterns
Two targeted customer lists for specific campaigns
Framework for future segmentation

From CLTV:

6-month value predictions for all customers
4-tier segmentation by predicted value
VIP program candidate list with quantifiable criteria
Budget allocation strategy backed by predictions

Both together:

Current behavior (RFM) + future value (CLTV)
Who’s valuable now + who’ll be valuable later
Precision targeting without wasting budget

Why This Matters

Most companies treat customers uniformly. Same email campaigns, same offers, same attention. That’s inefficient.

The customers who just browsed once six months ago? They’re not coming back. Stop emailing them.

The customers who buy monthly and spend above average? They’re your revenue base. Invest accordingly.

The customers scoring high on predicted CLTV? They’re your future. Nurture them now.

It’s not complicated. It’s just math + business logic.

Replication

Both projects are on GitHub with full code and READMEs. Check below.

What you need:

Python (pandas, lifetimes, datetime)
Customer transaction data with dates and amounts
Willingness to actually clean your data (it’s always messy)

What you’ll get:

Customer segments you can immediately action
Value predictions that inform budget allocation
Data-backed answers to “which customers matter?”

Let’s be real: some customers absolutely slay💅(high value, loyal, frequent), and some are just lame 😒(one-and-done, low spending, gone forever). The models separate the two so you know where to invest.

When Algorithms Catch Feelings: Data Science in the Wild World of Finance

Neva Erdogan — Sat, 18 Oct 2025 08:42:21 GMT

💖 When Algorithms Catch Feelings: How Data Science Tries to Decode the Market’s Emotional Chaos

There’s something oddly poetic about the stock market.
It’s human emotion, fear, hope, greed, all turned into numbers that dance across a screen.

And now, those numbers are mostly being watched not by people, but by algorithms.

We built machines that try to understand what moves us.
But the real question is: can they?

The Emotional Side of the Market

If you’ve ever been on finance twitter during a crash, you KNOW. The numbers are just one part of the story. Behind every candlestick chart there’s actual people panicking, celebrating, or straight up having a meltdown.

Markets aren’t logical they’re basically a social experiment where everyone’s emotions have a price tag.

That’s what makes them so fun to analyze though. Every spike is a digital footprint of collective freakout. Remember the GME thing? That was mass psychology in 4K. The silence after a crash? That’s everyone holding their breath, and yeah, the data picks up on ALL of it.

What gets me is that algorithms are actually getting decent at reading these vibes. Sentiment analysis, NLP models, even multimodal stuff now, like how BERT models can detect bullish vs bearish tone on Reddit they can catch shifts in investor mood from tweets or reddit before most traders even notice. It’s like we gave machines intuition, but in Python.

But here’s where it gets messy: emotions don’t follow rules. You can’t model FOMO the way you model inflation. That’s the challenge, trying to quantify something that was never meant to fit in a neat little dataframe.

The Rise of Data-Driven Decisions

Trading floors used to be loud, chaotic, people yelling. now it’s just servers humming in some data center.

But here’s the weird part:

The same algorithms we built to REMOVE human emotion? we’re now training them to DETECT it.

Like… we’re teaching machines to sense fear and optimism and confidence. Things we barely understand about ourselves.

It’s cool but also kind of dystopian? Because at the end of the day, these systems are just mirrors. They reflect our habits, our biases, our desperate belief that we can predict the unpredictable.

When Models Fail (and Why That’s Beautiful)

Let’s be real no matter how much data you feed them, models fail. Spectacularly sometimes. One unexpected tweet, one global event, and boom; your perfectly tuned regression model suddenly looks like it learned nothing.

But maybe that’s not a flaw. Maybe it’s the point.

Finance refuses to be predicted. every time a model breaks, it’s reminding us that the world still has variables we can’t capture gut feelings, random rumors, mass panic, hope. and that’s kinda reassuring in a weird way.

In a world of algorithmic trading and optimized everything, failure proves that chaos still exists. that humanity still matters.

What makes models interesting isn’t when they work perfectly it’s when they break and you have to figure out WHY. That’s where you learn about bias, about edge cases, about how our systems are just us trying to make sense of something that doesn’t always want to make sense.

So when a model crashes? Yeah it sucks, but it’s also a reminder that even with all this AI, uncertainty still wins. And honestly that’s kind of beautiful.

Where It’s All Going

Finance and data science are blending into one another ,not competing, just evolving together.
Markets are becoming emotional ecosystems where data tells stories and algorithms try to understand them.

We’re moving toward a world where every decision, from billion-dollar trades to your phone’s investment suggestion, will be shaped by models that learn, adapt, and maybe even anticipate our emotions. And that’s both exciting and humbling.

Because for all the sophistication of our algorithms, what truly drives the market is still us, the humans feeding data into the machine, reacting to trends, panicking, celebrating, hoping. The algorithms might be learning fast, but they’re still learning from us.

So, maybe the future of finance isn’t about removing emotion from data, it’s about understanding how emotion creates data. Maybe data science won’t just predict the market, but help us see ourselves a little more clearly in it.

And if algorithms are starting to catch feelings… maybe that’s not such a bad thing after all.

✨ Author’s Note
Lowkey, the market is chaotic and we’re all just trying to vibe with it.
I hope this made you see the market a little differently and maybe feel a bit of empathy for the bots just trying to keep up with us.