<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:cc="http://cyber.law.harvard.edu/rss/creativeCommonsRssModule.html">
    <channel>
        <title><![CDATA[Dailymotion - Medium]]></title>
        <description><![CDATA[The home for videos that matter - Medium]]></description>
        <link>https://medium.com/dailymotion?source=rss----5ab3bf42288e---4</link>
        <image>
            <url>https://cdn-images-1.medium.com/proxy/1*TGH72Nnw24QL3iV9IOm4VA.png</url>
            <title>Dailymotion - Medium</title>
            <link>https://medium.com/dailymotion?source=rss----5ab3bf42288e---4</link>
        </image>
        <generator>Medium</generator>
        <lastBuildDate>Sun, 12 Apr 2026 17:45:26 GMT</lastBuildDate>
        <atom:link href="https://medium.com/feed/dailymotion" rel="self" type="application/rss+xml"/>
        <webMaster><![CDATA[yourfriends@medium.com]]></webMaster>
        <atom:link href="http://medium.superfeedr.com" rel="hub"/>
        <item>
            <title><![CDATA[Reinvent your recommender system using Vector Database and Opinion Mining]]></title>
            <link>https://medium.com/dailymotion/reinvent-your-recommender-system-using-vector-database-and-opinion-mining-a4fadf97d020?source=rss----5ab3bf42288e---4</link>
            <guid isPermaLink="false">https://medium.com/p/a4fadf97d020</guid>
            <category><![CDATA[machine-learning]]></category>
            <category><![CDATA[ai-data]]></category>
            <category><![CDATA[data]]></category>
            <category><![CDATA[dailymotion]]></category>
            <category><![CDATA[recommendation-system]]></category>
            <dc:creator><![CDATA[Samuel Leonardo Gracio]]></dc:creator>
            <pubDate>Thu, 21 Sep 2023 12:25:33 GMT</pubDate>
            <atom:updated>2023-09-21T12:25:33.543Z</atom:updated>
            <content:encoded><![CDATA[<p>With its new positioning, Dailymotion wants to give its users the possibility to get out of their filter bubble. The new home feed is designed to allow everyone to debate and confront their opinions.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*KVlepexPBPXqUbNg0rKlMA.png" /></figure><h3>One feed, several opinions</h3><p>Dailymotion mobile app offers a single feed with vertical videos from a <strong>broad list of content creators</strong>. The aim is to offer diversified and stimulating content that adapts to the user’s desires, while allowing them to <strong>challenge or express their opinions</strong>.</p><blockquote><strong>Download the app on </strong><a href="https://play.google.com/store/apps/details?id=com.dailymotion.dailymotion"><strong>Google Play</strong></a><strong> or </strong><a href="https://apps.apple.com/fr/app/dailymotion/id336978041"><strong>App Store</strong></a><strong> and try it yourself!</strong></blockquote><p>Significant work has already been done to build this new Homefeed using an architecture inspired by the <a href="https://en.wikipedia.org/wiki/Multi-armed_bandit"><strong>multi-armed bandit</strong></a> and <a href="https://medium.com/mlearning-ai/building-a-multi-stage-recommendation-system-part-1-1-95961ccf3dd8"><strong>multi-stage recommender system</strong></a>. You can learn more about our very first steps in building this model <a href="https://medium.com/dailymotion/optimizing-video-feed-recommendations-with-diversity-machine-learning-first-steps-4cf9abdbbffd">here</a>.</p><p>This article presents a newly created <strong>recommender system</strong> built on top of this existing architecture. 
The goal of this new feature is to provide our users with a <strong>different point of view on videos or topics they have already engaged with</strong>.</p><h3>Overview of the Home Feed architecture</h3><p>In order to understand this new recommender system, a very quick explanation of the <strong>Home Feed architecture</strong> may be necessary.</p><p>If we simplify the architecture, the algorithm responsible for recommending videos within our home feed can be considered as a <strong>combination of different small recommender systems</strong>, each with its own behavior: one to recommend videos based on their <strong>performance metrics</strong> (watch time, number of views, freshness…), another to recommend videos based on the <strong>user’s history</strong> or <strong>subscribed channels.</strong></p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*iO3McKHNZvNwD6jQVth6Eg.png" /><figcaption>Quick overview of the Homefeed recommender system</figcaption></figure><p>This article will focus on our new “opinion-based” recommender, a <strong>more original recommender system</strong> whose goal is to recommend <strong>personalized content</strong> while also attempting to <strong>challenge the user’s opinions</strong>.</p><blockquote>As of writing, the recommendations provided by this new recommender system are available through a “Show me a different POV” button on each eligible video. The feature will evolve in the coming months.</blockquote><h3>Bring more perspective</h3><p>Before going into detail on every aspect of this recommender system, it might be useful to describe its <strong>overall approach</strong>.</p><p>First of all, the new recommender system is <strong>designed at a video level</strong>. 
In fact, for each video on a user’s home feed, the first step of the “opinion-based” recommender (or “Perspective BETA” feature) is to <strong>find similar videos</strong>, i.e., videos talking about the same topic as the input video.</p><blockquote>This brings up an important aspect of this new feature: <strong>not all videos are eligible</strong>. For example, some videos may be too “<strong>niche</strong>” in their subject matter, making it impossible to find similar videos.</blockquote><p>The second and undoubtedly most important aspect of this recommender system is to <strong>rank these videos according to the opinion they express</strong>.</p><p>Finally, when possible, we obtain a list of similar videos ranked by the intensity of the opinion they express.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*9vpSgZg9tgX7uFO35ZGzLg.gif" /><figcaption>Example of the feature powered by the new “opinion-based” recommender.</figcaption></figure><p>This quick explanation of our new <strong>“opinion-based” recommender system</strong> makes it possible to divide it into two main steps:</p><ul><li><strong>Candidate Generation:</strong> finds videos on the same subjects as those the user has already enjoyed.</li><li><strong>Re-ranking:</strong> orders the resulting videos, prioritizing those with a strong and/or different opinion.</li></ul><p>The following diagram details the subparts of this <strong>2-step architecture</strong>:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*D8YhtXwSIWQNROOs9J-CnA.png" /><figcaption>The global architecture of the new opinion-based recommender system.</figcaption></figure><h3>Candidate Generation</h3><p>Dailymotion’s catalog contains several <strong>hundred million videos</strong>. 
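Stepping back for a moment, the overall two-step flow can be sketched in a few lines of Python. This is only an illustrative sketch with made-up video records and opinion scores, not the production code; the real candidate generation and opinion scoring are detailed in the sections that follow.

```python
# Minimal sketch of the two-step "opinion-based" recommender.
# The video records and opinion scores below are illustrative stand-ins:
# the real system retrieves candidates from a vector database and
# scores opinions with an LLM.

def generate_candidates(input_topic, catalog, n=3):
    """Step 1: keep videos on the same topic as the input video."""
    return [v for v in catalog if v["topic"] == input_topic][:n]

def re_rank(candidates):
    """Step 2: strongest opinions (highest absolute score) first."""
    return sorted(candidates, key=lambda v: abs(v["opinion"]), reverse=True)

catalog = [
    {"id": "a", "topic": "soccer", "opinion": 0.1},
    {"id": "b", "topic": "soccer", "opinion": -0.9},
    {"id": "c", "topic": "cooking", "opinion": 0.7},
    {"id": "d", "topic": "soccer", "opinion": 0.5},
]

recommendations = re_rank(generate_candidates("soccer", catalog))
print([v["id"] for v in recommendations])  # strongest opinions first
```

The rest of the article unpacks what each of the two functions hides in practice.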
The goal of the candidate generator is to transform this video catalog into a short list of <strong>dozens of relevant videos</strong> for a user.</p><p>The first filter is done using simple <strong>heuristics</strong>: for instance, for French users, we only want to recommend recent content, in French, from content creators and traditional media. By doing this, we can already narrow down the list to a few <strong>hundred thousand videos</strong>.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*s0HedEpWAqrfMZOO-cgtUQ.png" /></figure><p>Nevertheless, we still need to reduce the number of videos while also finding videos that are <strong>similar to the ones the user has already watched</strong>. To achieve this, we need to <strong>represent each</strong> <strong>video as a vector</strong>.</p><h4>Textual embeddings to represent videos</h4><p>In order to <strong>embed a video</strong> object, i.e., represent a video as a vector with real numbers in a continuous vector space, several options are available: Frame-Level embedding, Multi-modal embedding… All these options have an important drawback: <strong>they are computationally heavy</strong>.</p><p>Fortunately, each video uploaded to Dailymotion is also associated with <strong>several pieces of textual metadata</strong>: a title, description, and some tags. Transforming <strong>textual information</strong> into a vector is both <strong>simpler and cheaper</strong>. To do that, we use the <a href="https://tfhub.dev/google/universal-sentence-encoder-multilingual/3"><strong>Multilingual Universal Sentence Encoder (MUSE)</strong></a>, a <strong>pre-trained</strong> and <strong>open-source multilingual sentence embedding model</strong> handling <strong>16 languages</strong>, including French.</p><p>That said, the video metadata can sometimes be <strong>too limited</strong>. Some videos may have textual metadata that <strong>does not showcase the real content of the video</strong>. 
For example, a video where the only information available is a title “My Daily VLOG” and no description at all may <strong>not provide enough textual information</strong>.</p><p>However, a solution to this lack of textual information can be found. Recent advances in speech-to-text models have led to the emergence of a <strong>pre-trained</strong> and <strong>open-source</strong> solution that makes it easy to obtain a <strong>transcript</strong> of a video: <a href="https://github.com/openai/whisper"><strong>Whisper</strong></a>.</p><h4><strong>The Machine Whisperer</strong></h4><p><a href="https://github.com/openai/whisper">Whisper</a> is an open-source speech recognition model developed by OpenAI capable of providing <strong>automatic video subtitles</strong>. Based on a Transformer sequence-to-sequence model, Whisper has been trained with 680k hours of training data in several languages and with different audio quality, making it <strong>robust for all types of audio soundtracks</strong>.</p><p>Subtitles are an essential feature to make our mobile app accessible to everyone. 
Moreover, they can also be <strong>used as transcripts</strong> of our videos, i.e., a written record of the spoken content in a video.</p><p>These <strong>auto-generated transcripts</strong>, typically of very good quality, allow us to extract <strong>much more textual information than the video title or description</strong>.</p><p>By using Whisper to obtain the transcript and MUSE to embed the text contained in this transcript, we can now obtain a <strong>complete representation of the actual video content </strong>using only<strong> textual metadata</strong>.</p><p>The following diagram describes how this pipeline works but also introduces a new element in the overall architecture of our recommender: <a href="https://qdrant.tech/"><strong>Qdrant</strong></a>.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*DudWZpAVBj-f_25bW4QM9A.png" /><figcaption>Representation of transforming a video into a vector by using its transcript.</figcaption></figure><h4>Qdrant: build a K-NN</h4><p><a href="https://qdrant.tech/">Qdrant</a> is an <strong>open-source vector search database</strong>, designed to efficiently handle high-dimensional vector data; it enables retrieval of similar vectors based on their <strong>cosine similarity scores</strong>. 
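To make the retrieval step concrete, here is a brute-force sketch of a cosine-similarity K-NN over toy 3-dimensional embeddings (real sentence embeddings are much higher-dimensional); this is the query Qdrant answers approximately, and fast, at catalog scale:

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def knn(query, vectors, k=2):
    """Brute-force K-NN: a vector database answers this approximately."""
    ranked = sorted(vectors, key=lambda vid: cosine_similarity(query, vectors[vid]),
                    reverse=True)
    return ranked[:k]

# Toy 3-d embeddings; video_a and video_b point in a similar direction.
embeddings = {
    "video_a": [0.9, 0.1, 0.0],
    "video_b": [0.8, 0.2, 0.1],
    "video_c": [0.0, 0.1, 0.9],
}

print(knn([1.0, 0.0, 0.0], embeddings))  # the two most similar videos
```

The brute-force scan is O(catalog size) per query, which is why an approximate index is needed in production.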
It is based on an algorithm called <strong>Hierarchical Navigable Small World (HNSW)</strong>, an <strong>approximate K-NN algorithm</strong> with a very short response time (≈20ms).</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*aQyFzZQzMasUs2YtPnU0OQ.png" /><figcaption>Cosine similarity between two embeddings, each one representing a video.</figcaption></figure><p>A Qdrant database can easily store hundreds of thousands of embeddings, along with various associated metadata such as the video’s language, creation date, or other useful information if we wish to filter certain videos when querying the server.</p><p>Qdrant is the final <strong>step of the candidate generator</strong>: it enables quick <strong>retrieval of similar videos</strong> from any video liked by a user on the home feed. For instance, for any video in the Qdrant database, we can use its embedding to retrieve N similar videos using its approximate K-NN functionality and cosine similarity.</p><p>Nevertheless, even if some of these N closest videos contain different points of view, we still need to <strong>re-rank</strong> the output of this approximate K-NN in order to <strong>give priority to videos with a strong or different opinion</strong>.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*nxWbxnF63gqFM06N6l-x2Q.png" /><figcaption>Overview of the Candidate Generation step.</figcaption></figure><h3>Re-ranking: express your opinion</h3><p>Now that we have our candidate videos, i.e., the N closest videos to an input video, we can <strong>re-rank</strong> this short list of videos in order to push content that will <strong>challenge the opinion of the user</strong>.</p><p><strong>Opinion mining</strong>, also known as <strong>sentiment analysis</strong>, is a subfield of natural language processing (NLP) and machine learning that aims to determine the sentiment or emotional tone expressed in a piece of text.</p><p>With the recent dominance 
of <strong>Large Language Models (LLMs)</strong> like GPT (OpenAI), LLaMA (Meta) or PaLM (Google), sentiment analysis has also received a significant boost. These complex models are also capable of analyzing text and assigning it an <strong>opinion score</strong>.</p><p>In our case, we send each video transcript to Google’s PaLM API. The model <strong>predicts a score</strong>, ranging from -1 to 1, which reflects the video’s opinion on the subject it addresses. The higher the score in absolute value, the stronger the opinion. A score of zero means that the video is neutral and does not express any particular opinion.</p><blockquote>In this re-ranking stage, we also use other video metadata such as the aspect ratio of the video or the freshness to build our final ranking.</blockquote><p>This gives us the final output of this new recommender system: for each video liked by a user, we can now retrieve <strong>similar videos ranked in terms of opinion intensity</strong>.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*f8A2Vyi1U_nAy9ZR3oKqdA.png" /><figcaption>Re-ranking step: retrieve opinion score using PaLM and re-rank using global score</figcaption></figure><h3><strong>Create your own opinion</strong></h3><p>We are proud to have presented a more original recommender system, not only personalizing content for users using a content-based approach but also challenging their opinions by re-ranking the result using sentiment analysis.</p><p>The two main subparts of this opinion-based recommender system, namely <strong>Candidate Generation</strong> and <strong>Re-ranking</strong>, play crucial roles in building this key feature for Dailymotion.</p><blockquote><em>Would you like to be part of upcoming meaningful Data and AI products at Dailymotion? 
Check our </em><a href="https://careers.dailymotion.com/"><em>open positions</em></a><em>.</em></blockquote><hr><p><a href="https://medium.com/dailymotion/reinvent-your-recommender-system-using-vector-database-and-opinion-mining-a4fadf97d020">Reinvent your recommender system using Vector Database and Opinion Mining</a> was originally published in <a href="https://medium.com/dailymotion">Dailymotion</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Optimizing video feed recommendations with diversity: Machine Learning first steps]]></title>
            <link>https://medium.com/dailymotion/optimizing-video-feed-recommendations-with-diversity-machine-learning-first-steps-4cf9abdbbffd?source=rss----5ab3bf42288e---4</link>
            <guid isPermaLink="false">https://medium.com/p/4cf9abdbbffd</guid>
            <category><![CDATA[video-feed]]></category>
            <category><![CDATA[machine-learning]]></category>
            <category><![CDATA[online-video-platform]]></category>
            <dc:creator><![CDATA[Samuel Leonardo Gracio]]></dc:creator>
            <pubDate>Thu, 06 Oct 2022 14:56:41 GMT</pubDate>
            <atom:updated>2022-10-06T14:58:36.067Z</atom:updated>
            <content:encoded><![CDATA[<h4>As part of its new mobile experience, Dailymotion redesigned its video recommendation system.</h4><p><a href="https://www.dailymotion.com/"><strong>Dailymotion</strong></a><strong> hosts more videos than you could watch in several lifetimes, and more are added as we speak. Among all these choices, how can you find the best ones for you? That’s the job of a recommender, which automatically presents videos curated to your interests.</strong></p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*KFI84T4psBbCBoNfGvlaLQ.png" /></figure><h4>Why did we rethink the mobile video recommender?</h4><p>Use cases drive the existing recommenders on the dailymotion.com website, as most Web users are anonymous. Therefore, the priority is to suggest videos to watch after the current one.</p><p>On the native applications, <strong>all users are required to log in to use the app, providing an opportunity</strong> to <strong>display personalized content</strong>. Mobile app usage patterns are also different. For instance, users may like to watch short videos on the Dailymotion app while commuting.</p><h4>How to deliver in a short time?</h4><p>At that time, the application’s user interface was undergoing a major revamp, and we were changing several underlying APIs and event stores. Crafting something light allowed us to adapt quickly during the journey.</p><p>That’s why we decided to <strong>keep it simple and iterate fast</strong>. Deliver the most needed features first, observe user behavior changes, then repeat. Keeping the solution simple and iterating quickly led to remarkable results in a short time.</p><h4>The ideal new home feed experience</h4><p>The home feed is the first screen you see after launching the Dailymotion iOS or Android app. 
We expect users to “snack” videos by watching them directly on the home feed.</p><p>Together with the product team, we formulated the ideal recommender:</p><ul><li><strong>Relevant &amp; diverse: </strong>videos come from all the interests of the user.</li><li><strong>Variability of reward: </strong>different results at every app start and refresh of the home feed.</li><li><strong>Explainable: </strong>recommendations based on transparent features.</li><li><strong>Iterable: </strong>this first version should be easy to tune and extend.</li></ul><p>Users are more likely to interact with the first results, so we present the most relevant ones at the top. Moreover, the recommender should <strong>ship quickly</strong> to receive rapid feedback.</p><h4>Kickstarting the recommender</h4><p>The performance of a recommender is usually evaluated on previously collected data. In this case, there were no appropriate historical user interactions to use.</p><p>We built a user interface in a notebook using <a href="https://ipywidgets.readthedocs.io/">Jupyter Widgets</a> to inspect the results while constructing the recommender.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*lpAaWBX_aojszXWkSqRGbg.png" /><figcaption>Experimental user interface to recommend videos in a Jupyter notebook</figcaption></figure><p>After each algorithm change, we could immediately see videos like a real user would and evaluate their relevance qualitatively.</p><p>Thanks to this, we could conveniently demo an early version to the stakeholders and collect their feedback to drive future improvements.</p><h4>The main components of Dailymotion’s recommender</h4><p>The new recommender is made of two main parts:</p><p><strong>Candidates: </strong>retrieve all videos which can be recommended.</p><p><strong>Ranking: </strong>produce an ordered list of videos for a given user.</p><p>This is inspired by the two-stage approach used for YouTube recommendations. 
First, data analysts defined how to select candidates from quality partners. Then, Machine Learning work focused on picking the right videos and assembling a list.</p><h4>An octopus of rankers</h4><p>For the ranking part, the architecture builds on an extensible Ranker component with the following contract:</p><p><a href="https://medium.com/media/1bc077c76518e2aae9bcce1ba7b51f82/href">https://medium.com/media/1bc077c76518e2aae9bcce1ba7b51f82/href</a></p><p>We coded several complementary rankers based on this simple Python class. Our final recommender is an<strong> ensemble of multiple rankers </strong>respecting the diversity of the user’s interests.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*sfJpP1bq8qV8on-Y6f7_Vg.png" /><figcaption>Components of the video recommender</figcaption></figure><h4>Fresh and best-performing videos</h4><p>The Freshness &amp; performance ranker selects videos by using several features:</p><ul><li><strong>Freshness</strong>: users like watching recent videos, e.g., related to news headlines.</li><li><strong>Real views ratio</strong>: number of views lasting 10+ seconds divided by the total views count. This filters out accidental clicks and some clickbait videos.</li><li><strong>Watch ratio</strong>: time watched divided by the video duration. Videos with a high ratio tend to be more pertinent.</li><li><strong>Aspect ratio</strong>: prefer square and vertical image formats, which fit better on a mobile screen.</li></ul><p>The performance ranker computes a <strong>weighted sum of these features</strong>. 
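As an illustration, such a weighted-sum ranker could look like the following minimal sketch; the feature names and coefficients here are hypothetical, not the production contract or tuning.

```python
# Illustrative weighted-sum performance ranker.
# Feature names and coefficients are hypothetical; each feature is
# assumed to be normalized to [0, 1] beforehand.

WEIGHTS = {
    "freshness": 0.4,          # emphasized, so new videos get a chance
    "real_views_ratio": 0.2,
    "watch_ratio": 0.3,
    "aspect_ratio": 0.1,
}

def score(video):
    """Weighted sum of the video's normalized features."""
    return sum(WEIGHTS[name] * video[name] for name in WEIGHTS)

def rank(videos):
    """Return videos ordered by descending score."""
    return sorted(videos, key=score, reverse=True)

videos = [
    {"id": "old_hit", "freshness": 0.1, "real_views_ratio": 0.9,
     "watch_ratio": 0.8, "aspect_ratio": 1.0},
    {"id": "fresh_news", "freshness": 1.0, "real_views_ratio": 0.6,
     "watch_ratio": 0.5, "aspect_ratio": 1.0},
]

print([v["id"] for v in rank(videos)])  # freshness weight favors the news video
```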
Coefficients are tuned by emphasizing freshness to keep learning how new videos can perform when presented to actual users.</p><h4>Surface new content</h4><p>Ranking videos only according to their performance leads to users always seeing the same content over and over.</p><p>To <strong>break this bubble</strong> and robustly learn how a video performs, we must give it a chance by showing it to a sample of people.</p><p>The <strong>Exploration ranker</strong> randomly pulls videos within the top 100 of the Freshness &amp; performance and Featured videos rankers results. These videos have an equal chance of selection because all are relevant. This helps surface unseen content from the bottom of the list.</p><h4>Always something new to watch for the users</h4><p>One essential requirement is to <strong>show something new each time</strong> a user opens the app or refreshes the home feed. <strong>Discovering new videos</strong> is what ensures users <strong>keep coming back</strong> to Dailymotion.</p><p>The videos presented are different each time. The ones from the rankers are combined into a single list. Each ranker is assigned a probability to balance exploration versus historical performance.</p><h4>“Show me videos from my interests”</h4><p>New users indicate their interests when setting up their accounts as part of their <strong>onboarding process</strong>. Learning if they like Movies, Tech or Politics avoids the <a href="https://en.wikipedia.org/wiki/Cold_start_%28recommender_systems%29">cold start</a> problem.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/291/1*3CC5TBUfRpNLQp7_7JYapw.png" /><figcaption>Selection of user’s interests</figcaption></figure><p>However, <strong>not all interests have the same volume of uploaded videos</strong>. 
For instance, news and political videos are more frequent than cooking recipe videos.</p><p>We devised solutions to prevent some interests from filling up the recommendation slots and ensure we show videos from all the interests of a user.</p><p>In the candidate selection phase, the recommender groups videos by interest. Then, it selects a roughly equal number of videos from each interest. This method, named <a href="https://en.wikipedia.org/wiki/Stratified_sampling"><strong>stratified sampling</strong></a>, helps us deliver a balanced experience.</p><p>Moreover, <strong>to respect the diversity</strong> of a user’s interests in the final video list, we re-order it to alternate different topics:</p><ol><li>Compute a similarity score for each pair of videos based on their attributes.</li><li>Add the first video to the final list.</li><li>Add the next video that is least similar to those already added to the list.</li><li>Repeat the previous step until the list is complete.</li></ol><h4>Right content = happy users</h4><p>Imagine for a second that we choose to optimize for money first. Placing advertisements on videos is a significant way to generate revenue. It could be tempting to promote the most-clicked videos to display more ads. But that would often result in low-quality content (clickbait) and a bad user experience.</p><p>A healthier goal is to help users find <strong>exciting content every time</strong>.</p><p>We optimized watch time per user on the home feed, a measure for <strong>engaging content</strong>. 
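The four diversity re-ordering steps above amount to a greedy loop; here is a minimal sketch, assuming a toy attribute-based similarity function standing in for the real one.

```python
# Greedy diversity re-ordering: repeatedly append the video that is
# least similar to those already picked. The similarity function is a
# toy stand-in for the real attribute-based score.

def similarity(a, b):
    """Toy attribute similarity: 1.0 if same topic, else 0.0."""
    return 1.0 if a["topic"] == b["topic"] else 0.0

def diversify(videos):
    remaining = list(videos)
    ordered = [remaining.pop(0)]   # step 2: take the first video
    while remaining:               # steps 3-4: repeat until complete
        nxt = min(remaining,
                  key=lambda v: max(similarity(v, p) for p in ordered))
        remaining.remove(nxt)
        ordered.append(nxt)
    return ordered

videos = [
    {"id": 1, "topic": "news"},
    {"id": 2, "topic": "news"},
    {"id": 3, "topic": "cooking"},
]

print([v["id"] for v in diversify(videos)])  # topics alternate: [1, 3, 2]
```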
In this case, we use the median watch time per user, which has many advantages:</p><blockquote><strong><em>Robust to outliers:</em></strong><em> the median roughly ignores exceptionally long watch times.</em></blockquote><blockquote><strong><em>Robust to clickbait videos:</em></strong><em> these videos have a short watch time; users often click on the video and then stop.</em></blockquote><blockquote><strong><em>More retention and better experience: </em></strong><em>videos with a good watch time have the highest entertainment potential.</em></blockquote><blockquote><strong><em>Download the app on </em></strong><a href="https://play.google.com/store/apps/details?id=com.dailymotion.dailymotion"><strong><em>Google Play</em></strong></a><strong><em> or </em></strong><a href="https://apps.apple.com/fr/app/dailymotion/id336978041"><strong><em>App Store</em></strong></a><strong><em> and try it yourself! (France only as of writing)</em></strong></blockquote><h4>This is only the beginning. What’s next for you?</h4><p>Our journey is far from over, and we have already identified several improvements:</p><blockquote><strong><em>More personalization:</em></strong><em> we based recommendations on user interests in this first release. The following version will integrate more users’ actions: watch history, likes, channels’ subscriptions…</em></blockquote><blockquote><strong><em>Present even more relevant videos</em></strong><em> to first-time users who had no previous interactions.</em></blockquote><blockquote>Would you like to be part of upcoming meaningful Data and AI products at Dailymotion? Check our <a href="https://careers.dailymotion.com/">open positions</a>.</blockquote><p>The video recommender was developed by Denis Angilella and Samuel Leonardo Gracio, Machine Learning engineers at Dailymotion, in collaboration with the Product teams. 
They also co-authored this article.</p><hr><p><a href="https://medium.com/dailymotion/optimizing-video-feed-recommendations-with-diversity-machine-learning-first-steps-4cf9abdbbffd">Optimizing video feed recommendations with diversity: Machine Learning first steps</a> was originally published in <a href="https://medium.com/dailymotion">Dailymotion</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[How Dailymotion committed to closing the gender pay gap in two years]]></title>
            <link>https://medium.com/dailymotion/how-dailymotion-committed-to-closing-the-gender-pay-gap-in-two-years-447bf1fa4c0d?source=rss----5ab3bf42288e---4</link>
            <guid isPermaLink="false">https://medium.com/p/447bf1fa4c0d</guid>
            <category><![CDATA[people]]></category>
            <category><![CDATA[women]]></category>
            <category><![CDATA[gender-equality]]></category>
            <category><![CDATA[gender-gap]]></category>
            <category><![CDATA[women-in-tech]]></category>
            <dc:creator><![CDATA[Karine Aubry]]></dc:creator>
            <pubDate>Mon, 08 Mar 2021 09:06:05 GMT</pubDate>
            <atom:updated>2021-03-08T09:06:05.770Z</atom:updated>
            <content:encoded><![CDATA[<h4>An International Women’s Day special editorial by Karine Aubry, Chief People Officer at Dailymotion</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*gZNGiEe5iWEFMTWmCvUZPw.jpeg" /></figure><p><strong>At the end of 2019, we shared an ambitious goal with our teams, fully aligned with our values: to close the gender pay gap between women and men over the next two years. Nothing to brag about, simply paying employees fairly regardless of gender — equal pay for equal work. How do we end up with gender pay gaps? How to measure these gaps? How to fix this? How to build a solid system to maintain equity once fixed? We thought it might be worth sharing our journey, our wins, mistakes, and takeaways.</strong></p><p>Dailymotion is the leading global video destination that connects over 350 million consumers to their daily fix of the most compelling content, with offices in Paris, Montpellier, New York, Singapore, and Seoul. So we operate in a highly competitive global tech industry, fighting an intense talent war. Over time, gaps were created, due to hiring negotiations, internal promotions, the market value of specific skills at a given time and, let’s be honest, due to a lack of process to anticipate the gaps rather than acknowledging them when it’s already too late.</p><p><a href="https://medium.com/dailymotion/why-we-dont-hire-rockstars-gurus-jedi-knights-or-ninjas-at-dailymotion-14e78fc10e">Why we don’t hire “rockstars”, “gurus”, “Jedi Knights” or “ninjas” at Dailymotion</a></p><h3>Equal pay for equal work</h3><p>We started by clarifying what “<em>equal pay for equal work</em>” means. How can we measure equal work in <a href="https://medium.com/dailymotion/how-to-move-300-people-to-full-remote-in-24-hours-34ff9bbb2b59">an international company with 350 employees</a> and over 200 different job titles when we cannot afford the services of a consulting firm? 
By defining equal work as <strong>“<em>equal responsibilities</em>”</strong>. We used our internal career framework, breaking down our jobs into larger occupational categories and a ladder from junior to senior VP, each level coming with a skillset and a salary band for each country.</p><p>This framework was already in place for talent acquisitions and promotions. What was missing was <strong>the gender* analysis</strong> to challenge the fairness of our decisions. We made sure to communicate and explain this framework, in an effort to empower women when asking for a raise or salary assessment. So at the end of 2019, we measured that <strong>women earned on average 6% less than men</strong>. We allocated <strong>0.4% of our gross wages</strong> to close the gender pay gap. With this budget, we were able to <strong>lower the gap to 4%</strong>. Looking at the trajectory, we were confident in our ability to reach our goal within the next 22 months.</p><blockquote><em>*</em>In most of the countries we operate in, employment law has a binary understanding of gender, especially in France where we have our HQ. So we started with a cis-gendered and binary-oriented analysis of the gender pay gap, but we are working to figure out how to implement the full spectrum of gender diversity. 
We are also working on implementing more Diversity KPIs to guarantee the opportunity equity of our decisions (for example age, family status, ethnicity, different ability…).</blockquote><h3><strong>The assumption was wrong…</strong></h3><p>Because of our attrition rate — Dailymotion is paying the price for the talent war — new hires and regular promotions, when assessing the required budget at the end of 2020, we were frustrated to find out that despite our efforts, <strong>the gender pay gap had slightly increased.</strong></p><figure><img alt="" src="https://cdn-images-1.medium.com/max/379/0*eOLnCg8-GFyPxGZ1.gif" /><figcaption>Our frustration at this exact moment</figcaption></figure><p>So what were we supposed to do in this kind of situation? Give up? Not an option. Here are the main takeaways from our next course of action:</p><ul><li>Analyze the impact of each hire, promotion, or termination</li><li>For any event impacting the gender pay gap negatively, immediately identify the necessary budget to offset it as soon as possible</li><li>And of course deal with the root cause: plan training sessions on unconscious bias, challenge and improve recruitment and promotion processes… and collect data!</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*J0A-XtJzWhA0T92KdtcqSw.png" /></figure><h3>It’s a long road getting there</h3><p>In 2021, we doubled the budget allocated to gender pay equity, hence deciding to lower the budget assigned to performance increases. <strong>Placing equity before performance</strong> was a conscious choice because, in the long run, our values matter more.</p><p><a href="https://medium.com/dailymotion/are-you-ready-to-take-action-cc0a539a2152">Are you ready to take action?</a></p><p>To leaders who want to reach gender equity in their organization and don’t know where to start: yes, it can be a frustrating journey because there is no recipe book (yet?). 
But it is also extremely motivating and rewarding to write this chapter of history: together with our diversity of approaches and methods, we have no doubt that we will close the gender pay gap. Our work at Dailymotion is not complete yet but our commitment remains unchanged.</p><blockquote>“Life doesn’t always give us what we deserve, but rather, what we demand. And so you must continue to push harder than any other person in the room.” — Wadi Ben-Hirki</blockquote><blockquote>Of course, we are open to feedback and to open-source our approach. We would be happy to share our work with any organization interested. And if you’re looking for a job, we’re hiring. Join Dailymotion and change the world through video!</blockquote><p><a href="https://careers.dailymotion.com/">Dailymotion Careers - Join our team</a></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=447bf1fa4c0d" width="1" height="1" alt=""><hr><p><a href="https://medium.com/dailymotion/how-dailymotion-committed-to-closing-the-gender-pay-gap-in-two-years-447bf1fa4c0d">How Dailymotion committed to closing the gender pay gap in two years</a> was originally published in <a href="https://medium.com/dailymotion">Dailymotion</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[How Deep Learning can boost Contextual Advertising Capabilities]]></title>
            <link>https://medium.com/dailymotion/how-deep-learning-can-boost-contextual-advertising-capabilities-c9ca7c8fc4e9?source=rss----5ab3bf42288e---4</link>
            <guid isPermaLink="false">https://medium.com/p/c9ca7c8fc4e9</guid>
            <category><![CDATA[advertising]]></category>
            <category><![CDATA[data-science]]></category>
            <category><![CDATA[computer-vision]]></category>
            <category><![CDATA[deep-learning]]></category>
            <category><![CDATA[data]]></category>
            <dc:creator><![CDATA[Brice De La Briere]]></dc:creator>
            <pubDate>Thu, 28 Jan 2021 12:57:30 GMT</pubDate>
            <atom:updated>2021-01-28T12:57:29.984Z</atom:updated>
            <content:encoded><![CDATA[<h4>Dailymotion’s advertising solution uses video frame signals and computer vision techniques to target categories from the IAB taxonomy while respecting user privacy</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*OYpN6gSoKoA4lMZhRMDuNQ.jpeg" /></figure><p><strong>In a world where the end of web cookies is fast approaching — bringing</strong> <strong>uncertainty for advertisers and marketers — and where technical and legal constraints are constantly increasing, the need for user privacy and personalized ads has never been so important. One of the biggest challenges for contextualization is the categorization of content at scale. Here is how we enhanced our in-house contextual advertising solution at Dailymotion, using state-of-the-art computer vision techniques for video categorization.</strong></p><p>Well-categorized content provides a superior user experience through better <a href="https://medium.com/dailymotion/building-modern-recommender-systems-when-deep-learning-meets-product-principles-c79b16375109">recommendations</a> and contextual ads, using <strong>video categorization </strong>to place ads adjacent to relevant content​. Additionally, it provides more capabilities and better performance to our advertisers. Hence, the idea to classify our videos was born.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*e4fgcT1WtYDlLoIV9EZUMA.jpeg" /><figcaption>Non-contextual video Ad example</figcaption></figure><p>Currently, our Partners can select an “upload category” while uploading their content to Dailymotion. Since it is very high-level information (for instance: news, sports, lifestyle, etc.), we wanted to go a step further to obtain a more granular definition of our content. 
For example, for the upload category “sports” we would like to say whether it is soccer, basketball, or rugby.</p><h3>Video <strong>Classification Problem</strong></h3><p>From a machine learning standpoint, the problem that we are trying to solve is called a <strong>multi-label</strong> <strong>classification problem, </strong>which is a generalization of multiclass classification. In the multi-label problem, there are no restrictions on how many classes an instance can be assigned to.</p><h4><strong>The IAB content taxonomy</strong></h4><p>Now that we have decided to classify our content, we need to define which categories to use and what granularity level we want. We opted to use the <a href="https://www.iab.com/guidelines/content-taxonomy/">IAB content taxonomy</a> as it has become the new standard in the ad industry. This taxonomy is designed to cover any possible content, and it enables us to describe each video of our catalog more accurately and consistently.</p><p>We selected a subset of 196 IAB categories that suited our catalog variety. This allowed us to slightly simplify the classification task.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*ChQvFA1dj7UEq6hAV_6Z7w.jpeg" /><figcaption>Some of the selected IAB categories</figcaption></figure><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*GE2KQiXjcyyVBOVfA2GNrg.jpeg" /><figcaption>High-level I/O of the classifier</figcaption></figure><h4><strong>The Dataset</strong></h4><p>Since we wanted to tackle this problem with a supervised model using the IAB categories as labels, we needed a properly labeled dataset. 
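To make the multi-label setting concrete, here is a minimal NumPy sketch of how independent sigmoid outputs let a single video receive zero, one, or several categories at once (the category names, scores, and threshold are illustrative assumptions, not the production model):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def predict_labels(logits, categories, threshold=0.5):
    """Multi-label prediction: each category is scored independently,
    so the predicted set can contain any number of categories."""
    scores = sigmoid(np.asarray(logits, dtype=float))
    return [cat for cat, s in zip(categories, scores) if s >= threshold]

categories = ["Soccer", "Basketball", "Rugby"]
# Hypothetical raw scores for one video: confident about Soccer and Rugby.
print(predict_labels([2.0, -3.0, 1.5], categories))  # ['Soccer', 'Rugby']
```

Unlike multiclass classification, no softmax forces the categories to compete: each sigmoid output is an independent confidence.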
There are many different signals we can use to define the examples in our dataset:</p><ul><li><strong>video metadata:</strong> video owner, upload category, language, upload date, etc.</li><li><strong>textual signals:</strong> video title and video description</li><li><strong>visual signals: </strong>video frames</li><li><strong>audio signals: </strong>audio track</li></ul><p>Through the use of textual signals, we have already developed several solutions that tackle the content classification problem and currently have two textual models in production.</p><p>One dedicated to French and English text:</p><p><a href="https://medium.com/dailymotion/bag-of-words-representation-for-video-channels-semantic-structuring-4f2777591e4a">Bag-of-words representation for video channels’ semantic structuring</a></p><p>And one dedicated to multiple other languages:</p><p><a href="https://medium.com/dailymotion/how-we-used-cross-lingual-transfer-learning-to-categorize-our-content-c8e0f9c1c6c3">How we used Cross-Lingual Transfer Learning to categorize our content</a></p><p>Although these textual models work well and are already in production at Dailymotion, the title and description sometimes do not allow us to predict an IAB category, either because they are very short, too vague, or in a language that we do not handle yet. 
That’s why we thought having another model based only on visual signals would allow us to improve our predictions in some scenarios:</p><ul><li>It would<strong> improve our overall coverage,</strong> predicting categories for the videos in a language that we do not handle yet</li><li>As the problem is multi-label, it would<strong> increase the number of correct IAB categories per video</strong>, for the videos already classified by the textual models</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*HoNx6Nj-RdAMEL46XlNWqg.jpeg" /><figcaption>An example of complementary predictions</figcaption></figure><blockquote>In this example, a first model based only on the textual signal predicted “Interior Decorating<strong>” </strong>and, from the video frames, our visual model predicted “Food Movements”. Both predictions seem to be accurate given their respective signals. This is the complementarity we hope to get for the already categorized videos.</blockquote><h4>Getting the IAB labels to train our model</h4><p>To train our classifier, we will use the predictions given by the textual models as labels for the visual model. Here we introduce some uncertainty by using labels that come from the predictions of another model, but we are confident in these predictions and carefully selected a scope where they perform well to constitute our dataset. That is why we think that, on average, the textual model predictions would be good ground truth for the visual model.</p><p>We don’t want the models (textual and visual) to be identical, i.e., to predict the same categories for the same video. 
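The weak-labeling step described above, keeping only the textual model’s confident predictions as training labels for the visual model, could be sketched as follows (the data layout, function name, and confidence threshold are illustrative assumptions, not Dailymotion’s actual pipeline):

```python
def build_weak_labels(textual_predictions, confidence_threshold=0.9):
    """Turn another model's confident predictions into training labels.

    textual_predictions maps a video id to a list of
    (category, confidence) pairs produced by the textual model.
    Only predictions above the threshold are kept; videos with no
    confident prediction are dropped from the training set.
    """
    dataset = {}
    for video_id, preds in textual_predictions.items():
        confident = [cat for cat, conf in preds if conf >= confidence_threshold]
        if confident:
            dataset[video_id] = confident
    return dataset

preds = {
    "vid1": [("Soccer", 0.97), ("Rugby", 0.42)],
    "vid2": [("Cooking", 0.55)],
}
print(build_weak_labels(preds))  # {'vid1': ['Soccer']}
```

Restricting the scope to confident predictions is what makes the noisy labels usable as ground truth on average.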
Since they use very different signals as input, we hope to get good complementarity between them, both in terms of coverage and the number of categories per video.</p><h4>Visual Scope</h4><p>One particularity of the visual model is that we only use the frames to classify a video, which implies two strong hypotheses:</p><ul><li>It is better if the video is <strong>short (between 1 &amp; 10 minutes)</strong>. Usually, it makes less sense to try to predict a category like “Automotive”, “Cooking” or “Pets” on a one-hour long documentary, for instance.</li><li>The categories that we try to predict must be <strong>visual</strong>. It is very difficult to predict “News”, “Politics”, or “Jazz” just by looking at the frames of a video, even for a human.</li></ul><p>This is why we had to reduce the number of categories the model can predict to the visual ones only. Here is an example of what could be non-visual and visual according to us:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*jqyMRlOZ-zXpg1UkDkEW-w.jpeg" /><figcaption>IAB category bucketing</figcaption></figure><h3>Video Classification State of the Art</h3><p>Now that we have defined our problem, we know that it corresponds to a video classification problem where we want to train our model end-to-end with our own data and labels. Therefore, we looked at the video classification literature, and in particular two workshops from CVPR&#39;17 [1] and ECCV&#39;18 [2].</p><p>The NeXtVLAD [5] is one of the best-performing non-ensemble architectures in the video understanding competition (YT8M). We were particularly interested in non-ensemble solutions for simplicity&#39;s sake and running costs. The NeXtVLAD is an improvement of the NetVLAD network [3][4] that proved to be effective for spatial and temporal aggregation of visual and audio features [6].</p><p>It uses video frame features (image descriptors) from another model, InceptionV3 [8], trained on the ImageNet dataset, as inputs. 
Then these described frames are temporally aggregated at the video level by the NeXtVLAD layer. The video aggregation is then enhanced by an SE Context Gating module, aiming to model the interdependency among labels [6][7]. This video aggregation is finally used to predict some video categories.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*xsMCCLaBfrpnCIVIad0D_w.jpeg" /><figcaption>NeXtVLAD architecture schema from the research paper</figcaption></figure><h3><strong>End-to-end Training &amp; In-house Tuning</strong></h3><p>The NeXtVLAD has performed very well on the <a href="http://research.google.com/youtube8m/">yt8m video classification challenge</a>, but our goal is slightly different and we needed to retrain the network end to end for three main reasons:</p><ul><li>We have defined our own labels, a subset of visual IAB categories that suits our advertising needs. Initially, we tried to map the yt8m challenge labels to our own IAB categories to avoid retraining end-to-end, but the results we obtained were not satisfactory. This was most likely because the mapping that we created ourselves was not perfect and introduced errors, and also because our video distribution might be different from the one in the training dataset from the yt8m challenge.</li><li>We want the model to be fully trained on the distribution of our own labels to get the best performance on our videos</li><li>We made some light modifications to the original network architecture</li></ul><p>To fit our needs and get the best possible performance for our task, we modified the original architecture with two main changes:</p><ul><li>Addition of in-house metadata in the model architecture to improve the classification. 
Our metadata is encoded and concatenated to the video-level aggregated vector (NeXtVLAD layer output)</li><li>For engineering simplicity&#39;s sake, we ignored the audio input feature, at least initially</li></ul><p>Here is the modified architecture we currently use:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*bY2DyC1lyc1Q-fGgCJsv4Q.jpeg" /><figcaption>Our slightly tuned NeXtVLAD architecture</figcaption></figure><p>The only part of the pipeline that we do not retrain, using it as-is, is the InceptionV3 [8] network that describes the frames given as input to the NeXtVLAD.</p><h3><strong>Performance Measurement &amp; Results</strong></h3><h4>Performance metrics</h4><p>The way we evaluate our model must be aligned with our product needs, and we need to make sure we optimize for the right metric. In our case, the product need is defined as follows: for any given category, we want to maximize the number of videos correctly labeled with it. This would be useful for both recommendation and advertising purposes, but to do so, we need to split it into two objectives:</p><ul><li><strong>Coverage: </strong>we want to have as many videos labeled with a relevant IAB category as possible.</li><li><strong>The number of IAB categories per video: </strong>we want to have as many relevant IAB categories per video as possible.</li></ul><p>To measure our performance on the test set, we plot the precision-coverage curves and, in particular, look at the coverage value we obtain at a given precision of 80%.</p><h4>Training Loss</h4><p>The loss we use to train the network must be tractable and differentiable; we used the sigmoid cross-entropy, which measures the probability error in discrete classification tasks in which each class is independent and not mutually exclusive.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/916/1*upg_XYa10VSOIDT1umE1TQ.png" /><figcaption>Cross-entropy 
formula</figcaption></figure><h4>Results</h4><p>Below is an evaluation example with two different training runs, one with the base NeXtVLAD architecture (dashed) and the other with some architecture improvements, like adding in-house metadata (plain).</p><p>Let’s have a look at the precision-coverage curves:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*hDcroprmU21W7a1XAP4WpQ.jpeg" /></figure><p>For our product requirement of 80% precision, we obtain the following results:</p><ul><li><strong>Non-retrained model, </strong>mapping initial yt8m labels to ours: we had around <strong>30% coverage, </strong>giving an <strong>F1 score of ~0.44</strong></li><li><strong>Base architecture</strong>: we obtained around <strong>46% coverage</strong>, <strong>F1 score ~0.58</strong></li><li><strong>Improved architecture</strong>: we obtained around <strong>64% coverage</strong>, <strong>F1 score ~0.71</strong></li></ul><blockquote>By adding our in-house metadata within the network and retraining it end-to-end, we managed to significantly increase the coverage.</blockquote><h3>Production Pipeline Overview</h3><p>Putting such a model in production is a challenge since many different parts need to work at scale: video download, frame extraction, preprocessing with InceptionV3, and finally the NeXtVLAD network inference. We need to put all of these parts together in a pipeline to answer our two use cases:</p><ul><li>Run our model on every new video uploaded. Significant scaling is required.</li><li>Backfill our video catalog, i.e., run the model for all our videos uploaded in the past. 
Huge scaling is needed here since there are tens of millions of videos available in our catalog.</li></ul><p>To fit these two use cases with the scale they require, we designed the following pipeline:</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*cIRqfnjbUrGYBxm0XTnZNg.jpeg" /><figcaption>Schema of our pipeline</figcaption></figure><p>This pipeline uses the framework <a href="https://github.com/spotify/klio">Klio</a> developed by Spotify, initially designed for large-scale audio pipelines, but it also suits our video pipeline needs.</p><h3>Merging Visual &amp; Textual results</h3><p>We have two kinds of models that classify our content based on different inputs: textual and visual. We can think about how to use and merge the different predictions we get when a video is scored by two different models. Since one of our product needs is to increase the number of IAB categories per video, we decided to begin with a simple union of the predictions.</p><p>With this simple approach, for the videos already classified by a textual model, we managed <strong>to increase our number of IAB categories by</strong> <strong>44% on French and English</strong> videos and <strong>by</strong> <strong>50% for our multilingual model</strong>.</p><h3>Future Work &amp; Improvements</h3><p>Enhancing our targeting possibilities with computer vision has been very challenging, both in terms of machine learning and data engineering, and we are happy we have managed to improve on the base performance we initially achieved. Still, we have many ideas to continue improving our pipeline:</p><ul><li>As mentioned earlier, we initially excluded the audio input, and re-adding it is an obvious improvement opportunity.</li><li>The union of the different predictions is a simple but naive approach that we can improve.</li><li>We also think that with better visual descriptions of the frames we would attain better performance. 
We currently use the output from InceptionV3, but some more recent architectures have better performance on ImageNet. Also, we have seen at NeurIPS this year that robustness leads to improved feature representation [9], so using a robustly trained ImageNet model could be an idea as well.</li></ul><p>Video categorization has already allowed us to improve Dailymotion’s user experience and contextual advertising capabilities while respecting user privacy. Advanced Machine Learning models can now interpret what a video is about, the feeling it evokes, and at which exact video frame a specific product category is shown, opening infinite new opportunities for hyper-contextual targeting.</p><blockquote>The quest to find more sustainable methods and “healthy data” is only just beginning, so stay tuned to hear more about our technical solutions to leverage video signals.</blockquote><p><a href="https://dailymotionadvertising.com/">Dailymotion Advertising - the home for videos that matter</a></p><h4><strong>References</strong></h4><blockquote>[1] <a href="http://research.google.com/youtube8m/workshop2017/index.html">CVPR’17 Workshop on YouTube-8M Large-Scale Video Understanding</a></blockquote><blockquote>[2] <a href="http://research.google.com/youtube8m/workshop2018/index.html">The 2nd Workshop on YouTube-8M Large-Scale Video Understanding</a></blockquote><blockquote>[3] <a href="https://hal.inria.fr/inria-00633013/document/">Aggregating local image descriptors into compact codes</a></blockquote><blockquote>[4] <a href="https://arxiv.org/abs/1511.07247">NetVLAD: CNN architecture for weakly supervised place recognition</a></blockquote><blockquote>[5] <a href="https://arxiv.org/abs/1811.05014">NeXtVLAD: An Efficient Neural Network to Aggregate Frame-level Features for Large-scale Video Classification</a></blockquote><blockquote>[6] <a href="https://arxiv.org/abs/1706.06905">Learnable pooling with Context Gating for video classification</a></blockquote><blockquote>[7] <a 
href="https://arxiv.org/abs/1709.01507">Squeeze-and-Excitation Networks</a></blockquote><blockquote>[8] <a href="https://arxiv.org/abs/1512.00567">Rethinking the Inception Architecture for Computer Vision</a></blockquote><blockquote>[9] <a href="https://arxiv.org/abs/2007.08489">Do Adversarially Robust ImageNet Models Transfer Better?</a></blockquote><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=c9ca7c8fc4e9" width="1" height="1" alt=""><hr><p><a href="https://medium.com/dailymotion/how-deep-learning-can-boost-contextual-advertising-capabilities-c9ca7c8fc4e9">How Deep Learning can boost Contextual Advertising Capabilities</a> was originally published in <a href="https://medium.com/dailymotion">Dailymotion</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[How we used Cross-Lingual Transfer Learning to categorize our content]]></title>
            <link>https://medium.com/dailymotion/how-we-used-cross-lingual-transfer-learning-to-categorize-our-content-c8e0f9c1c6c3?source=rss----5ab3bf42288e---4</link>
            <guid isPermaLink="false">https://medium.com/p/c8e0f9c1c6c3</guid>
            <category><![CDATA[language]]></category>
            <category><![CDATA[nlp]]></category>
            <category><![CDATA[machine-learning]]></category>
            <category><![CDATA[deep-learning]]></category>
            <category><![CDATA[data]]></category>
            <dc:creator><![CDATA[Samuel Leonardo Gracio]]></dc:creator>
            <pubDate>Tue, 20 Oct 2020 08:43:03 GMT</pubDate>
            <atom:updated>2020-10-20T08:43:03.411Z</atom:updated>
            <content:encoded><![CDATA[<h4>How Dailymotion transfers the knowledge from an English/French textual model to new languages</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*orvLse2H2oqgyO4qdw6urw.jpeg" /></figure><p><a href="https://www.dailymotion.com/"><strong>Dailymotion</strong></a><strong> is a video platform hosting hundreds of millions of videos in more than 20 languages which are watched every day by millions of users. One of our main priorities is to provide the most suitable content to our users. This can be done only through a precise categorization of our videos no matter the language.</strong></p><p>A year ago, Dailymotion presented how to <a href="https://medium.com/dailymotion/how-to-design-deep-learning-models-with-sparse-inputs-in-tensorflow-keras-fd5e754abec1">predict the main categories of a video based on its textual metadata with sparse inputs in Tensorflow Keras</a>. The results provided by using our <a href="https://medium.com/dailymotion/topic-annotation-automatic-algorithms-data-377079d27936">Granular Topics</a> generator for English and French videos encouraged us to investigate how to expand such results to other languages. How did we manage to transfer the results of our previous model to a larger set of languages?</p><h3>A need for multilingual categories</h3><p>One of the main issues a video hosting platform faces is to be able to <strong>automatically categorize a video</strong>, or improve the <a href="https://medium.com/dailymotion/building-modern-recommender-systems-when-deep-learning-meets-product-principles-c79b16375109">recommender system</a>, for instance. 
At Dailymotion, we have already built a strong Deep Learning pipeline that automatically predicts the category of an uploaded video from its textual metadata (title, tags, description…) using a Bag-of-Words classification model.</p><p><a href="https://medium.com/dailymotion/bag-of-words-representation-for-video-channels-semantic-structuring-4f2777591e4a">Bag-of-words representation for video channels’ semantic structuring</a></p><p>Nevertheless, this solution has a major limitation: it only works for English and French videos. In fact, generalizing this method to other languages would be sub-optimal and would require a lot of hand-labeled data for every single language.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*9EhwX9xxhWqSs7Nq8_4DfQ.png" /><figcaption>Dailymotion video catalog distribution per language</figcaption></figure><p>Moreover, other languages represent<strong> more than a third </strong>of our video catalog. As an international company, we needed to tackle this issue.</p><h3>A Transfer Learning problem</h3><p>With the appearance of <strong>strong multilingual NLP models</strong> such as m-BERT or XLM, there are several ways to tackle this issue, but we chose to approach this as a Transfer Learning problem. In order to use the work previously done, we wanted to build a model that trains on English and French textual metadata but predicts on other languages.</p><blockquote>“How is it possible to learn with English and French textual data and predict in other languages?”</blockquote><p>The objective of this article is to share how we were able to test and use new state-of-the-art multilingual NLP models in order to complete this cross-lingual classification task.</p><h4>“To Translate or not to Translate, that is the question…”</h4><p>One can wonder: <em>Can you simply translate all your textual metadata into English? </em>The answer is yes… and no. 
We can do this using the Google Translate API, for instance, but this trivial solution would cost us a lot of money. In fact, we found a happy medium: a multilingual pre-trained model that can handle textual information in several languages.</p><h4>What is a Multilingual pre-trained model?</h4><p>The recent improvements in NLP research seem to converge on a specific type of model: heavy models trained on hundreds of millions of Wikipedia pages. These new models, as impressive as they are, can’t be retrained easily: during training, they require a lot of computing power, often including dozens of <a href="https://cloud.google.com/blog/products/compute/tesla-v100-gpus-are-now-generally-available">Tesla V100 GPUs</a>.</p><p>Usually, these models have been trained once for a very general purpose. They are then available with pre-trained weights. Below is a short list of the multilingual pre-trained models that we used during our research:</p><ul><li><a href="https://github.com/google-research/bert/blob/master/multilingual.md"><strong>m-BERT</strong></a><strong>:</strong> a multilingual version of the now most famous NLP model, BERT. Trained on 104 languages, using shared <a href="https://arxiv.org/abs/1706.03762">Transformers</a>.</li><li><a href="https://github.com/facebookresearch/XLM"><strong>XLM</strong></a><strong>:</strong> based on the BERT architecture, XLM introduces a different training approach that is supposed to be more efficient for cross-lingual tasks.</li><li><a href="https://tfhub.dev/google/universal-sentence-encoder-multilingual/3"><strong>MUSE (Multilingual Universal Sentence Encoder)</strong></a><strong>:</strong> very different from the others, a model focused on aligning embeddings.</li></ul><blockquote><strong>Note:</strong> All these models are available in <strong>several versions </strong>(large, small, etc.). 
This article will present their application on a more concrete data science problem.</blockquote><h4>Complexity of our task</h4><p>We want to train with English and French textual metadata and predict categories on other languages. This is called a <em>“Zero-Shot cross-lingual classification problem”</em>. While all of the models presented above are supposed to be multilingual, i.e., built to beat theoretical benchmarks on cross-lingual tasks such as XNLI, they are very often evaluated on simpler tasks.</p><blockquote><strong>Note:</strong> <a href="https://github.com/facebookresearch/XNLI">XNLI </a>is an evaluation corpus for language transfer and cross-lingual sentence classification in 15 languages.</blockquote><p>For a real-world data problem, you wouldn’t test these models in the same conditions as in a research paper. Below are examples of some issues that may occur:</p><ul><li><strong>Complex data</strong>: since we are using the textual descriptions of our videos, the data is different from a Wikipedia page, for instance. In fact, these descriptions sometimes go straight to the point and contain abbreviations, or even spelling mistakes.</li><li><strong>Content distribution:</strong> in our case, we know that our video catalog can be different for each language or country. 
For instance, we know that our French video catalog contains more videos categorized as <em>“Soccer” </em>than Korea’s, due to the difference in culture between these two countries.</li></ul><p>This is the reason why we needed to test and build our own implementation of these multilingual NLP models, as we cannot blindly follow theoretical benchmarks.</p><h3>What is a good cross-lingual model?</h3><p>Now that we have presented three multilingual models, we can explore how we decided between them.</p><h4>The importance of aligned embeddings</h4><p>All the state-of-the-art models presented in this article share a common structure: they take a sentence as input, convert its words into a sequence of tokens, and then output an embedding. The main objective for these three multilingual models is to create a common vector space for all the languages. Nevertheless, the main difference between them is precisely this common vector space, i.e., how they were trained and how their embeddings are built.</p><p>Although XLM and m-BERT do not share the exact same structure, they do share a common purpose: being able to perform on several cross-lingual tasks. For that reason, their training tasks are more general and the resulting embeddings are more complex, not simply aligned. By contrast, MUSE pre-training was focused on one goal: to create <a href="https://arxiv.org/abs/2002.03518">the most aligned embeddings possible</a>, no matter the language.</p><blockquote>“But… What do you mean by aligned embeddings?”</blockquote><figure><img alt="" src="https://cdn-images-1.medium.com/max/844/1*5FJj599kX-L0PPXhux3hgA.png" /><figcaption>Example of MUSE textual similarity: the darker the box, the more similar the sentences.</figcaption></figure><p>MUSE was trained simultaneously on 16 languages with a shared encoder working on translation tasks. The result is a very good capability of mapping two similar sentences in two different languages to the same region of the vector space. 
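The notion of aligned embeddings can be made concrete with cosine similarity. Below is a minimal NumPy sketch using made-up 3-dimensional vectors standing in for sentence embeddings (real MUSE embeddings are 512-dimensional and come from the encoder itself):

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy vectors: with aligned embeddings, a sentence and its translation
# land close together, while an unrelated sentence lands farther away.
en_sentence = [0.9, 0.1, 0.2]       # an English sentence
fr_translation = [0.88, 0.12, 0.2]  # its French translation
unrelated = [0.1, 0.9, 0.3]         # a sentence about something else

print(cosine_similarity(en_sentence, fr_translation) >
      cosine_similarity(en_sentence, unrelated))  # True
```

This cross-lingual neighborhood property is exactly what a downstream classifier trained on English and French embeddings relies on when scoring other languages.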
<em>For instance, if MUSE encodes two sentences about Cristiano Ronaldo, one in English and the other in French, the resulting embeddings will be very similar. MUSE is not language specific. </em>On the other hand, m-BERT and XLM embeddings are more general: they are not necessarily aligned per language but encode more information than that. Unfortunately, complexity can sometimes be a burden.</p><p>In theory, the MUSE shared representation between languages is better for a zero-shot classification task: if we feed the embeddings produced by this model to our classifier, similar texts will yield similar inputs, no matter the language, without using any translation.</p><h4>From embeddings to cross-lingual classifier</h4><p>In order to really compare these pre-trained models, we present the following structure for our cross-lingual classifier:</p><ul><li><strong>Input: </strong>An embedding, obtained by passing the video’s textual metadata to the pre-trained multilingual model.</li><li><strong>Classifier core:</strong> Two Dense layers, followed by a Batch Norm layer and dropout.</li><li><strong>Output: </strong>A sigmoid, resulting in a vector with a confidence for each possible class.
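As a rough NumPy sketch of this forward pass at inference time (layer sizes and weights are arbitrary placeholders, not the production model; batch norm and dropout only matter during training, so they are omitted):

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Placeholder sizes for illustration: 512-dim input embedding, 10 categories.
EMB_DIM, HIDDEN, N_CLASSES = 512, 128, 10
W1 = rng.normal(scale=0.05, size=(EMB_DIM, HIDDEN))
W2 = rng.normal(scale=0.05, size=(HIDDEN, N_CLASSES))

def predict(embedding, threshold=0.5):
    """Dense -> ReLU -> Dense -> sigmoid; one confidence per class."""
    hidden = np.maximum(embedding @ W1, 0.0)          # Dense + ReLU
    confidences = sigmoid(hidden @ W2)                # independent per-class scores
    labels = np.nonzero(confidences >= threshold)[0]  # multi-label decision
    return confidences, labels

confidences, labels = predict(rng.normal(size=EMB_DIM))
```

Because each class gets its own sigmoid score (rather than a softmax over all classes), several confidences can pass the threshold at once, which is what makes the multi-label output possible.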
Thus, a video can have multiple classes that characterize its content.</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*qTswcGkYz6cTGGtZkeumNQ.png" /><figcaption>Structure of our final model</figcaption></figure><p>After testing all of the different multilingual pre-trained models, in both their small and large versions, and after days spent on hyper-parameter tuning, we found a winner.</p><h3>The success of simplicity</h3><p>Despite the many research articles about the surprising multilingual aspect of m-BERT or the incredible performance of XLM, our own conclusions are mixed.</p><h4>BERT is dead, long live MUSE</h4><p>We tried to test our architecture with different versions of both m-BERT and XLM, but neither of them gave us satisfying results. In fact, our task may be too different from the usual benchmarks. For a cross-lingual task like ours, bordering on a problem where large-scale translation would be required, it seems that MUSE, in its lighter version, is far better than these very heavy Transformer-based pre-trained models:</p><ul><li><strong>Computing time:</strong> the lightest version of MUSE, based on CNNs, is able to compute an embedding three times faster than BERT or XLM: about 8ms on average versus 24ms for the models built with Transformers.</li><li><strong>Aligned Embeddings:</strong> as explained, we believe that for this task, the most important feature for a cross-lingual model is to provide aligned embeddings. MUSE is far better than the two others for that purpose.</li><li><strong>Global performances:</strong> we used <a href="https://www.quora.com/What-does-the-terms-Top-1-and-Top-5-mean-in-the-context-of-Machine-Learning-research-papers-when-report-empirical-results">Top-1 accuracy</a> in order to measure the performance of each model.
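Top-1 accuracy simply checks whether the class with the highest predicted confidence matches the ground-truth label; a minimal sketch with made-up scores:

```python
import numpy as np

def top1_accuracy(scores, true_labels):
    """Fraction of samples whose highest-scoring class is the true class."""
    predicted = np.argmax(scores, axis=1)
    return float(np.mean(predicted == np.asarray(true_labels)))

# Three videos, four candidate categories (scores made up for illustration).
scores = np.array([
    [0.1, 0.7, 0.1, 0.1],  # predicted class 1
    [0.6, 0.2, 0.1, 0.1],  # predicted class 0
    [0.2, 0.2, 0.5, 0.1],  # predicted class 2
])
print(top1_accuracy(scores, [1, 0, 3]))  # 2 of 3 correct
```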
We saw significant differences between these models in terms of metrics.</li></ul><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*5VuXac8jMKP_220Dqwjnyg.png" /><figcaption>Example of categories predicted on a German video using our model</figcaption></figure><p>Overall, despite the current popularity of <a href="https://arxiv.org/abs/1706.03762">Transformers</a>-based NLP models, we believe that simplicity is key. This real-world project required a cross-lingual transfer learning model, and MUSE, with its aligned embeddings per language, seems to be the best solution.</p><h3>What’s Next?</h3><ul><li>As already done for the English and French videos, this multilingual model is <strong>now in production</strong>. Each time a video gets uploaded to the platform, it is <strong>automatically tagged</strong> with a category.</li><li>We will continue to improve the categorization of our video catalog by using other signals. For instance, we are currently working on a <strong>computer-vision based model</strong> that tags videos using their frames.</li></ul><ul><li><a href="https://medium.com/dailymotion/topic-annotation-automatic-algorithms-data-377079d27936">On topic annotation: how to extract relevant labels from videos?</a></li><li><a href="https://medium.com/dailymotion/building-modern-recommender-systems-when-deep-learning-meets-product-principles-c79b16375109">Building modern recommender systems: when deep learning meets product principles</a></li></ul><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=c8e0f9c1c6c3" width="1" height="1" alt=""><hr><p><a href="https://medium.com/dailymotion/how-we-used-cross-lingual-transfer-learning-to-categorize-our-content-c8e0f9c1c6c3">How we used Cross-Lingual Transfer Learning to categorize our content</a> was originally published in <a href="https://medium.com/dailymotion">Dailymotion</a> on Medium, where people are continuing the conversation by highlighting and
responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Dailymotion’s Product design operations for facilitating shorter time to market]]></title>
            <link>https://medium.com/dailymotion/dailymotions-product-design-operations-for-facilitating-shorter-time-to-market-981a0a2b53e6?source=rss----5ab3bf42288e---4</link>
            <guid isPermaLink="false">https://medium.com/p/981a0a2b53e6</guid>
            <category><![CDATA[engineering]]></category>
            <category><![CDATA[design]]></category>
            <category><![CDATA[operations]]></category>
            <category><![CDATA[framework]]></category>
            <category><![CDATA[dailymotion]]></category>
            <dc:creator><![CDATA[Binard Guillaume]]></dc:creator>
            <pubDate>Thu, 24 Sep 2020 08:00:05 GMT</pubDate>
            <atom:updated>2020-09-24T08:02:04.591Z</atom:updated>
<content:encoded><![CDATA[<h4><em>Dailymotion’s journey to adapt our design methodologies to shorter, time-boxed missions</em></h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*UYoyYOnmb24MPyCQyZvY1A.png" /></figure><p><strong>Soon after </strong><a href="https://medium.com/dailymotion/how-dailymotion-hacked-its-feature-team-project-model-to-shorten-time-to-market-618232f671da"><strong>deploying our own project management model in September 2019</strong></a><strong>, we developed specific design missions to support this new model’s success. Whether you’re a product design team leader or another key stakeholder, anyone interested in shipping products earlier, and in the processes behind it, will find this article useful.</strong></p><p>In September 2019, with the goal of reducing time to market, Dailymotion implemented a new project organization model. To do so, “We created two types of mission: the <strong>Discovery Mission</strong> and the Delivery Mission. The first one lasts for two weeks, the aim being to reduce uncertainty and define the work to be potentially done on a <strong>Delivery Mission</strong>.” By all accounts, this new organization has been successful: more team accountability and more missions hitting the market earlier.</p><p><a href="https://medium.com/dailymotion/how-dailymotion-hacked-its-feature-team-project-model-to-shorten-time-to-market-618232f671da">How Dailymotion hacked its feature team project model to shorten time to market</a></p><p>But soon after implementing the new model, the product design team noticed challenges that made it clear that some aspects of the model needed greater definition. After all, the work of design varies from one project mission to another: the “Discovery mission” being the most design-heavy, this label lacked the detail that designers needed in order to set product managers’ expectations.
Not only that, but this lack of specificity was a huge missed opportunity to identify projects’ maturity, articulate each mission’s scope, and facilitate design operations and awareness within the project team. To ensure that the new project model was a success both in terms of product management and product design, the design team decided to triage Discovery missions into three types of missions: <strong>Explo</strong>, <strong>Recon</strong>, and <strong>Build</strong>.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*IBh5uespuG1wLdUe55gIzw.gif" /><figcaption>The Discovery mission is the most design-heavy mission.</figcaption></figure><h3>The need to change our design operations</h3><p>The purpose of the two-week Discovery mission is to reduce uncertainty before the engineering team starts building new features. It is during this short initial mission that the bulk of the design work happens; anything from user research and competitive analysis, to designing and polishing complete interactions can take place at this stage. As a sprint is obviously not enough to provide a meaningful and exhaustive design from scratch, we needed to organize our contribution to the Discovery mission in a substantial and sustainable way. To make these two-week projects as productive as possible, we established the following goals:</p><h4><strong><em>Granularly identify each project’s maturity level</em></strong></h4><p>How to objectively qualify the maturity of a project? What is our real knowledge level versus our desire to ship early? How to identify how many discovery missions would be necessary to tackle the project?</p><h4>Accurately estimate each mission’s scope</h4><p>Shipping earlier doesn’t mean shipping faster — it means shipping less.
How can we help adapt the scope to the team’s capacity and the timeframe?</p><h4><strong><em>Facilitate design operations and awareness across teams</em></strong></h4><p>Helping designers be more efficient in their operations and raising the project team’s awareness is key to performing well: raising the quality bar while keeping the pace.</p><h3>Co-creating Discovery design missions’ operations</h3><figure><img alt="" src="https://cdn-images-1.medium.com/max/800/1*f-yx4fGG6SKiWurOeakSdA.gif" /><figcaption>Mapping design activities for each discovery mission</figcaption></figure><p>To facilitate cooperation and buy-in from product managers, developers, designers, and other contributors, we scheduled a series of workshops in our Paris and New York offices. The goal of the workshops was to introduce a logic to triage the main different types of Discovery missions, set expectations for each of them, and map the kinds of activities designers undertake during each type of mission. This helped the team truly weigh design operations in such scenarios.</p><h4><strong><em>Adding granularity to the Discovery mission</em></strong></h4><p>Being involved in a Discovery mission is somewhat similar to being involved in a commando operation: a short time frame and very localized action. As a Discovery mission is too general and too large to be specific in a short sprint cadence, we decided to break it down into three Discovery mission types, each referring to a crucial step in the design workflow: EXPLO, RECON and BUILD. These refer to commando mission types.</p><blockquote>To make simple analogies, an <strong>EXPLO</strong> mission is about investigating: collecting information about why we should, for example, cross this river. A <strong>RECON</strong> mission is about experimenting with how to cross this river (bridge, boat, swim, submarine, plane, catapult, rope…).
A <strong>BUILD</strong> mission is about defining the specific solution we need to develop (what specific type of bridge?).</blockquote><p>Those three words help the team instantly visualize the project’s status and synchronize expected operations. This framework of three distinct missions is the very core of all our design activities. The commando image is used for its inspirational clarity, and really fits our project organization at Dailymotion.</p><h4><strong><em>Map activities with our internal collaborators</em></strong></h4><p>Having defined those three Discovery mission types, we held workshops for the design &amp; product management teams in Paris and New York. “We gave everyone a marker and asked them to write examples of inputs and outputs on a whiteboard for each mission.” Inputs refer to the knowledge previously acquired about the project, and outputs focus on what the team agrees to deliver at the end of the mission. This makes it easier to responsibly and accurately estimate the amount of work that can be done in two weeks. This exercise helped align expectations of designers and product managers and also raised the project team’s awareness about how design operations unfold.</p><p>In addition, it was a great opportunity to address participants’ doubts about this process. For example: we don’t necessarily need to always run the three missions in a row. We can skip one mission and move to the next if the prior knowledge is objectively sufficient.
Likewise, we can repeat a mission type if the workload is too large to be achieved in one mission.</p><blockquote>By synthesizing the various workshops’ outcomes, we defined the key activities for each of the three types of design missions.</blockquote><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*n5vA_yttwW7_F8Jfe7vg1g.jpeg" /><figcaption>The three Discovery mission cards.</figcaption></figure><h4><strong><em>Set up missions’ facilitation tools</em></strong></h4><p>Having organized design activities through the three Discovery missions, it was time to build tools that would help teams properly run those missions. We added the following tools to our mission Wiki, the company’s centralized knowledge platform:</p><blockquote><strong><em>Mission cards:</em> </strong>These cards explain the nature and goals of each mission as well as the conditions that must be met in order to launch each type of mission.</blockquote><blockquote><strong><em>Missions checklist cards:</em></strong> Every mission comes with its own checklist. The input checklist helps teams validate the acquired knowledge and confirm the right Discovery mission to launch. The output checklist helps the team fine-tune the goal and agree on the deliverables to produce.</blockquote><blockquote><strong><em>Pre-scheduled calendar:</em></strong> Once the team has checked the design activities to be done, they can arrange them on the sprint calendar, which helps monitor and manage the operations throughout the mission.</blockquote><blockquote><strong><em>Artifact links:</em> </strong>As we use a lot of different tools to conduct our studies, making them accessible and easy to find is crucial.
It also makes it easier for new team members joining a mission to get up to speed quickly.</blockquote><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*3uW4SyVNdq32pB8ewq6BYw.jpeg" /><figcaption>The toolbox helps the team run better design operations</figcaption></figure><h3>Design operations supporting the new organization</h3><p>Thanks to the collaboration and support from each team at Dailymotion, the first results after the introduction of the three Discovery design missions were very encouraging. The words EXPLO, RECON and BUILD joined our teams’ vocabulary, and these terminologies made it easier for everyone to visualize what would happen in those missions. Forms and questionnaires sent to teams helped us measure the benefits of the three Discovery design missions. The people who answered the questionnaire noticed the following benefits and improvements:</p><h4><strong><em>More focused Missions</em></strong></h4><p>Selecting the right mission phase for the right purpose, setting clear goals and selecting the most efficient outcomes helped enhance our team’s efficiency. For this improvement alone, this Design process was worth it.</p><h4><strong><em>Key design activities</em></strong></h4><p>Checklists gave designers examples of meaningful activities listed per mission, which helped them focus on the project itself. As an example, we partnered with our User Researcher to produce cards that helped designers run better research activities in a two-week timeframe.</p><h4>Increased awareness of the design process</h4><p>As the work of design is often unclear to colleagues on other teams, having clear Discovery design missions with clear deliverables checklists helped raise awareness of design activities for everyone in the project team.</p><h4><strong><em>Greater oversight</em></strong></h4><p>After a few months of practice, we were able to see which missions were the most frequent.
While BUILD and RECON were used equally, we saw that there were significantly fewer EXPLO missions. This helped develop awareness of our upcoming challenges, like participating more in the projects’ genesis. If you want to know what kind of design team you are, categorizing the missions you’ve done across a semester might be a good way to find out.</p><blockquote>As a result, we are more efficient, more focused, and delivering better quality work.</blockquote><figure><img alt="" src="https://cdn-images-1.medium.com/max/721/1*1ZUZSPeoS2Og0i1-C3sm7w.png" /><figcaption>The three discovery missions help oversee the team’s global activity</figcaption></figure><p>We’ve already seen impressive results on our own team, but if applied across the company, we could achieve even more fluid communication and a more efficient pace.</p><p><strong>Shipping faster,</strong> alone, might easily <strong>bring chaos</strong> into teams’ expectations and operations, and puts product quality in danger as a result. But <strong>shipping earlier</strong>, regularly, implies a <strong>sense of measure</strong>, reason and order. That was our intention in bringing in the three Discovery design missions. We hope they’ll continue to positively support our new project organization structure at Dailymotion and that they <strong>will benefit</strong> <strong>other organizations</strong> facing design operations challenges.</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=981a0a2b53e6" width="1" height="1" alt=""><hr><p><a href="https://medium.com/dailymotion/dailymotions-product-design-operations-for-facilitating-shorter-time-to-market-981a0a2b53e6">Dailymotion’s Product design operations for facilitating shorter time to market</a> was originally published in <a href="https://medium.com/dailymotion">Dailymotion</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[How to automate users management in Wireguard]]></title>
            <link>https://medium.com/dailymotion/how-to-automate-users-management-in-wireguard-633ff591866e?source=rss----5ab3bf42288e---4</link>
            <guid isPermaLink="false">https://medium.com/p/633ff591866e</guid>
            <category><![CDATA[adtech]]></category>
            <category><![CDATA[engineering]]></category>
            <category><![CDATA[vpn]]></category>
            <category><![CDATA[golang]]></category>
            <category><![CDATA[wireguard]]></category>
            <dc:creator><![CDATA[Ben]]></dc:creator>
            <pubDate>Thu, 02 Jul 2020 08:50:12 GMT</pubDate>
            <atom:updated>2020-07-02T08:50:12.797Z</atom:updated>
<content:encoded><![CDATA[<h4>Introduction to Asteroid, Dailymotion’s open-source application</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*UIcb7syuVwiPIM4NxD4YRw.jpeg" /></figure><p><strong>Our philosophy at Dailymotion is that if nothing fits our technical needs, we’ll create it. That’s why we developed Asteroid, our home-made open-source application to easily manage our Wireguard server. The app is written in Go and has greatly improved our efficiency when adding and removing access to our infrastructure.</strong></p><p>When I joined the Dailymotion Ad-tech team, a VPN was almost always required to access infrastructure resources, and for this we were using OpenVPN. Out of curiosity, we decided to trial Wireguard. We quickly saw many benefits, including but not limited to: reduced latency, improved performance and an easier and faster setup.</p><p>After this initial testing phase, we started adding more and more people to the system. Unhappy with the need to manually add and remove users, we searched far and wide but were unable to find a tool to automate this aspect of the Wireguard server management.</p><h3>Asteroid, our app created with Go</h3><p>We wanted an application that allows us to easily add, remove and view peers on our Wireguard server. We chose the Go programming language as it has a small footprint and is easy to deploy as a single binary.</p><p>While implementing the SSH connection with Wireguard, we faced some issues with shell escape sequences. They look something like:</p><pre>\e[0;33m</pre><p>They are used for coloring the output on our remote systems.
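A minimal sketch of filtering out such color sequences with a regular expression (shown in Python for brevity; Asteroid itself is written in Go, where the same regex approach applies with the regexp package):

```python
import re

# Matches ANSI color codes such as "\x1b[0;33m" (written "\e[0;33m" in shell docs).
ANSI_COLOR = re.compile(r"\x1b\[[0-9;]*m")

def strip_ansi(text):
    """Remove color escape sequences from remote command output."""
    return ANSI_COLOR.sub("", text)

print(strip_ansi("\x1b[0;33mpeer added\x1b[0m"))  # -> peer added
```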
They’re easy to overlook because your local shell might also hide them.</p><h4>Adding a new peer or user with Asteroid</h4><p>Here’s how Asteroid works. To add a new peer or user, we just run these commands:</p><pre>$ asteroid add -address="172.16.0.7/32" -key <br>"eXaMPL3Ave8q+kmNVmiw4KdKiXc//M0EGOY6K9C14nw"</pre><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*xuviJImtxeUKP3kc2QDDiw.jpeg" /></figure><h4>Removing a peer or user with Asteroid</h4><p>Removing a peer or user is also extremely simple:</p><pre>$ asteroid delete -key "eXaMPL3Ave8q+kmNVmiw4KdKiXc//M0EGOY6K9C14nw"</pre><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*x0K1_fpdVWQPKXhTsE5A-Q.jpeg" /></figure><h4>Viewing peers or users added to the server with Asteroid</h4><p>To view peers or users added to the server, we use the view command:</p><pre>$ asteroid view</pre><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*Py3APhPUO2Ymj_9mIhQuVg.jpeg" /></figure><h4>The help command</h4><p>The “help” command is very useful to check what each command does or which arguments to give:</p><pre>$ asteroid -h</pre><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*herEPZyo16D0SqvLTBwKLw.jpeg" /></figure><h3>Why we chose to go for open source</h3><p>Wireguard was built as an open-source component to improve upon the OpenVPN status quo. We’re happy to have switched to this new alternative, and open-sourcing our Asteroid tool is a way of giving back to the open-source community.
In the coming weeks, we’re thinking of adding a way to batch add and remove users.</p><blockquote>If you want to try it out and contribute, please visit: <a href="https://github.com/dailymotion/asteroid">https://github.com/dailymotion/asteroid</a></blockquote><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=633ff591866e" width="1" height="1" alt=""><hr><p><a href="https://medium.com/dailymotion/how-to-automate-users-management-in-wireguard-633ff591866e">How to automate users management in Wireguard</a> was originally published in <a href="https://medium.com/dailymotion">Dailymotion</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[How to Activate a Global Audience in Less than a Week]]></title>
            <link>https://medium.com/dailymotion/how-to-activate-a-global-audience-in-less-than-a-week-7d2693a7dcc2?source=rss----5ab3bf42288e---4</link>
            <guid isPermaLink="false">https://medium.com/p/7d2693a7dcc2</guid>
            <category><![CDATA[marketing]]></category>
            <category><![CDATA[solution-partner]]></category>
            <category><![CDATA[dailymotion]]></category>
            <category><![CDATA[videos]]></category>
            <category><![CDATA[audience]]></category>
            <dc:creator><![CDATA[Wade Slitkin]]></dc:creator>
            <pubDate>Wed, 24 Jun 2020 14:06:26 GMT</pubDate>
            <atom:updated>2020-06-24T17:23:10.710Z</atom:updated>
<content:encoded><![CDATA[<h3>How to activate a global audience in less than a week</h3><h4><strong>The foundation of Amplification at Dailymotion</strong></h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*e1mJAmowWBDJuxx6Bq_84w.png" /></figure><p><strong>Dailymotion was tasked with activating its global audience in support of an unprecedented live digital event, for a brand-new Partner, in just a few short days. We implemented an amplification strategy that leveraged four key elements: cadence, targeting, creative workflow, and asset management to help the Partner achieve their goals.</strong></p><p>Dailymotion joined the world’s biggest digital platforms to stream “<em>One World: Together At Home</em>”, a live broadcast — in support of the healthcare workers on the frontlines battling the COVID-19 crisis. As a truly global company, we were honored to activate our international technology and communications (and music lovers!) teams to support such a historic live event.</p><p><a href="https://medium.com/dailymotion/how-to-move-300-people-to-full-remote-in-24-hours-34ff9bbb2b59">How to move 300 people to full remote in 24 hours</a></p><h3>Monday — The Kickoff</h3><p>It was a late Monday afternoon when we received word that we would be streaming “<em>Together At Home</em>”, the charity event put together by Global Citizen (our Partner), whose goal was to raise money for the World Health Organization and frontline healthcare workers. From a technology perspective, Dailymotion has been in the online video business for 15 years and has delivered billions of video views; all we needed was access to the live stream and we’d be up and running. No concern there. However, we also needed to activate our worldwide audience to tune in and take action in less than a week.
This warranted an eyebrow raise, at least.</p><p>To give you a sense of scale, we have content teams across the world who help localize communications (35 international destination sites, reaching an audience in more than 50 countries), multi-language social handles, a handful of hyper-targeted newsletters, blogs, press avenues, internal communications, push notifications, in-app notifications and an editorial team who curates and shares relevant content alongside our recommendation algorithm. This huge mechanism gives Dailymotion the unique ability to target, amplify, and subsequently drive specific traffic onsite. However, without a succinct plan, these benefits can turn into detriments in no time.</p><p>Keeping such a nuanced machine running and on-task requires a sound communication plan. This rests on the pillars of <em>cadence</em>, <em>targeting</em>, <em>creative</em> <em>workflow,</em> and <em>asset</em> <em>management</em>. Combined, they inform a consistent and clear channel strategy that engages an audience from awareness through retention.</p><p><a href="https://medium.com/dailymotion/how-dailymotion-hacked-its-feature-team-project-model-to-shorten-time-to-market-618232f671da">How Dailymotion hacked its feature team project model to shorten time to market</a></p><h3>Tuesday — The Plan</h3><p>On Tuesday morning, after an internal ‘kick-off,’ we got to work on how the <em>cadence </em>of our communications would help the Partner achieve their goals (viewership and tune-in). Cadence provides guide rails for everything from engagement to the amount and type of creative needed.</p><p>The first step was to identify the messaging hierarchy. This was a charitable streaming event, so communications had to encourage users to take action with Global Citizen (primary message) and inform them of the live event (secondary message).</p><p>Second, we had to determine when these messages would be pushed live.
It was important to avoid frantic spamming due to the tight timeline (t-minus 5 days), so we set up three messaging windows to control output: pre-event (“<em>take action</em>”), live-event (“<em>tune-in</em>”), and post-event (“<em>thank you and results</em>”). The pre-event phase focused on informing users about #Togetherathome by driving them to the event website. The live-event phase drove users to Dailymotion.com where they could watch. The post-event phase thanked those who tuned in and shared the results (more than $127M raised!).</p><h3>Wednesday — The Target</h3><p>The cadence strategy provided the “<em>when</em>”, so during Wednesday’s stand-up, we were able to address the next piece of the puzzle, the “<em>who</em>”. Dailymotion’s Partners include publishers who leverage the player (streaming technology) as well as viewers (who frequent the destination site, <a href="https://www.dailymotion.com/fr">dailymotion.com</a>). We then mapped those personas to data points showing previous engagement wins per channel, meaning we knew which channels gave us the best chance to activate the desired personas.</p><p>With “when” and “who” locked in, it was time to outline the “how.” A detailed <em>creative workflow</em> document is a marketing manager’s best friend. If constructed concisely and carefully, it saves the team from multiple email threads, Slack messages, and the need for additional meetings. It becomes the source of truth for any in-flight campaign.
The document should consist of:</p><ul><li><strong>Approval flows</strong> (internal and external) which identify stakeholders and current statuses.</li><li><strong>A creative</strong> <strong>checklist</strong> that has primary and alternative messaging with corresponding assets.</li><li><strong>Timing windows</strong> that inform channel managers when to activate.</li><li><strong>Target profiles </strong>that remind channel managers of their campaign audience for further targeting.</li><li><strong>Support contacts </strong>that offer backup personnel for pivots or emergencies.</li></ul><p>There are myriad ways companies streamline this process, so be flexible and take your own resource constraints into account when deciding how to track and execute this step. The one constant to remember is that this is a live document and that it is subject to change from every stakeholder (Partner, channel owners, leadership, etc.). So, encourage channel owners to always be checking the document like the rearview mirror in a car before they schedule or post.</p><h3><strong>Thursday — The Tools</strong></h3><p>Without warning, it was Thursday, meaning we were less than 24 hours away from launch. With <em>cadence</em> established<em>, targets </em>identified, and <em>creative workflows </em>outlined, the teams began to schedule and prep their channels for “<em>go</em>.” The final step was to collate and organize approved assets into a single repository. Saying <em>asset management </em>out loud seems like a no-brainer, but more often than not, this step is overlooked. Teams assume that such an obvious thing is being taken care of…by someone.</p><p>If files and folders are not clearly marked or if there are asset changes or replacements coming down the pike, you’ll have teams borrowing or shoe-horning other creative not optimized for their channels. The wrong asset can make or break a post.
Take your time, use consistent naming conventions, and label, file, and version meticulously.</p><h3><strong>Friday — The Execution</strong></h3><p>Friday meant “<em>go</em>”, and all inbound and outbound channels began to go live. Communication lines were open through the weekend to ensure that each channel was carefully following the schedule and creative guidelines. Any changes were funneled through team comms so edits were captured and implemented in real time.</p><p>This coordinated effort resulted in <strong>more than a million views and hundreds of thousands of concurrent streams</strong>. Dailymotion’s unique position as a technology and video destination allows its passionate storytellers to work hand in hand with its <a href="https://www.dailymotion.com/dm/partner/">Partners</a>. It’s Dailymotion’s pleasure to raise awareness and extend the reach of our Partners worldwide for the videos that matter.</p><p><a href="https://medium.com/dailymotion/how-dailymotion-and-canal-managed-to-host-a-pr-screening-with-closed-cinemas-82dc46b8e89b">How Dailymotion and CANAL+ managed to host a PR screening… with cinemas closed</a></p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=7d2693a7dcc2" width="1" height="1" alt=""><hr><p><a href="https://medium.com/dailymotion/how-to-activate-a-global-audience-in-less-than-a-week-7d2693a7dcc2">How to Activate a Global Audience in Less than a Week</a> was originally published in <a href="https://medium.com/dailymotion">Dailymotion</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[Starting a new job in the midst of a Pandemic]]></title>
            <link>https://medium.com/dailymotion/starting-a-new-job-in-the-midst-of-a-pandemic-d63b9a482395?source=rss----5ab3bf42288e---4</link>
            <guid isPermaLink="false">https://medium.com/p/d63b9a482395</guid>
            <category><![CDATA[jobs]]></category>
            <category><![CDATA[recruiting]]></category>
            <category><![CDATA[remote-working]]></category>
            <category><![CDATA[people]]></category>
            <category><![CDATA[covid19]]></category>
            <dc:creator><![CDATA[Cristina Calle Jordá]]></dc:creator>
            <pubDate>Thu, 30 Apr 2020 09:33:51 GMT</pubDate>
            <atom:updated>2020-05-04T08:29:47.316Z</atom:updated>
            <content:encoded><![CDATA[<h4>What does remote-joining look like for a new employee at Dailymotion in times of Coronavirus?</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*_JKGLN6ouW3-sHyTWNEzgA.jpeg" /></figure><p><strong>Starting a new position should always be a thrilling experience. After a testing recruitment process, you proved your value and the long-awaited day is finally here. I was excited about the change and the new, out-of-the-ordinary challenges waiting for me at Dailymotion. Little did I know that there was a Pandemic looming just around the corner. When I said out of the ordinary, I did not think it would mean being part of Dailymotion from the comfort of my home, but hey, your career should never stop surprising you. This is how my adventure as a </strong><a href="https://medium.com/dailymotion/how-do-i-legitimately-hire-tech-profiles-as-an-hr-166d72d379d4?source=collection_category---5------2-----------------------"><strong>Dailymotion Talent Acquisition Specialist</strong></a><strong> began.</strong></p><h3>First day</h3><p>Thinking about any first day, that newbie feeling can bring up a cocktail of emotions: some excitement, some confusion. But how do you manage all that while fully remote? Well, it’s not all that different from sitting down at your office desk. From the first day, I could feel that teams took on a <a href="https://medium.com/dailymotion/are-you-ready-to-take-action-cc0a539a2152">“champion each other”</a> mentality and made sure that I was encouraged as well. In many ways, I believe I have been looked after all the more, since everyone understood that starting under these special circumstances would make it harder to feel comfortable and part of the team. Sensing that the situation was strange, I tried to avoid feeling lost. My focus during those first days was on creating a sense of team and community.
Therefore, my activities focused mainly on accepting invitations: meetings inside and outside my department, team lunches, after-work drinks, themed webinars, and “blind-test” games. Anything that could help me establish communication and get an introduction to the world of Dailymotion.</p><p><a href="https://medium.com/dailymotion/are-you-ready-to-take-action-cc0a539a2152">Are you ready to take action?</a></p><h4>Not your typical first impression</h4><p>Part of the excitement of starting a new role is that you get to meet inspiring new people, discover different ways of working, and, in my case, a whole new industry. A new job means new challenges and new experiences. Imagine having to develop a strong relationship with your manager even though you have only met them a couple of times. At Dailymotion, I could sense that my feelings were acknowledged. It was alright to feel lost and frustrated. Trust was, and still is, being created through communication and reassurance that if I had made it this far, I could only do better. My fellow newbie colleagues and I are demonstrating that we can get out of our comfort zones. We get a truly uncommon onboarding experience: we get to meet and experience Dailymotion live before setting one foot in the office.</p><h3>Business and fun as usual</h3><p>At Dailymotion, we are indeed fighting the COVID-19 backlash, but with a fighting response of our own: lots of positivity, good spirits, and support. This became clear to me from day one, as every single hour of my first week was scheduled, with dozens of meetings planned, including every kind of office tradition that could be moved to a Zoom call. Our <a href="https://medium.com/dailymotion/are-you-ready-to-take-action-cc0a539a2152">Principles of Motion</a> got put to the test and sparked a creativity that was remarkable to witness as we all made the best of an unprecedented situation.
The Dailymotion community has come together in a very present way over Slack. Everyone understands that communication is key now more than ever, but we don’t do it only for our work; I believe we also do it to champion each other. And we are very creative at it too — you’ll find memes, GIFs, and inside jokes on every channel, while our statuses get funnier and funnier as people find new ways to express their moods and activities through emojis.</p><h4>Making it work</h4><p>To be clear, we are not all thrilled about the idea of working from home. I’ve only visited the offices a couple of times, and, as with other experiences, seeing it live must be much more exciting. However, our mission is a collective one, and we are going through it with high spirits and adaptability. From my experience, it is all about taking it one day at a time while keeping a clear long-term picture. My team meets every day, we do regular syncs, and everyone is up for even a virtual lunch or coffee break together. And as we deal with the now, we are also preparing for what is yet to come: BIG projects, the exciting, groundbreaking kind. So maybe this was the perfect opportunity to step back, avoid the office distractions, and do some of our best work.</p><h3>Doing the best that you can</h3><p>Being part of a new team during quarantine can seem a little odd at times. Even if I am the one being welcomed as the newest member of the group, it seems that we are all meeting new versions of our colleagues. As everyone does their best to create the most comfortable and professional working environment at home, we have all experienced some peculiar situations.</p><h4>Come and see my house, everyone already has</h4><p>A side effect of being quarantined while working is that everybody gets to see you in a new light. When starting this adventure at Dailymotion, one of my worries was making a good first impression.
That can become a little difficult when a cat keeps jumping in front of the camera… At Dailymotion we have officially seen it all at this point: kids, partners passing by, pets, and of course a great competition for the best virtual Zoom background. Being exposed from the first day felt a little uncomfortable to me; that social and professional barrier was suddenly lifted. But it is all part of the process, and it adds a touch of humanity. We are first and foremost humans, and yes, employees have a life outside of work.</p><p>When creating an onboarding plan for new joiners, Dailymotion makes a point of creating a guided and impactful experience. Considering the COVID-19 circumstances, it is safe to say that I will certainly remember my first days and weeks at Dailymotion. From a personal perspective, feeling acknowledged and part of the community was essential to doing my job. This experience has so far served as a test for multiple initiatives, such as making the case for remote work and building trust in everyone’s potential. I’m looking forward to what is yet to come from the comfort of my dining table, and hopefully soon from the Dailymotion headquarters!</p><p><a href="https://medium.com/dailymotion/how-to-move-300-people-to-full-remote-in-24-hours-34ff9bbb2b59">How to move 300 people to full remote in 24 hours</a></p><hr><p><a href="https://medium.com/dailymotion/starting-a-new-job-in-the-midst-of-a-pandemic-d63b9a482395">Starting a new job in the midst of a Pandemic</a> was originally published in <a href="https://medium.com/dailymotion">Dailymotion</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[How Dailymotion and CANAL+ managed to host a PR screening… with cinemas closed]]></title>
            <link>https://medium.com/dailymotion/how-dailymotion-and-canal-managed-to-host-a-pr-screening-with-closed-cinemas-82dc46b8e89b?source=rss----5ab3bf42288e---4</link>
            <guid isPermaLink="false">https://medium.com/p/82dc46b8e89b</guid>
            <category><![CDATA[screening]]></category>
            <category><![CDATA[press]]></category>
            <category><![CDATA[public-relations]]></category>
            <category><![CDATA[series]]></category>
            <category><![CDATA[solution-partner]]></category>
            <dc:creator><![CDATA[Colas Courjal]]></dc:creator>
            <pubDate>Fri, 10 Apr 2020 11:56:38 GMT</pubDate>
            <atom:updated>2020-04-10T12:10:42.588Z</atom:updated>
            <content:encoded><![CDATA[<figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*aj3sZEfSbdWsiT0o2B-Uog.jpeg" /><figcaption>Virtual VIP preview: available at home for all guests with a password</figcaption></figure><p><strong>The launch of a new season from CANAL+’s “Original Creation” catalog is usually an opportunity to organize a VIP event with a screening of the first episode. Guests, including producers, talent, partners, press, and advertisers, are invited to a VIP venue in Paris… the perfect occasion for everyone to get a sneak preview of the new season and to chat with the production team. But that was before COVID-19.</strong></p><p>As a major digital broadcaster, Dailymotion was in a unique position to offer an alternative way to promote this major series. And that’s what we did for the new season of The Bureau (<em>Le Bureau des Légendes</em>), thanks to strong collaboration with the Communication, Public Relations, and Digital teams at CANAL+.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*VBqRWKqGWo3940j1UpVcew.png" /><figcaption><strong>The Bureau (Le Bureau des Légendes), </strong>Season 5: 10 episodes of 52 minutes,<br>created by Éric Rochant. Season 5 from Monday, April 6, <a href="https://www.canalplus.com/series/le-bureau-des-legendes/">only on CANAL+</a></figcaption></figure><h3>How to be social during social distancing?</h3><p>The PR screening of the first episode of The Bureau had been planned for a long time. Confinement in France was announced on the evening of March 16, 2020 by the President of the Republic, Emmanuel Macron, thereby leading to the de facto cancellation of all public and private events, especially those organized in closed spaces. The production team quickly reorganized themselves to finish the final post-production touch-ups, as the broadcasting date of the new season on CANAL+ and myCanal was not to be delayed.
The first episode was scheduled for Monday, April 6th.</p><p>CANAL+ already manages multiple channels on Dailymotion, where they publish trailers, short extracts, and promotional content around their programs. So, the idea of organizing a dedicated digital PR event on Dailymotion was quite logical.</p><h3>A new form of digital PR event</h3><p>The video was uploaded by the CANAL+ Digital Team in private mode and protected with a password. An invitation email was sent to all the guests, providing them with the link to the episode and the password.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*tcS5alR9NIVvsCaUkJR9_g.png" /></figure><p>Before 8 pm, CANAL+ published the trailer, and at 8 pm they replaced the trailer video with the first episode. Video replacement is a unique Dailymotion feature, available to verified partners, that allows them to change the video source without modifying the video URL. In this case, it was essential that guests who had already received the invitation email earlier in the afternoon could still access the new video.</p><h3>Short and private, just like IRL</h3><p>Just like a real PR screening, the video was only available for four hours. Additionally, as this content is sensitive with regard to piracy, the video was also protected by HLS encryption.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*SSSYHx0mR7prqLw6Mb1oVw.png" /><figcaption>Introductory speech from Gérald-Brice Viret,<br>General Manager of Canal+ Group Programs</figcaption></figure><p>CANAL+ also decided to include speeches from Gérald-Brice Viret, General Manager of Canal+ Group Programs; Alex Berger, President &amp; Executive Producer of TOP (The Oligarchs Productions); and Éric Rochant, Showrunner of The Bureau (“<em>Le Bureau des Légendes</em>”).
It was an original way to replace the usual introductory speeches, giving guests important information about context, artistic intentions, broadcast planning, special thanks, and fun facts.</p><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*Yc5kAaXhw4OBLR-63TiZ8Q.png" /><figcaption>Introductory speech from Éric Rochant,<br> The Bureau (Le Bureau des Légendes) Showrunner</figcaption></figure><p>Thanks to the Advanced Statistics tool available in the <a href="https://www.dailymotion.com/dm/partner">Dailymotion Partner Space</a>, we estimated the audience at <strong>more than 400 simultaneous viewers</strong> (80% on computer, 15% on mobile, and 5% on tablet). That’s a huge virtual room! We are proud to have been part of this digital adventure and to have helped media and communication teams ensure the continued promotion of their premium content to confined audiences.</p><blockquote>“We are delighted with the incredible success of this virtual projection, which was very appreciated. We are also very proud to have had the opportunity to promote one of our major series, Le Bureau des Légendes, despite the confinement”, <strong>Emilie Pietrini, Chief Communication and Brand Officer of CANAL+ Group</strong>.</blockquote><p>The current crisis is forcing everyone to quickly rethink standard communication patterns.
This initiative can also inspire partners working in PR, communications, and marketing.</p><p><strong><em>Special thanks</em></strong><em> to the whole team of The Bureau (Le Bureau des Légendes); Alex Berger, President &amp; Executive Producer of TOP (The Oligarchs Productions); and Éric Rochant, Showrunner of the series.</em></p><hr><p><a href="https://medium.com/dailymotion/how-dailymotion-and-canal-managed-to-host-a-pr-screening-with-closed-cinemas-82dc46b8e89b">How Dailymotion and CANAL+ managed to host a PR screening… with cinemas closed</a> was originally published in <a href="https://medium.com/dailymotion">Dailymotion</a> on Medium, where people are continuing the conversation by highlighting and responding to this story.</p>]]></content:encoded>
        </item>
    </channel>
</rss>