The Ultimate Guide to Video Annotation for Computer Vision (2026)

Picture a self-driving car navigating a busy street, flawlessly avoiding obstacles, adhering to traffic signals, and safely reaching its destination, all without human intervention. This remarkable feat is a testament to the power of artificial intelligence (AI), specifically computer vision. But how do these systems acquire such sophisticated understanding? The answer lies in vast amounts of meticulously annotated video data. Through a process known as video annotation, raw video footage is transformed into structured, labeled data that computer vision models can learn from and apply to real-world applications such as autonomous vehicles.

If you are interested in learning more about video annotation, this guide provides a comprehensive overview of what the process is, its significance, techniques, applications, and best practices.

What is Video Annotation?

Video annotation is the process of labeling or masking specific objects in videos based on their types or categories. A human annotator or labeler highlights specific parts of a video frame and tags them with a label. The annotated video dataset then becomes the ground truth used to train computer vision models, often through supervised learning. By learning from each label or mask, the machine learning algorithm becomes more adept at associating visual data with real-life objects, much as humans do.

Video annotation is laborious: human labelers patiently identify and classify multiple objects frame after frame. Often, they’ll use automated video annotation software to speed up the process.

Why is Video Annotation Important for Computer Vision?

Startups and global enterprises are in a race to bring state-of-the-art computer vision systems to market. By 2031, the computer vision market is predicted to hit US $72.66 billion. But to compete and thrive in this industry, relying on a state-of-the-art model architecture isn’t enough.

By itself, a computer vision model cannot interpret objects from video data correctly. Like other machine learning algorithms, it needs to learn from datasets curated and annotated for a specific application. It is through the process of video annotation that we provide the necessary context for the model to learn.

Let’s take a traffic monitoring system as an example. Without learning from an annotated dataset, the computer vision model can’t identify cars, pedestrians, and other objects the camera captures. Instead, the system sees only raw pixel values: contrasts, hues, and brightness in each frame that passes through. But that changes when you annotate the video.

[gif: annotating a moving car with a bounding box]
https://www.pexels.com/video/a-car-travelling-in-a-road-built-at-lakeside-3065047/
Or
https://www.pexels.com/video/aerial-view-of-bridge-and-river-2292093/
Caption: Annotating a car in aerial capture.

For example, you can place a bounding box on a car to teach the computer vision model to identify it as such. Likewise, you can train the model to identify pedestrians by drawing keypoints on people. We’ll cover more of this later. But the point is this: video annotation makes a computer vision model smarter by training it to interpret video data just as we interpret what we see in real life.

Computer vision models operate on the garbage-in, garbage-out principle: feed a model low-quality data, and it produces inaccurate results. That makes the dataset the model trains on just as critical as the model itself, which calls for rigorous annotation quality.
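To make “structured, labeled data” concrete, here is a minimal sketch of what a single annotated object might look like once exported. The field names are illustrative only; real export formats (CVAT XML, COCO, YOLO, and others) each define their own schema.

```python
# A minimal, illustrative annotation record for one object in one frame.
# Real formats differ in detail, but they all capture the same essentials:
# where the object is, what it is, and when it appears.
annotation = {
    "frame": 1042,                 # which frame of the video
    "track_id": 7,                 # persistent ID for this object across frames
    "label": "car",                # the class a human annotator assigned
    "bbox": [312, 180, 498, 290],  # [x_min, y_min, x_max, y_max] in pixels
    "occluded": False,             # whether the object is partially hidden
}
```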
Video Annotation vs. Image Annotation: What's the Difference?

Video annotation is a subset of data annotation, which also includes image annotation. Some people draw similarities between the two types of annotation. The common argument goes: video is just a sequence of image frames, so just as you can draw a bounding box on an image, you can draw one on a still frame of a video.

But that’s where the similarity ends. Video annotation is more suitable for use cases that require temporal context, such as motion, occlusion, and object persistence across frames.

[an image of one annotated object (one frame) vs. a gif of the same object annotated across multiple frames]
https://drive.google.com/file/d/10u-utkq2CfXwPq_iyxloI_xSgOiv3Dtu/view?usp=sharing
Caption: Annotating an image vs. video.

That said, video annotation is also more complex, which is why annotators use automated data labeling tools like CVAT to assist their efforts. Image annotation, by contrast, is simpler, as annotation is limited to a static visual.

Video Annotation Use Cases and Applications

As computer vision evolves, so does adoption across industries. Here are real-life applications where computer vision models, trained on annotated videos, are making an impact.

Autonomous vehicles

At the heart of an autonomous vehicle is an AI-powered system that processes real-time video streams to navigate complex environments safely. To achieve this level of precision, perception systems rely on millions of labeled examples of real-world driving scenarios. This training gives the AI the robustness needed to handle unpredictable events like sudden pedestrian crossings or multi-lane intersections, ensuring the vehicle adheres to traffic rules and avoids obstacles in real time.

Common annotation tasks used to build these perception systems include:

- Bounding boxes and cuboids to detect and classify vehicles, pedestrians, and cyclists.
- Polylines to identify lane markings and road boundaries.
- Semantic segmentation to define the drivable surface area and ensure the vehicle stays on the road.
- 3D LiDAR point clouds to build a depth-aware model of the surrounding environment.

The accuracy of these annotations directly impacts the safety and reliability of the self-driving system, making high-quality video labeling a non-negotiable part of development.

Healthcare

Doctors, nurses, and medical staff benefit from imaging systems trained on annotated video datasets. Conventionally, they rely on manual observation to detect anomalies like polyps, cancers, or fractures. Now, they’re aided by computer vision-powered technologies that help them diagnose more accurately. This technology is moving beyond static scans into dynamic video analysis, allowing models to understand procedural flows and temporal changes in tissue. For surgical applications, this means an AI can learn to anticipate a surgeon’s next move or highlight critical anatomical structures in real time.

Key applications in healthcare include:

- Annotating surgical videos to train AI-assisted surgical guidance systems.
- Labeling endoscopy and colonoscopy footage to automatically detect polyps and lesions.
- Tracking organ movement in ultrasound and MRI sequences for anomaly detection.
- Monitoring patient video feeds to detect falls or abnormal movement in hospital settings.

By training models on expertly annotated procedural videos, healthcare institutions can improve diagnostic speed, enhance surgical precision, and create more effective training tools for the next generation of clinicians.
Agriculture

In agriculture, video annotation helps train computer vision models to monitor how crops, livestock, and machinery change and move over time. This is especially useful in farming environments, where important patterns such as plant growth, animal behavior, or signs of pest activity often become clear only across a sequence of frames rather than in a single image. Because manual inspection across large fields is time-consuming, difficult to scale, and hard to sustain consistently, farmers and agronomists can use AI systems trained on labeled video data to spot patterns and issues that might otherwise be missed.

[gif: tracking of a tractor?]
Caption: Tracking harvester movement across farms.

Common use cases in agriculture data annotation include:

- Analyzing drone footage to monitor crop health and estimate yields.
- Identifying weeds and pests with polygon annotation for targeted spraying.
- Tracking livestock and analyzing behavior using keypoint and skeleton annotation.
- Mapping machinery paths and detecting obstacles for autonomous farm equipment.

These applications help farmers make more informed, data-driven decisions, leading to increased efficiency, reduced waste, and more sustainable farming practices.

Manufacturing

Product defects, left unnoticed, can hurt manufacturers both financially and reputationally. A visual-inspection system trained on annotated datasets allows for more precise quality checks. Such systems also create a safer workspace by proactively detecting abnormal or unsafe situations. Modern manufacturing relies on high-speed production lines where human inspection can be a bottleneck. AI-powered quality control, trained on annotated video, can identify in real time subtle defects that are invisible to the naked eye, ensuring higher product quality and throughput.

Typical annotation tasks in manufacturing include:

- Labeling surface defects, cracks, and irregularities on production lines.
- Annotating worker movements and posture to monitor for safety compliance.
- Detecting objects for robotic pick-and-place automation systems.
- Tracking assembly progress to verify correct component placement in complex products.

By integrating annotated video into their workflows, manufacturers can significantly reduce error rates, improve worker safety, and increase overall operational efficiency.

Security surveillance

Another area where video annotation is in demand is security surveillance. CCTV cameras allow security officers to oversee people’s movement in real time, but officers can struggle to spot suspicious behavior, especially when monitoring multiple feeds. With computer vision, incidents can be prevented: the system picks up patterns it was trained to identify and promptly alerts the officers.

Key annotation use cases for surveillance include:

- Detecting and tracking individuals across multiple camera feeds.
- Estimating crowd density and flow using bounding box and polygon annotation.
- Identifying anomalous behavior like loitering, trespassing, or abandoned objects.
- Training facial recognition models using keypoint and bounding box labels.

These AI-driven systems augment human security teams, enabling faster response times and more effective monitoring of large public and private spaces.
Traffic management

Traffic rule violations, congestion, and accidents are concerns every government wants to resolve, and computer vision improves the odds of doing so. Once trained, an AI model can analyze traffic patterns, recognize license plates, and identify accidents from camera feeds. Smart city initiatives rely heavily on intelligent traffic systems to improve flow and safety. By training models on annotated video from roadside cameras, cities can dynamically adjust traffic signals, detect incidents in real time, and gather valuable data for long-term urban planning.

Common annotation tasks for traffic management include:

- Classifying vehicles by type (cars, trucks, motorcycles, buses).
- Annotating license plate regions for automated number plate recognition (ANPR).
- Labeling traffic lights and road signs for intersection management systems.
- Detecting incidents like accidents, stalled vehicles, and road blockages.

This data allows for the creation of adaptive traffic networks that can reduce congestion, lower emissions, and improve the daily commute for thousands of people.

Disaster response

First responders need to make prompt and accurate decisions to save lives and property during large-scale emergencies. Computer vision technologies, coupled with aerial video footage, can help responders strategize rescue operations. For example, emergency teams send drones equipped with computer vision algorithms to locate victims of wildfires. In the chaotic aftermath of a natural disaster, situational awareness is critical. Annotated aerial and ground-level video helps train models that can quickly assess damage, identify passable routes, and locate signs of human activity, providing a crucial intelligence layer for rescue teams.

Annotation applications in this field include:

- Labeling aerial drone footage to detect survivors and victims.
- Assessing structural damage by identifying destroyed or compromised buildings.
- Segmenting flood and fire boundaries for resource deployment planning.
- Annotating thermal imagery to locate heat signatures in search-and-rescue operations.

Beyond these industries, computer vision systems trained on annotated video are also transforming robotics, sports analytics, retail, and many other sectors.

What Are the Main Video Annotation Techniques?

In video annotation, you aren’t just labeling a static image; you are creating an object track. The goal is to maintain the identity and spatial accuracy of an object as it moves through time. So how do you do this?

Identifying and Monitoring Through Object Tracking

Object tracking is the process of assigning a persistent unique identifier to a target across a continuous sequence of frames. In a professional environment, tracking is a hybrid process where human expertise and machine precision work in tandem to ensure data integrity. Instead of manually drawing a box on every single frame, a high-efficiency tracking process follows this collaborative cycle:

- Initialization and Identity: A human annotator identifies the target object and assigns a persistent unique ID. This ensures that Car 1 in the first frame remains Car 1 throughout the entire sequence, providing the foundational data needed for re-identification and behavioral analysis.
- AI-Powered Pixel-Level Locking: Once the object is defined, advanced algorithms like SAM 2 take over. The AI locks onto the specific visual features of the target, automatically adjusting the label coordinates as the object moves, rotates, or changes scale, even through shifts in lighting or camera angles.
- Human-in-the-Loop Verification: The annotator transitions from drawing to supervising. They monitor the automated track and step in only to provide corrective keyframes if the model loses its lock due to extreme motion, blur, or complex interactions.

This integrated approach allows your team to manage the high-level logic of identity and intent while the machine handles the repetitive pixel-tracking.
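To make the ID bookkeeping concrete, here is a deliberately simplified sketch, not how CVAT or SAM 2 work internally, of one classic heuristic: greedily carrying a track ID forward to whichever detection in the next frame overlaps it most, measured by intersection-over-union (IoU). The function names and threshold are illustrative.

```python
def iou(a, b):
    """Intersection-over-union of two [x1, y1, x2, y2] boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter) if inter else 0.0

def assign_ids(prev_tracks, detections, next_id, threshold=0.3):
    """Greedily carry track IDs forward; give new IDs to unmatched detections.

    prev_tracks: {track_id: box} from the previous frame.
    detections:  [box, ...] found in the current frame.
    Returns ({track_id: box} for the current frame, next unused id).
    """
    tracks = {}
    unmatched = dict(prev_tracks)
    for box in detections:
        best_id, best_iou = None, threshold
        for tid, prev_box in unmatched.items():
            score = iou(box, prev_box)
            if score > best_iou:
                best_id, best_iou = tid, score
        if best_id is None:          # no sufficient overlap: a new object
            best_id, next_id = next_id, next_id + 1
        else:
            del unmatched[best_id]   # each old track matches at most once
        tracks[best_id] = box
    return tracks, next_id
```

Real trackers layer motion models, appearance features, and occlusion handling on top of this kind of matching, but the core idea of a persistent ID surviving from frame to frame is the same.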
Scaling Efficiency Through Interpolation and Occlusion Management

Interpolation and occlusion management are the primary mechanisms for handling the high volume and complexity of video data. These processes allow annotators to maintain high-quality labels without manually interacting with every individual frame. A streamlined workflow for managing motion and visual breaks looks like this:

- Keyframe Interpolation: Annotators identify the specific keyframes where an object begins, ends, or changes its path of motion. The software uses these anchors to calculate the object’s position for all intermediate frames, reducing manual labor by up to 90% in predictable sequences (see the sketch after this list).
- Addressing Occlusion: When a target is partially or fully obscured by another object, the track remains active but is marked as occluded. This informs the model that the object is still present in the scene, which is critical for training the spatial awareness required in autonomous systems.
- Re-entry and Continuity: When an object re-emerges from behind an obstacle, the annotator resumes the track using the same unique ID. This maintains temporal continuity, teaching the model that a physical object is a persistent entity even when it is temporarily out of sight.

By focusing manual effort only on frames with significant changes and managing visual breaks with logic-based states, these techniques make it possible to process hours of high-resolution footage.
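The arithmetic behind keyframe interpolation is simple linear blending. Here is a minimal sketch, assuming axis-aligned boxes and roughly constant velocity between keyframes; annotation tools ship more polished versions of the same idea.

```python
def interpolate_box(key_a, key_b, frame):
    """Linearly interpolate a bounding box between two keyframes.

    key_a, key_b: (frame_number, [x1, y1, x2, y2]), with key_a earlier.
    Returns the estimated box at `frame`.
    """
    (fa, box_a), (fb, box_b) = key_a, key_b
    t = (frame - fa) / (fb - fa)  # 0.0 at key_a, 1.0 at key_b
    return [a + t * (b - a) for a, b in zip(box_a, box_b)]

# Keyframes at frames 10 and 20; the tool fills in frames 11 through 19.
start = (10, [100, 100, 180, 160])
end = (20, [200, 120, 280, 180])
for f in range(11, 20):
    print(f, [round(v) for v in interpolate_box(start, end, f)])
```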
Classifying Behavior Through Action and Event Annotation

While tracking follows an object, action annotation, also known as temporal segmentation, labels the behavior occurring within a specific timeframe. Instead of just identifying a person, you are identifying the start and end points of a specific activity. A typical workflow for event-based labeling includes:

- Start and End Triggers: Annotators define the exact frame where an action begins (e.g., a car starting a left turn) and where it concludes, creating a temporal segment.
- Multi-Labeling Tracks: A single object track can have multiple sequential or overlapping action labels, such as a person walking, then stopping, then checking their phone.
- Global Scene Classification: Some events apply to the entire video rather than a single object, such as a change in weather or a specific traffic phase (e.g., a green light duration).

By segmenting video into these discrete behavioral chunks, you enable models to recognize intent and predict future actions.

Defining Spatial Boundaries With Video Annotation Primitives

Understanding the mechanics of tracking and interpolation is only half the battle. You must also apply specific geometric shapes, or primitives, to define the boundaries of your target.

Bounding boxes

A bounding box is the simplest type of annotation you can make on a video. The annotator draws a rectangle over an object, which is then tagged with a label. It’s suitable when you need to classify an object and aren’t concerned about separating it from background elements. For example, you can draw a rectangular box over a dog and tag it as an animal.

[gif: example of drawing a bounding box over a moving object]
https://www.pexels.com/video/footage-of-the-scenery-shot-through-the-car-window-on-a-moving-car-3006972/
Caption: Bounding box on a moving vehicle.

While simple, bounding boxes are foundational for many computer vision tasks. Their efficiency makes them ideal for large-scale projects where the primary goal is to locate and identify objects within the frame, without needing to understand their exact shape.

Common tasks for this annotation type include:

- Drawing rectangles around vehicles, pedestrians, and signs for traffic analysis.
- Placing boxes over products on a shelf for retail inventory management.
- Identifying and classifying different types of animals in wildlife footage.

Despite its simplicity, mastering bounding boxes is a critical skill, as it underpins a wide range of object detection and classification pipelines.

Polygons

Like bounding boxes, polygons enclose an object in a video frame. However, you can exclude unwanted background information by drawing the polygon along the object’s outline. Polygons are usually used to label complex, irregular objects.

[gif: example of labeling with a polygon]
https://www.pexels.com/video/footage-of-the-scenery-shot-through-the-car-window-on-a-moving-car-3006972/
Caption: Polygon annotation of a car.

This method provides a much higher level of precision, which is critical for instance segmentation tasks where the model must learn the exact shape of an object. The additional detail comes at the cost of increased annotation time and effort.

Key applications for polygon annotation involve:

- Outlining individual vehicles in a crowded street scene for autonomous driving.
- Segmenting specific organs or tumors in medical imaging videos.
- Tracing the shape of individual plants for agricultural yield analysis.

When a project demands pixel-level accuracy, polygon annotation is typically the preferred choice over bounding boxes.
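One way to see the trade-off between these two primitives is that every polygon implies a bounding box, but a box says nothing about the shape inside it. A small sketch, with made-up outline coordinates for illustration:

```python
def polygon_to_bbox(points):
    """Tightest axis-aligned bounding box around a polygon.

    points: [(x, y), ...] vertices traced along the object's outline.
    """
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return [min(xs), min(ys), max(xs), max(ys)]

# A rough car outline; the polygon hugs the shape, the box does not.
car_outline = [(120, 210), (150, 180), (230, 175), (260, 205),
               (255, 240), (125, 240)]
print(polygon_to_bbox(car_outline))  # [120, 175, 260, 240]
```

Everything inside that box but outside the polygon is exactly the background that polygon annotation lets you exclude.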
Polylines

Polylines are sequences of connected line segments drawn through multiple points. They are helpful when you’re annotating linear objects across frames, such as roads, railways, and pathways.

[gif: example of the tool]
https://www.pexels.com/video/railway-in-the-middle-of-the-woods-2530273/
Caption: Polyline annotation for railway.

Unlike polygons, polylines do not need to form a closed shape, making them perfect for defining paths, lanes, and trajectories. They are essential for training models that need to understand directional movement and linear features in an environment.

Typical uses for polylines include:

- Defining road lanes and boundaries for autonomous vehicle navigation.
- Mapping utility lines or cracks in infrastructure from aerial footage.
- Tracking the path of a moving object, such as a ball in a sports game.

In practice, polyline annotation is often used alongside polygon annotation on the same project, with each tool applied to the object type it suits best.

Ellipses

Ellipse annotations are oval-shaped and drawn over objects with a similar geometric outline. For example, you can use ellipses when annotating eyes, balls, or bowls.

[gif: example of the tool]
https://www.pexels.com/video/a-girl-bouncing-a-tennis-ball-off-her-racket-8224214/
Caption: Ellipse annotation for a tennis ball.

For objects that are consistently round or oval, using an ellipse is significantly faster and more efficient than drawing a multi-point polygon. It provides a good balance between the speed of a bounding box and the precision of a polygon for specific object types.

This tool is particularly effective for:

- Annotating fruits on a tree for automated harvesting systems.
- Tracking balls and other equipment in sports analytics videos.
- Labeling circular gauges and dials on a control panel for industrial automation.

The ellipse tool is a small but valuable addition to any annotator’s toolkit, saving significant time on projects with round or oval objects.

Keypoints & skeletons

Some video annotation projects require pose estimation and motion tracking. That’s where keypoint and skeleton annotation come in handy. Keypoints are tags assigned to specific parts of an object, such as body joints and facial features. The machine learning algorithm can then track how they move relative to each other. On top of that, you can join keypoints to form skeletons, which helps track body movement more precisely.

[gif: skeleton annotation for pose estimation]
https://www.pexels.com/video/a-horse-running-in-an-open-field-8624901/
Caption: Skeleton annotation for tracking a horse’s movement.

This technique is fundamental for applications that need to understand the posture, gestures, and actions of humans or animals. By tracking the movement of interconnected keypoints, a model can learn complex behaviors that are impossible to capture with other annotation types.

Core applications for this technique are:

- Estimating human poses in fitness and physical therapy applications.
- Analyzing the gait of an animal for veterinary science and behavioral studies.
- Capturing subtle facial expressions for emotion recognition and avatar animation.

Skeleton annotation is one of the more technically demanding annotation types, but it unlocks a level of behavioral understanding that no other method can match.

Cuboids

Cuboids let annotators label 3D objects with a fairly uniform structure, such as furniture, buildings, or vehicles. A cuboid encodes spatial information, such as orientation, size, and position, that you can use to train computer vision models. By adding the third dimension of depth, cuboids provide a much richer understanding of an object’s presence in 3D space. This is essential for any application where the model needs to interact with or navigate around real-world objects, such as in robotics and autonomous driving.

Annotators use cuboids for tasks like:

- Drawing 3D boxes around cars, trucks, and pedestrians for AV perception.
- Labeling packages on a conveyor belt for automated sorting in logistics.
- Defining the volume of furniture for augmented reality placement.

3D cuboid annotation is increasingly in demand as autonomous systems require more spatially aware training data.
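As a sketch of what a cuboid annotation encodes, here is one common parameterization, a center point, dimensions, and a yaw (heading) angle, expanded into eight corner coordinates. Conventions for axes and angles differ between datasets and tools, so treat the choices here as assumptions.

```python
import math

def cuboid_corners(cx, cy, cz, length, width, height, yaw):
    """Expand a center/size/yaw cuboid into its 8 corner coordinates.

    Yaw is the rotation around the vertical (z) axis, in radians.
    """
    c, s = math.cos(yaw), math.sin(yaw)
    corners = []
    for dx in (-length / 2, length / 2):
        for dy in (-width / 2, width / 2):
            for dz in (-height / 2, height / 2):
                # Rotate the offset in the ground plane, then translate.
                x = cx + dx * c - dy * s
                y = cy + dx * s + dy * c
                corners.append((x, y, cz + dz))
    return corners

# A car-sized cuboid 15 m ahead, angled 30 degrees off-axis.
for corner in cuboid_corners(15.0, 0.0, 0.75, 4.5, 1.8, 1.5, math.radians(30)):
    print(tuple(round(v, 2) for v in corner))
```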
How to Choose the Right Video Annotation Tool

Beyond knowing how to annotate a video, you also need a tool that helps you execute the steps. With a growing number of video annotation platforms available, selecting the right one is a critical decision. The best tool for your project will depend on factors like the annotation types you require, the scale of your dataset, your budget, and whether you need advanced features like AI-assisted labeling or collaborative workflows. The table below outlines the most common annotation tool types.

| Tool Type | Best For | CVAT Implementation |
| --- | --- | --- |
| Open Source | Developers and researchers who want full control, data privacy, and have the resources to self-host. | CVAT Community: The core open-source version. It’s free to use and can be deployed on your own local servers or private cloud. |
| Hosted / SaaS | Teams that want to start immediately without managing servers, but still want to do the labeling themselves. | CVAT Online: A cloud-based platform accessible directly in your browser. It’s the fastest way to get your team up and running. |
| Enterprise | Organizations requiring scale, advanced security (SSO/LDAP), dedicated support, and team performance analytics. | CVAT Enterprise: The professional tier designed for large-scale production teams who need guaranteed uptime and high-level compliance. |
| Managed Services | Teams that need to outsource the actual labor of labeling to a workforce of professional annotators. | Professional Services: While CVAT is a platform, we offer specialized services for teams that need high-quality data at scale without hiring in-house. |

What Are the Key Challenges in Video Annotation?

Video annotation is key to enabling state-of-the-art computer vision applications. But creating accurate and consistent datasets remains challenging, even for experienced annotators and ML teams. If you’re starting a video annotation project, be mindful of these challenges.

Labeling inconsistency

Human labelers play a vital role in video annotation, regardless of the tools you use. Annotation results are therefore subject to individual interpretation. For example, one annotator may classify a dog as a Poodle, while another may label it a Toy Poodle. The labels are similar but, as far as machine learning algorithms are concerned, not the same.

A practical way to enforce consistency is to measure Inter-Annotator Agreement (IAA) regularly. This metric quantifies how often different annotators assign the same label to the same object (a minimal computation is sketched after the list below). Low IAA scores are a signal that your guidelines need to be clarified or that additional training is required.

Common ways to improve consistency include:

- Creating a detailed labeling guide with visual examples of edge cases.
- Running calibration sessions where annotators label the same sample and compare results.
- Using consensus annotation, where multiple annotators label the same frame and a majority vote determines the final label.
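Here is a minimal sketch of that IAA computation for two annotators who labeled the same set of objects, covering raw percent agreement and Cohen’s kappa, which corrects for agreement expected by chance. The labels are made-up examples.

```python
from collections import Counter

def percent_agreement(labels_a, labels_b):
    """Fraction of objects where both annotators chose the same label."""
    matches = sum(a == b for a, b in zip(labels_a, labels_b))
    return matches / len(labels_a)

def cohens_kappa(labels_a, labels_b):
    """Agreement corrected for chance: 1.0 is perfect, 0.0 is chance-level."""
    n = len(labels_a)
    observed = percent_agreement(labels_a, labels_b)
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    # Probability that both annotators pick the same label by chance.
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    return (observed - expected) / (1 - expected)

a = ["dog", "dog", "cat", "dog", "cat", "dog"]
b = ["dog", "cat", "cat", "dog", "cat", "dog"]
print(percent_agreement(a, b))  # 0.833...
print(cohens_kappa(a, b))       # 0.667: lower, since chance agreement is high
```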
Inadequate training

Before they annotate, labelers must receive proper training to ensure they’re familiar with the video annotation process, tools, and expectations. Otherwise, you risk compromising the outcome with inaccurate labeling, rework, and costly delays. Effective annotator training goes beyond a one-time onboarding session. It should include hands-on practice with the specific annotation tool being used, worked examples covering the most common and ambiguous scenarios in your dataset, and a clear escalation path for edge cases the annotator is unsure about. Ongoing micro-training sessions as new object types or labeling rules are introduced will also help maintain quality over the life of a long project.

Immense datasets

Video data is far larger than its text and image counterparts, so annotating video frames can consume resources that not all companies can spare. Because of this, we recommend the following strategies to manage the scale of video annotation without sacrificing quality:

- Use frame sampling to annotate a representative subset of frames rather than every single one.
- Leverage interpolation to automatically generate labels between manually annotated keyframes.
- Apply pre-trained AI models to generate initial annotations, then use human reviewers to verify and correct them.
- Distribute work across a larger team using a platform with collaborative workflows and task queues.

Combining these approaches can reduce annotation time by a significant margin while keeping dataset quality at the level your model requires.

Data security and privacy

Video annotation requires collecting, storing, and processing large volumes of videos, some of which might contain sensitive information. You need ways to secure datasets throughout the entire labeling pipeline and comply with data privacy laws. Key security considerations for a video annotation project include:

- Ensuring data is encrypted both in transit and at rest.
- Restricting annotator access to only the data they need to label.
- Anonymizing or blurring personally identifiable information (PII) such as faces and license plates before annotation begins.

Also, depending on your industry and geography, you may need to comply with regulations such as GDPR, HIPAA, or CCPA.

Project timeline

Time to market is another concern that puts additional pressure on annotators. By itself, video annotation is a laborious process, and if annotators rely on manual tools, delays can pile up as they spend time addressing labeling issues. Timeline overruns in annotation projects are often caused by unclear requirements discovered mid-project, a high rate of rework due to inconsistent labeling, or bottlenecks in the review and approval process. Mitigating these risks through thorough scoping, a pilot annotation phase, and a clearly defined QA workflow is far more effective than trying to recover time later. Building a realistic buffer into your schedule for edge cases and revisions is equally important.

We know that video labeling can be very tedious, even if you’re equipped with the right tool. That’s why we help companies save time and costs with professional video annotation services.

What are the Best Practices When Annotating Videos?

Don’t be discouraged by the hurdles that might complicate video annotation. With the right precautions and smarter approaches, you can improve annotation quality without committing excessive resources. Here’s how.

Automate when you can

Don’t hesitate to automate the labeling process. Sure, automatic annotation is not perfect, and you’ll likely need to review the frames to ensure they’re correctly labeled. But automatic annotation saves tremendous time that you can better spend on strategizing the computer vision project. If you use CVAT, you can take automated labeling further with SAM-powered annotation. We integrate SAM 2 (Segment Anything Model 2) with our data labeling software to enable instant segmentation and automated tracking of complex objects.

Prioritize video quality

We know that annotators have little or no control over the video they annotate. But on your part, try to ensure the recordings are high quality to start with. The annotation software you use matters too, as some tools can unknowingly degrade video quality. Poor video quality directly impacts annotation accuracy: motion blur, low resolution, and poor lighting make it harder for annotators to draw precise labels and can introduce ambiguity that reduces dataset quality. Where possible, aim for:

- A minimum resolution of 1080p for most annotation tasks, higher for fine-grained labeling.
- A frame rate appropriate for the speed of objects in the scene; faster movement requires more frames per second.
- Consistent lighting conditions, as sudden changes in brightness can confuse both annotators and trained models.
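If you want to check these properties programmatically, and sample a representative subset of frames as suggested earlier, a short OpenCV script is one way to do it. This is a starting-point sketch; it assumes the opencv-python package is installed, and the file name is a placeholder.

```python
import cv2  # pip install opencv-python

def inspect_and_sample(path, every_n=30, out_pattern="frame_{:06d}.jpg"):
    """Print a video's basic properties and save every Nth frame to disk."""
    cap = cv2.VideoCapture(path)
    if not cap.isOpened():
        raise IOError(f"Could not open {path}")
    width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
    height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
    fps = cap.get(cv2.CAP_PROP_FPS)
    print(f"{width}x{height} @ {fps:.1f} fps")

    index = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        if index % every_n == 0:  # keep a representative subset of frames
            cv2.imwrite(out_pattern.format(index), frame)
        index += 1
    cap.release()

inspect_and_sample("traffic.mp4", every_n=30)
```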
Keep labels and datasets organized

Video annotation can get out of hand quickly if you don’t stick to an organized workflow. Overlapping classes, misplaced datasets, and other confusion can limit your annotators’ productivity. Thankfully, these issues can be addressed with a user-friendly data annotation tool. Good organization starts with a clear, hierarchical list of all object classes and their attributes before annotation begins. Version-controlling your datasets and annotation files is equally important, as it allows you to roll back to a previous state if errors are introduced. Lastly, naming conventions for tasks, jobs, and exported files should be agreed upon by the whole team from day one.

Interpolate sequences with keyframes

You don’t need to label every single frame in a video. Instead, you can assign keyframes between predictable sequences and interpolate them. Trust us; this will save you lots of time. Keyframe interpolation works best when objects move in a predictable, linear path between frames. For more complex or erratic motion, you may need to place keyframes more frequently to maintain accuracy. A good rule of thumb is to place a keyframe whenever an object changes direction, speed, or is partially occluded. Reviewing the interpolated frames afterward is always recommended, as automated interpolation can drift on longer sequences.

Set up a feedback system

Annotators need feedback from domain experts and machine learning engineers to know whether they’re labeling correctly. Likewise, any updates to labeling requirements must be communicated to the entire team. Good data annotation software is equipped with a feedback mechanism that streamlines this communication.

Caption: Annotation feedback in CVAT

An effective feedback loop is bidirectional. Reviewers should be able to flag specific frames or objects with comments that annotators can act on directly within the tool. Equally, annotators should have a clear channel to raise ambiguous cases or request clarification on guidelines. Closing this loop quickly prevents small misunderstandings from compounding across thousands of frames.

Import shorter videos

Long videos clog up bandwidth if you’re uploading them to an online annotation tool. If you don’t want to spend hours waiting for a video to load, break it into smaller ones, preferably below the one-minute mark. Shorter video segments also have workflow benefits beyond upload speed. They make it easier to assign discrete chunks of work to individual annotators, track progress at a granular level, and isolate quality issues to a specific segment.
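One convenient way to split long recordings, assuming ffmpeg is installed on your system, is its segment muxer, which cuts a file into fixed-length chunks without re-encoding. A minimal Python wrapper might look like this (the file names are placeholders):

```python
import subprocess

def split_video(path, segment_seconds=60, out_pattern="clip_%03d.mp4"):
    """Split a video into fixed-length segments using ffmpeg's segment muxer.

    `-c copy` avoids re-encoding, so splitting is fast and lossless; cuts
    land on the nearest keyframe, so segment lengths are approximate.
    """
    subprocess.run(
        [
            "ffmpeg", "-i", path,
            "-f", "segment",
            "-segment_time", str(segment_seconds),
            "-reset_timestamps", "1",
            "-c", "copy",
            out_pattern,
        ],
        check=True,
    )

split_video("long_recording.mp4", segment_seconds=60)
```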
Try Your Hand At Annotating Videos Today

As we’ve explored, video annotation is the critical engine driving innovation across every major industry, from autonomous transit and smart cities to life-saving medical AI. But while the impact of high-quality data is undeniable, the challenges of managing massive datasets and ensuring pixel-perfect consistency are very real hurdles for any development team. Successfully navigating these technical demands requires a robust infrastructure that can bridge the gap between raw footage and a deployment-ready model.

CVAT is designed to provide this exact foundation, allowing you to transform a laborious manual process into a high-speed, high-accuracy production engine. Want to try it for yourself? CVAT Online works in your browser, with no infrastructure to install or manage. The hosted platform supports 2D images, videos, and 3D point clouds, so your team can begin annotating right away. For teams running annotation at scale, CVAT Enterprise adds dedicated support, enterprise security options such as SSO/LDAP, and collaboration and reporting features that help large production teams monitor quality and throughput.

Commonly Asked Questions About Video Annotation

What is the difference between video annotation and video tagging?

While the terms are sometimes used interchangeably, video annotation is a more specific and technical process than video tagging. Video tagging generally refers to adding descriptive keywords or labels to an entire video, while video annotation involves labeling individual objects, actions, or events within the video on a frame-by-frame basis.

How much does video annotation cost?

The cost of video annotation can vary widely depending on a number of factors, including the length and complexity of the video, the type of annotation required, the level of accuracy needed, and the cost of labor. For a detailed breakdown of the factors that influence annotation costs, refer to our in-depth guide on the topic.

What is the best software for video annotation?

The best software for video annotation depends on your specific needs and budget. For individuals and small teams, open-source tools like CVAT Community can be a great option. For larger teams and enterprise projects, CVAT Enterprise offers a self-hosted platform and advanced support.

How can I ensure the quality of my video annotations?

Ensuring the quality of your video annotations requires a multi-faceted approach. This includes providing clear and detailed labeling instructions, implementing a multi-level review process, and using an annotation platform that includes quality control features. It is also important to track key quality metrics, such as inter-annotator agreement and label accuracy.