Our Data Annotation Services
As the world scrambles to build accurate machine-learning models that meet our ever-growing digital needs, the lack of high-quality annotated data has become increasingly apparent.
Data annotation is the process by which a set of data is labelled with tags that are relevant to it to train machine-learning models. Such data can take the form of audio, images, video or text, all of which must be labelled as accurately as possible.
CaptionCube has annotated more than 830 hours of audio data in Singaporean English, Mandarin Chinese, Chinese dialects and Bahasa Melayu (Malay), which includes data for the medical sector. We also have substantial experience in annotating image and video data.
Based in Singapore, CaptionCube is home to a team of meticulous talents who are familiar with the Singaporean context and well-versed in English (including Singlish!), Mandarin Chinese and Bahasa Melayu (Malay), making us the ideal solution to your local annotation needs.
Lighten your data engineers’ massive workloads by outsourcing the tedious task of poring over huge amounts of data to us! Let us assist you in building accurate machine-learning models in a swift and fuss-free manner.
Readily available Singaporean Speech Data Set
We have a set of full-verbatim Singaporean Conversational English speech data (100 recorded hours), which is available for licensing.
“NTU engaged CaptionCube’s transcription services for Singlish corpus building. Their transcription was delivered on time and of the highest quality. I would definitely recommend them to anyone who needs transcription services!”
Lim Zhi Hao, Business Development, AI Singapore Speech Lab (Abax.ai)
“We engaged the services of CaptionCube to do a series of transcriptions from December 2020 to February 2021. It was easy and enjoyable to work with them, as they promptly replied our emails and questions. Their service was professional and the quality of work was high, although their charges were relatively competitive in the market. Additionally, when we needed some data ahead of time, they were able to give it to us. The social mission they have was another draw as they provide people with special needs with jobs. In short, we highly recommend working with CaptionCube!”
Dr Anthony Leow and Shuyi Chua, Research Staff, Republic Polytechnic
Our Data Annotation Samples
Transcript and Audio Annotation Samples
Listen to the audio and download the samples below to find out more about what CaptionCube has to offer!
1. Tell us about your data annotation needs using this contact form.
2. Our customer service team will get back to you with a quote within 1 to 2 working days.
3. Send us your content and receive the annotated data based on your specifications and turnaround time.