Data Annotator Intern (Remote)

Remote: 
Full Remote
Contract: 

Offer summary

Qualifications:

Currently pursuing a Bachelor’s or Master’s degree in Data Science, Data Analytics, Computer Science, Statistics, Mathematics, Engineering, or a related field., Strong attention to detail and ability to work with diverse data formats., Proficiency in Excel for data entry and basic analysis., Highly trustworthy and responsible, able to handle sensitive information with discretion..

Key responsibilities:

  • Manually review and annotate data from customer invoices to ensure accuracy for machine learning purposes.
  • Collaborate with the Data Team to provide feedback on invoice formats and data extraction issues.
  • Perform quality checks on manually entered data to maintain high standards for AI model training.
  • Identify data entry challenges and suggest improvements for processing different invoice formats.

Portcast logo
Portcast Startup https://www.portcast.io
11 - 50 Employees
See all jobs

Job description

About Us:
Portcast is a venture-backed startup which predicts global trade flows to help logistics and shipping companies become more profitable. We are a predictive analytics company that offers a fast-paced, innovative environment where you will be empowered to sell our AI-product to C-level executives. We are customer-obsessed and are constantly working to provide our customers access to actionable and insightful data to build resilient supply chains.

Portcast is transforming international supply chains by leveraging machine learning and real-time external data to predict cargo movements and automate freight audit processes. Based in Singapore, and backed by leading VCs and senior industry advisors, Portcast is a team of young tech and industry talent building a game-changing product for the logistics and shipping industry.

About the role:
As part of the Data Team, you will focus on a critical aspect for our AI OCR product. Your core task will be to manually review and annotate data from customer invoices, ensuring that the extracted data is accurate and ready to be used for machine learning purposes. This is a key role in maintaining the quality of the data that feeds into our AI models, which directly impact our predictive capabilities.

While you will be working closely with the Data team, your work will involve a mix of manual data entry, annotation, and feedback to improve the OCR system's ability to handle complex and varied invoice formats. 

What You’ll Do:
  • Manual Data Entry & Annotation: Review a set of 2k+ invoices in different formats, manually extract relevant data, and input it accurately into Excel. Ensure that the data is consistent and of high quality for AI model training.
  • Collaborate with the Data Team: Work closely with our data team to provide feedback on problematic invoice formats and data extraction issues, helping to improve the OCR model's accuracy and performance.
  • Data Quality Assurance: Perform quality checks on manually entered data to ensure that it meets the standards required for training and improving AI models.
  • Confidentiality Compliance: Handle confidential customer data with the utmost care. You will be required to sign an NDA to protect sensitive data and maintain compliance with data privacy regulations.
  • Invoice Format Feedback: Identify recurring data entry challenges and suggest ways to improve the system’s ability to process different types of invoices and formats efficiently.

  • Requirements:
  • Currently pursuing a Bachelor’s or Master’s degree in any field, preferably Data Science, Data Analytics, Computer Science, Statistics, Mathematics, Engineering, or a related field.
  • Highly trustworthy, responsible, and able to handle sensitive information with discretion.
  • Comfortable working with confidential data and maintaining strict confidentiality standards.
  • Strong organizational skills and the ability to manage time effectively while handling large data volumes.
  • We are looking for someone who is not only smart and detail-oriented but also dependable and accountable—someone we can trust as a key contributor to data-driven projects.
  • Strong attention to detail and ability to work with diverse and sometimes complex data formats.
  • Proficiency in Excel for data entry, manipulation, and basic analysis.

  • What's In It For You:
  • Hands-on experience in the development and improvement of AI-driven OCR system, with real-world applications in logistics and supply chain.
  • Exposure to the process of working with messy, real-world data and supporting the refinement of machine learning models.
  • Opportunity to collaborate closely with a dynamic Data Science team and contribute directly to impactful AI projects.
  • Join us at Portcast and be part of a high performing team that is shaping the future of the logistics and shipping industry through cutting-edge predictive analytics!

    Required profile

    Experience

    Spoken language(s):
    English
    Check out the description to know which languages are mandatory.

    Other Skills

    • Microsoft Excel
    • Client Confidentiality
    • Accountability
    • Time Management
    • Organizational Skills
    • Detail Oriented
    • Reliability

    Related jobs