Vacancy expired!
Language Specialists Needed for Penn Research Project
Penn's Linguistic Data Consortium (LDC) is seeking speakers of the following languages for a short term language research project
Arabic
Burmese
Cambodian
Dari
Georgian
Hebrew
Kannada
Korean
Malayalam
Maldivian
Odia
Telugu
Tibetan
Urdu
Uyghur
Researchers at The University of Pennsylvania are collecting and analyzing images of written language for the CAMIO (Corpus of Annotated Multilingual Images for OCR) Project. CAMIO data will be used to train and test artificial intelligence technology capable of recognizing and automatically transcribing images for dozens of languages spoken around the world.
We are hiring language specialists who can help us build the CAMIO research database. You will search the web for thousands of highly variable images of written texts in your language. You will then label the collected images for features of interest, e.g. whether the image contains tables or handwriting. You may also be asked to apply additional analysis to some images, for instance drawing bounding boxes around each line of text and transcribing the words that appear in that line.
This is a short-term, temporary position. Work can be done anywhere and you can set your own working hours.
Primary Responsibilities
Use creative web searching to find variable images of written text in your language
Review collected data to verify that it contains required features
Label linguistic and visual features of the collected data, following detailed guidelines
Use computer tools to draw bounding boxes around lines of text
Accurately transcribe text from images of writing in your language
Other duties as assigned
Qualifications/Requirements
Highly fluent in one or more of the languages specified above, including ability to read and write in the language
Able to conduct effective web searches to find highly variable images of written text
Able to follow written and verbal instructions in English
Able to learn new computer skills with limited support
Excellent attention to detail
Reliable, mature and able to meet established deadlines
Able to work at least 15 hours/week for at least 2 consecutive months
Access to a reliable computer with a high-speed internet connection
Eligible to work as an independent contractor for the University of Pennsylvania
Compensation for this project is based on amount of work completed, equivalent to $15/hour with the potential to earn bonuses.
To apply, send your resume and cover letter to camio.ldc@gmail.com. The subject line should state the name of your language and the words Language Specialist CAMIO Project (e.g. Maldivian Language Specialist CAMIO Project).
Company Profile: Linguistic Data Consortium (LDC) is a not-for-profit organization hosted by the University of Pennsylvania that creates and distributes linguistic resources to universities, laboratories, companies and libraries around the world in support of language-related education, research and technology development.
Visit us at https://www.ldc.upenn.edu for more information.