AI Engineer – Computer Vision Specialist
If you want to join a fast-learning team and share our vision, send your CV to info@devisionx.com
Please use the subject line “AI Engineer – Computer Vision Specialist”
Location:
Cairo, Egypt (fully remote)
Level:
Mid-Level/Senior
Job Purpose
We are seeking a highly skilled AI Engineer specializing in computer vision with expertise in Vision-Language Models (VLMs) and Visual Retrieval-Augmented Generation (RAG). The ideal candidate will play a key role in developing scalable AI vision workflows using the latest advancements in multimodal learning, LLMs, and visual reasoning, leveraging the workflow-building capabilities of Tuba.AI.
Job Responsibilities
- Design, develop, and implement AI-driven computer vision solutions tailored to industry-specific applications, including quality inspection, object detection, vision-guided robotics, and defect analysis.
- Utilize Tuba.AI’s drag-and-drop workflow builder to design, train, and deploy custom vision solutions seamlessly.
- Develop multimodal systems combining vision and text using Vision-Language Models (e.g., CLIP, Flamingo, BLIP) and vision backbones such as DINOv2 for enhanced contextual understanding.
- Implement and integrate Visual Retrieval-Augmented Generation (RAG) models that enhance AI performance by retrieving external image or textual knowledge to answer complex queries.
- Collaborate with the research team to fine-tune VLMs for domain-specific visual tasks (e.g., product categorization, medical imaging, or industrial automation).
- Work with large-scale visual datasets and knowledge bases, leveraging data retrieval mechanisms for dynamic AI workflows.
- Develop pipelines for image-to-text and text-to-image tasks with applications in e-commerce, media, and manufacturing.
- Collaborate cross-functionally to identify key business challenges and design computer vision solutions to address them.
Job Requirements
- Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, or a related field.
- Strong understanding of Vision-Language Models (VLMs) and experience working with models like CLIP, OpenFlamingo, DINOv2, or related architectures.
- Experience developing Visual RAG systems, integrating retrieval pipelines and databases into AI workflows.
- Proficiency in Python, with experience in libraries like PyTorch, TensorFlow, OpenCV, and Hugging Face Transformers.
- Experience with AutoML platforms and tools for building, deploying, and managing custom vision models.
- Knowledge of prompt engineering for vision-related tasks and fine-tuning multimodal models for high-accuracy predictions.
- Familiarity with LLMs (Large Language Models) and their role in generating dynamic, visual outputs.
- Proficiency in working with visual datasets, embeddings, vector databases (e.g., Pinecone, Weaviate, or FAISS), and large-scale training datasets.
Preferred Skills
- Experience with edge deployment of vision models using low-latency hardware.
- Knowledge of zero-shot and few-shot learning techniques in computer vision.
- Strong understanding of semantic search and multimodal retrieval for use cases in e-commerce or industrial inspection.
- Familiarity with cloud services (AWS, GCP, Azure) for scalable model training and deployment.
- Familiarity with fine-tuning pre-trained VLMs for custom use cases.
What We Offer:
- The opportunity to work on cutting-edge AI projects with real-world applications in industries such as manufacturing, logistics, and retail.
- A collaborative, innovation-driven work environment with access to advanced tools and resources like Tuba.AI.
- Competitive salary, growth opportunities, and continuous learning initiatives.
If you are passionate about AI and computer vision, please send your CV to info@devisionx.com with the subject line “AI Engineer – Computer Vision Specialist”.