Hello, I'm

Anna FP

Data AI Engineer — Building intelligent, data-driven systems.

Competencies

Data Engineering & Backend

Architecting robust Python pipelines and scalable APIs. Optimizing data retrieval using Vector databases, Elasticsearch, and MongoDB.

PythonFastAPIMultiprocessingETLElasticSearchVectorDBsMongoDBPostgreSQLQdrantChromaDBRedisPydanticMultithreading

Generative AI & ML

Orchestrating intelligent agents with LangGraph and Lanhchain. Leveraging PyTorch and Sklearn for advanced data inference.

LangChainOpenAIPyTorchLangGraphHuggingFacePytorchTensorflowMCPRAGRAGASScikit-learnEvidently AIPrompt Engineering

Orchestration & Visualization

Automating workflows and data processing using Airflow and CI/CD pipelines. Deploying containerized apps and building data dashboards for UI and analytics.

BashDockerStreamlitAirflowDash MantineJenkinsMLflowAnsibleLangSmith

Projects

ArtGuide

ArtGuide

AI Art Detector and Audio Guide

An interactive AI-powered guide for art enthusiasts. It utilizes RAG technology to provide real-time insights, context, and conversational exploration of artworks.

PythonDockerQdrantDBLangchainCLIPPiperTTSLanggraphFastAPIOpenAIStreamlit
ML Resilience Lab

ML Resilience Lab

Credit Card Fraud Detection System & Resilience Platform

A high-availability fraud detection system with real-time pipelines, automated drift detection, and self-healing MLOps protocols.

AirflowMongoDBXGBoostFastAPIEvidently AIMLflowLangSmithStreamlitDocker Compose
The News Hub

The News Hub

Real-Time News Engine and Insights

A comprehensive news aggregation and analysis platform driven by AI. It serves as a centralized system to collect, process, and interactively explore global news content efficiently.

PythonAirflowSklearnDockerChromaDBMongoDBLangchainFastAPIOpenAINext.js

Experience

Data Engineer

@ BMAT Music Innovators
Feb 2024 - Present
  • Engineered robust Python scripts to synchronize and maintain critical backend data flows, guaranteeing continuous data consistency across Dev, Staging, and Prod environments.
  • Developed high-concurrency RESTful APIs using FastAPI and asynchronous programming, providing stable support for multiple downstream services and ensuring strict data validation with Pydantic.
  • Optimized search performance and metadata discovery by implementing and managing Elasticsearch clusters for high-volume music catalog indexing, ensuring millisecond-latency retrieval.
  • Designed and deployed interactive production dashboards, centralizing scattered data to facilitate real-time data visualization and decision-making for key stakeholders.
PythonAirflowDockerMongoDBElasticsearchFastAPIDashETLPydantic