Skip to content
View haja-k's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report haja-k

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
haja-k/README.md
Header

Backend-First

LinkedIn Email

Focus Focus Focus Focus

I build and operate AI systems that ship: retrieval pipelines, evaluation + quality gates, and production infrastructure for LLM-backed products.

What I do

Area Typical work
Retrieval infrastructure Hybrid retrieval (graph + vector + structured filters), indexing, ranking, tuning
Production readiness Load tests, latency/concurrency modeling, rollout checks, operational guardrails
Security automation Repeatable OWASP ZAP scans and baseline security workflows
Reliability Routing, TLS hardening, incident-ready logging/metrics, active-active patterns

Toolbox

Category Stack
Languages Python · TypeScript/JavaScript · Bash
AI/Retrieval LangChain · embeddings pipelines · hybrid ranking · evaluation workflows
Data/Stores Neo4j · PostgreSQL/pgVector · TiDB · MySQL
Infra/Delivery Docker · Nginx · Linux
Testing/Quality k6 · Locust · Playwright · OWASP ZAP

Public deployments

Most production code I work with is confidential (state government use cases). These deployed products are publicly accessible:

Product Link
Dayang chatbot (Sarawak services portal) https://service.sarawak.gov.my/web/
Court-related project (public article reference) https://ekss-portal.kehakiman.gov.my/portals/web/home/article_view/0/5/1
Malaysia public library chatbot (button-based) https://www.u-library.gov.my/portal/web/guest

Selected public repos

These repos mirror the way I build systems end-to-end (pipeline → retrieval → validation), even when production code isn’t public.

Area Repository What it shows
Local multi-agent AI app agentic-video-analyst offline inference · multi-agent orchestration · desktop app engineering
Graph ingestion + retrieval neo4j-document-pipeline graph modeling · retrieval API patterns for LLM workflows
Vector + hybrid experiments tidb-vector-llm-testbed relevance/scoring experiments · indexing tradeoffs
Embedding pipeline mysql-to-pgvector-embeddings extraction → embeddings → pgVector semantic layer
Structured retrieval faq-retrieval-system structured query layer for grounded answers
Performance testing playwright-dayang, k6-for-custom-dify UX + API load testing approaches for assistants
Security automation zap-security-api ZAP baseline/quick/full scan exposed via API
Experiments playwright-study, besu-ibft2.0 targeted learning repos (testing + distributed systems)

Principles

  • Measured improvements (eval + monitoring) over demo-only features
  • Quality, latency, and security as release criteria
  • Operable systems: clear failure modes, logs/metrics, and runbook-friendly workflows

Pinned Loading

  1. agentic-video-analyst agentic-video-analyst Public

    Fully local AI desktop application that uses multi-agent orchestration to analyze short videos (~1 min) through natural language queries. All AI inference runs offline with no cloud dependencies.

    Python

  2. mysql-to-pgvector-embeddings mysql-to-pgvector-embeddings Public

    vectorizing data from mysql database to vector so it can be used by LLM in Dify workflow orchestration

    Python 2

  3. tidb-vector-llm-testbed tidb-vector-llm-testbed Public

    Experimental framework for evaluating TiDB’s vector search capabilities with LangChain-based LLM retrieval workflows. Includes setup scripts, indexing pipelines, and retrieval benchmarks to test hy…

    Python

  4. neo4j-document-pipeline neo4j-document-pipeline Public

    Using Neo4j for knowledge graph. Complete with API for end-to-end ingestion, indexing and retrieval pipeline ready for workflow integration.

    Python

  5. besu-ibft2.0 besu-ibft2.0 Public

    hyperledger besu with ibft 2.0 experiment

    Shell 1

  6. img-classification-api img-classification-api Public

    image classification platform for image model self training

    JavaScript