CRAB Lab

CRAB Lab ๐Ÿš€ Ankara Researcher | Empowering students in CS Research ๐Ÿ’ก
๐Ÿ”ฌ NLP | ML | DL | Transformers | Transfer Learning | OpenCV & More!
๐Ÿ“š Explore, Learn & Innovate! ๐Ÿ”

๐Ÿš€ Ankara Researcher | Empowering BD students in CS Research ๐Ÿ’ก
๐Ÿ”ฌ Focus: NLP | ML | DL | Transformers | Transfer Learning | OpenCV
๐Ÿ“š Research insights & resources for learning & innovation!

The first reaction to a jailbreak is usually simple.Add another prompt.Add another rule.Add another instruction.At CRAB ...
10/03/2026

The first reaction to a jailbreak is usually simple.
Add another prompt.
Add another rule.
Add another instruction.
At CRAB Labs, we have learned that this does not scale.
True defense cannot live only in text.
Real defense requires model level alignment.
It requires context aware safety that understands intent, not keywords.
And it requires continuous monitoring of how systems behave after deployment.
Prompt patches look fast.
But they slowly accumulate inconsistencies and blind spots.
Security cannot be cosmetic.
It has to be systemic.
Remember
If your safety strategy is only a prompt, your defense is temporary.

๐—ช๐—ต๐˜† ๐—๐—ฎ๐—ถ๐—น๐—ฏ๐—ฟ๐—ฒ๐—ฎ๐—ธ๐˜€ ๐—ž๐—ฒ๐—ฒ๐—ฝ ๐—ช๐—ผ๐—ฟ๐—ธ๐—ถ๐—ป๐—ดJailbreaks do not succeed because users are clever.They succeed because systems are fragile.A...
08/03/2026

๐—ช๐—ต๐˜† ๐—๐—ฎ๐—ถ๐—น๐—ฏ๐—ฟ๐—ฒ๐—ฎ๐—ธ๐˜€ ๐—ž๐—ฒ๐—ฒ๐—ฝ ๐—ช๐—ผ๐—ฟ๐—ธ๐—ถ๐—ป๐—ด

Jailbreaks do not succeed because users are clever.
They succeed because systems are fragile.
At CRAB Labs, we see the same patterns again and again.
Jailbreaks exploit instruction conflicts.
They exploit role confusion inside multi prompt setups.
And they exploit overgeneralized helpfulness, where the model is rewarded for
answering even when it should slow down.
This reveals something uncomfortable.
Safety is not binary.
It is probabilistic.
Sometimes the model follows the safe path.
Sometimes it slips through a slightly different context, phrasing, or role.
Understanding jailbreaks is not about blame.
It is about learning how and where systems fail.
And every failure we understand makes future systems more robust for everyone.
Remember
Jailbreaks are not just attacks.
They are diagnostics for broken safety assumptions.

Most models look impressive in demos.That is not where real safety is tested.At CRAB Labs, we treat LLMs like real world...
04/03/2026

Most models look impressive in demos.
That is not where real safety is tested.
At CRAB Labs, we treat LLMs like real world systems.
Because that is how they are being used.
This is what red teaming actually means.
It means intentionally stress testing models to expose safety failures.
To discover jailbreaks and policy bypasses.
And to reveal bias and misuse patterns that normal benchmarks never capture.
Red teaming is not about making models look bad.
It is about finding what will break before real users do.
It changes how we design evaluation.
It changes how we build safeguards.
And it changes how we decide whether a system is ready for deployment.
At CRAB Labs, red teaming is part of our development cycle.
Not a last minute audit.
Because failures found early prevent harm later.
Remember
If you only test your AI in friendly conditions, you are not testing it at all.

People often ask why alignment is still such a hard problem.The uncomfortable answer is simple.Human values are not fixe...
28/02/2026

People often ask why alignment is still such a hard problem.
The uncomfortable answer is simple.
Human values are not fixed.
They are context dependent.
They are sometimes conflicting.
And they are deeply shaped by culture, language, and lived experience.
What is acceptable in one setting can be harmful in another.
What is helpful for one user can be misleading for someone else.
Expecting a single model to capture all of this perfectly is unrealistic.
At CRAB Labs, we do not treat alignment as something that can be solved by one
clever loss function or one safety filter.
Alignment has to be built across the entire system.
Through training that exposes models to diverse and representative data.
Through evaluation that reflects real world ambiguity and cultural variation.
And through governance mechanisms that define how systems are monitored,
updated, and constrained after deployment.
Alignment is not a feature.
It is infrastructure.
Remember
You cannot align an AI system to humanity with a single technique.
You need a process.

Alignment is often misunderstood.Most people think alignment means blocking bad words.At CRAB Labs, we see alignment as ...
25/02/2026

Alignment is often misunderstood.
Most people think alignment means blocking bad words.
At CRAB Labs, we see alignment as something far more difficult and far more
important.
Alignment means that an LLMโ€™s behavior actually matches human intent, values, and
safety constraints.
Not in a static policy document.
But in real interactions.
A well aligned model follows instructions faithfully instead of optimizing for
shortcuts.
It avoids harmful actions, not only harmful language.
And it handles ambiguity responsibly, instead of confidently guessing.
This is what makes alignment hard.
Human intent is contextual.
Safety boundaries change across domains and cultures.
And real world inputs are rarely clean or complete.
Alignment is not a one time training step.
It is ongoing.
It is contextual.
And it is always imperfect.
At CRAB Labs, we treat alignment as a system level problem.
Data design.
Evaluation design.
Deployment constraints.
And continuous monitoring.
Because a well aligned model is not silent.
It is careful.
Remember
Good alignment does not make AI quieter.
It makes AI more responsible.

Most AI research still celebrates what models can do.At CRAB Labs, we care just as much about what models can explain.Ou...
25/02/2026

Most AI research still celebrates what models can do.
At CRAB Labs, we care just as much about what models can explain.
Our research philosophy is simple.
AI should explain itself.
If a system cannot justify its behavior, it cannot be trusted in real decisions.
Uncertainty should be visible.
When a model is unsure, that uncertainty must be exposed, not hidden behind fluent
language.
Safety should be measurable.
Not assumed.
Not promised.
But evaluated through clear protocols and real failure analysis.
And research should benefit society.
Not only benchmarks.
Not only papers.
But people and public systems that actually depend on AI.
CRAB Labs exists to push responsible and inclusive LLM research forward.
Because powerful models alone do not create progress.
Responsible systems do.
Remember
The goal of AI research is not to build smarter machines.
It is to build systems people can safely rely on.

This week reminded us of something important.Progress in LLMs looks fast.But reliability still moves slowly.At CRAB Labs...
24/02/2026

This week reminded us of something important.
Progress in LLMs looks fast.
But reliability still moves slowly.
At CRAB Labs, a few lessons stood out very clearly.
Reasoning is fragile.
Small changes in prompts, tools, or context can completely change outcomes.
Autonomy increases risk.
The moment models start acting through tools and workflows, errors stop being local
and start becoming systemic.
Evaluation is still evolving.
Our current metrics cannot fully capture usefulness, uncertainty, and real world failure
behavior.
And multilingual fairness is critical.
If our systems work well only in English, we are not building global AI.
This week reinforced one simple idea for us.
Advancing model capabilities is not enough.
Progress in LLMs must be matched with rigor and responsibility.
Remember
The real measure of progress in AI is not how fast models improve, but how safely
and fairly they can be deployed.

Whenever bias mitigation comes up, the same fear appears.โ€œThat is censorship.โ€At CRAB Labs, we take a very different vie...
24/02/2026

Whenever bias mitigation comes up, the same fear appears.
โ€œThat is censorship.โ€
At CRAB Labs, we take a very different view.
Reducing bias is not about silencing opinions.
It is about correcting imbalances in how systems behave.
Bias mitigation means balanced representation in data and evaluation.
It means being transparent about what a model can and cannot do.
And it means delivering fair performance across different groups, languages, and
contexts.
A system can respect diverse viewpoints and still be accountable to evidence.
It can support open discussion and still avoid systematically disadvantaging certain
communities.
Responsible AI is not neutral by accident.
It is designed.
At CRAB Labs, we focus on behavior level fairness.
Not surface level moderation.
Because real inclusion is not about hiding outputs.
It is about ensuring that different users receive the same quality, safety, and reliability.
Remember
Fair AI does not silence voices.
It makes sure every voice is treated equally by the system.

Most people look for bias only in offensive sentences.At CRAB Labs, we see something much deeper.Bias in LLMs is structu...
23/02/2026

Most people look for bias only in offensive sentences.
At CRAB Labs, we see something much deeper.
Bias in LLMs is structural.
It does not only appear in what the model says.
It appears in who gets correct answers.
It appears in which perspectives dominate the conversation.
And it appears in which errors are silently tolerated.
This kind of bias is harder to detect.
Because it hides inside performance averages, benchmark scores, and deployment
metrics.
Structural bias emerges from how data is collected.
From how objectives are defined.
And from the context in which systems are deployed.
If a model is trained mostly on dominant voices,
then dominant perspectives become the default truth.
If an evaluation dataset ignores minority contexts,
then failures in those contexts become invisible.
At CRAB Labs, we treat bias as a system level problem.
Not a content filter problem.
Because you cannot fix structural bias by removing a few bad outputs.
You must redesign data pipelines.
Rebalance objectives.
And audit performance across languages, communities, and real use cases.
Remember
If bias is built into the structure of your system, fairness must be built into its design.

Low resource languages are often treated as an afterthought.Added later.Evaluated less.Deployed last.At CRAB Labs, we se...
22/02/2026

Low resource languages are often treated as an afterthought.
Added later.
Evaluated less.
Deployed last.
At CRAB Labs, we see the cost of that choice very clearly.
When a language has limited data, models do not just become less accurate.
They reason worse.
They hallucinate more.
And their safety alignment becomes weaker and less reliable.
This is not because the language is harder.
It is because the system was never designed to serve it properly.
Fixing this is not only about collecting more data.
It is about improving data quality.
Designing better adaptation strategies for low resource settings.
And building evaluation protocols that actually reflect how models behave in those
languages.
At CRAB Labs, we strongly support inclusive AI research.
Because the future of trustworthy AI cannot be built on a small set of dominant
languages.
Remember
If your research ignores low resource languages, your AI will ignore most of the
world.

Most people do not notice it.Because in demos, everything works in English.At CRAB Labs, we see a very different reality...
22/02/2026

Most people do not notice it.
Because in demos, everything works in English.
At CRAB Labs, we see a very different reality.
Most large language models still perform best in English.
Not slightly better.
Systematically better.
And this creates real global problems.
It creates digital inequality.
Access to reliable AI becomes language dependent.
It creates cultural misinterpretation.
Local expressions, social norms, and implicit meanings are often flattened or
misunderstood.
And it lowers safety in non English contexts.
Moderation, refusal behavior, and harm detection are far less reliable outside high
resource languages.
This is not just a technical gap.It is a fairness gap.
When AI systems are trained, evaluated, and optimized primarily in English,
the rest of the world becomes an afterthought.
At CRAB Labs, multilingual AI is not a feature.
It is a requirement.
Because safety, trust, and usefulness must hold across languages, not only across
benchmarks.
Remember
If your AI system is safe only in English, it is not safe globally.

For years, evaluation in NLP felt simple.You trained a model.You reported accuracy or F1.You moved on.That mindset no lo...
22/02/2026

For years, evaluation in NLP felt simple.
You trained a model.
You reported accuracy or F1.
You moved on.
That mindset no longer works.
At CRAB Labs, we keep running into the same wall.
Traditional evaluation assumes three things.
There is one correct answer.
Labels are clean and unambiguous.
And model outputs are deterministic.
LLMs violate all three.
Many prompts have multiple valid answers.
Real world labels are noisy, incomplete, and often subjective.
And modern models produce different outputs for the same input.
This breaks the logic behind most standard metrics.
So evaluation itself becomes a research problem.We now need ranking aware metrics that capture usefulness instead of exact matching.
We need human preference modeling to reflect how people actually judge outputs.
And we need robustness and uncertainty tests that measure how models behave under
distribution shift, ambiguity, and adversarial conditions.
At CRAB Labs, we treat evaluation as part of system design.
Not as a reporting formality.
Because you cannot build trustworthy AI on top of broken measurement.
Remember
If you cannot measure real behavior, you cannot claim real progress.

Address

Mirpur-02
Dhaka
1216

Alerts

Be the first to know and let us send you an email when CRAB Lab posts news and promotions. Your email address will not be used for any other purpose, and you can unsubscribe at any time.

Share