Blind Review Sanitizer

Automatically anonymize academic manuscripts for double-blind peer review by removing author identifiers, institutional affiliations, acknowledgments, and excessive self-citations while preserving document formatting and scholarly content integrity.

Key Capabilities:

- Author Identity Removal: Automatically detect and redact author names, institutional affiliations, and contact information using pattern matching and customizable rules
- Acknowledgment Section Sanitization: Identify and remove or flag acknowledgment sections that may reveal author identity through funding sources or personal thanks
- Self-Citation Detection and Neutralization: Identify first-person citations and excessive self-references that could deanonymize the submission
- Multi-Format Document Support: Process DOCX, Markdown, and plain text files with format-aware sanitization strategies
- Audit Trail Generation: Create detailed logs of all redactions made for verification and transparency

Parameters

Parameter	Type	Required	Default	Description
--input	str	Yes	-	Path to input manuscript file (DOCX, MD, or TXT)
--output

When to Use

✅ Use this skill when:

- Preparing a manuscript for double-blind peer review at journals requiring author anonymity
- Submitting to conferences or journals with anonymization requirements (e.g., NeurIPS, ICML, ACL, or major medical journals)
- Performing a final compliance check before submission to ensure no identifying information remains
- Re-sanitizing a previously rejected manuscript for submission to a new venue with different anonymization standards
- Creating anonymized versions of papers for public preprints while maintaining citation integrity
- Processing collaborative manuscripts where some authors need to remain anonymous for specific submissions

❌ Do NOT use when:

- Preparing for open peer review or journals with transparent review processes → Use cover-letter-drafter instead
- The manuscript contains patent-pending innovations requiring author identification → Consult legal counsel first
- You need to add author information rather than remove it → Use citation-formatter for bibliography management
- Working with highly sensitive clinical data requiring HIPAA compliance → Use hipaa-compliance-auditor for medical data
- The document uses complex LaTeX formatting with embedded author macros → Manual review required

Related Skills:

- Upstream: cover-letter-drafter, citation-formatter, conflict-of-interest-checker
- Downstream: journal-club-presenter, conference-abstract-adaptor

Integration with Other Skills

Upstream Skills:

- cover-letter-drafter: Generate cover letters AFTER manuscript sanitization to avoid including blinded content in correspondence
- citation-formatter: Format citations BEFORE sanitization to ensure proper numbering and formatting
- conflict-of-interest-checker: Check co-author conflicts BEFORE anonymization to maintain disclosure accuracy

Downstream Skills:

- journal-club-presenter: Create presentation materials using sanitized versions for external review
- conference-abstract-adaptor: Adapt abstracts for conferences that may have different anonymity requirements

Complete Workflow:

Manuscript Writing → citation-formatter → conflict-of-interest-checker → blind-review-sanitizer → cover-letter-drafter → Submission

Core Capabilities

1. Author Identity Detection and Removal

Systematically identify and remove author names, institutional affiliations, and contact information from manuscripts using pattern recognition and user-specified rules.

CODEBLOCK1

Parameters:

Parameter	Type	Required	Description	Default
authors	List[str]	No	List of author names to redact. Improves accuracy when specified.	None
case_sensitive

Best Practices:

- ✅ Always provide explicit author names when known to improve detection accuracy and reduce false positives
- ✅ Test with sample documents before processing important manuscripts to verify redaction patterns
- ✅ Include all name variants (full names, initials, anglicized versions) in the authors list
- ✅ Review output carefully for author names that may appear in figures, tables, or supplementary materials

Common Issues and Solutions:

Issue: Common words flagged as author names

- Symptom: Words like "Wang" (meaning "king" in Chinese) or common English names appearing in text are incorrectly redacted
- Solution: Use an explicit author list with full names; disable partial matching for documents with many common name words

Issue: Author names in citations not detected

- Symptom: "As Smith et al. (2023) showed..." retains the author name when Smith is an author
- Solution: Use self-citation detection mode, which specifically targets author names in citation contexts
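
The explicit-author-list advice above can be illustrated with a small standalone sketch. It is not the tool's implementation: the function name `redact_authors`, the placeholders, and the email pattern are assumptions made for this example. It shows why whole-name matching avoids redacting look-alike words, and why emails are redacted before names.

```python
import re
from typing import Iterable

# Deliberately simple email pattern (an assumption for this sketch).
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def redact_authors(text: str, authors: Iterable[str],
                   placeholder: str = "[AUTHOR]",
                   case_sensitive: bool = True) -> str:
    """Redact email addresses, then every listed author name variant."""
    # Emails go first so a surname inside an address is never half-redacted.
    text = EMAIL_RE.sub("[EMAIL]", text)
    flags = 0 if case_sensitive else re.IGNORECASE
    # Longest variants first, so "Jane Smith" is handled before "Smith".
    for name in sorted(set(authors), key=len, reverse=True):
        # The lookarounds require that the name is not part of a longer
        # word, so "Smith" does not match inside "Smithson" or "blacksmith".
        pattern = r"(?<!\w)" + re.escape(name) + r"(?!\w)"
        text = re.sub(pattern, lambda m: placeholder, text, flags=flags)
    return text


sample = "Jane Smith (jane.smith@example.edu) extends Smith et al. (2023)."
print(redact_authors(sample, ["Jane Smith", "Smith"]))
```

Passing `case_sensitive=False` widens matching to variants such as "SMITH", at the cost of more false positives for names that are also common words.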

2. Institutional Affiliation Masking

Automatically detect and replace institutional identifiers including universities, research institutes, departments, and laboratories with generic placeholders.

CODEBLOCK2

Parameters:

Parameter	Type	Required	Description	Default
institution_keywords	List[str]	No	Custom keywords for institution detection	Predefined list
strict_mode	bool	No	Only match explicit institutional patterns; reduces false positives	False

Best Practices:

- ✅ Add custom institution keywords for specialized domains (e.g., "Consortium", "Network") not in the default list
- ✅ Use strict mode for documents with many false positives (e.g., medical texts mentioning "hospitals" frequently)
- ✅ Check institutional abbreviations, which may not be caught by full-name patterns (e.g., "JHU" for Johns Hopkins)
- ✅ Verify geographical references, as some may indirectly reveal institutions (e.g., "Silicon Valley campus")

Common Issues and Solutions:

Issue: Generic words flagged as institutions

- Symptom: "Research group" or "technical institute" in generic contexts are replaced
- Solution: Enable strict mode or add negative context patterns to exclude generic usage

Issue: Multi-campus institutions not fully masked

- Symptom: "University of California, Berkeley" partially masked as "[INSTITUTION], Berkeley"
- Solution: Pre-process to combine multi-part institutional names before sanitization
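
One way to picture both issues above is a stricter pattern that only fires on capitalized, proper-noun-like institution names and treats a trailing ", Campus" as part of the same name. The sketch below is an illustration under assumptions: the keyword tuple and the function `mask_institutions` are hypothetical and do not reflect the tool's predefined list or its strict mode internals.

```python
import re
from typing import Sequence

# Hypothetical keyword list; the tool's predefined list is not shown here.
KEYWORDS = ("University", "Institute", "College", "Laboratory")


def mask_institutions(text: str, keywords: Sequence[str] = KEYWORDS,
                      placeholder: str = "[INSTITUTION]") -> str:
    """Mask capitalized institution names, including ', Campus' suffixes."""
    kw = "|".join(re.escape(k) for k in keywords)
    cap = r"[A-Z][\w&.'-]*"                # one capitalized word
    name = rf"{cap}(?: {cap})*"            # one or more capitalized words
    # "University of California, Berkeley": keyword, "of", name, campus.
    with_of = rf"(?:{cap} )*(?:{kw}) of {name}(?:, {cap})?"
    # "Max Planck Institute": at least one capitalized word, then keyword.
    prefixed = rf"(?:{cap} )+(?:{kw})"
    pattern = re.compile(rf"\b(?:{with_of}|{prefixed})\b")
    return pattern.sub(placeholder, text)


print(mask_institutions(
    "We collaborated with the University of California, Berkeley on this."))
```

Because the keywords are matched case-sensitively and need a proper-noun neighbor, generic phrases such as "technical institute" or a bare "University research" are left alone. The campus suffix rule is a heuristic and can over-match when a capitalized word follows the comma for other reasons.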

3. Acknowledgment Section Management

Intelligently identify and handle acknowledgment sections, funding disclosures, and personal thanks that may reveal author identity or institutional affiliations.

CODEBLOCK3

Parameters:

Parameter	Type	Required	Description	Default
keep_acknowledgments	bool	No	Retain acknowledgment section instead of removing	False
acknowledgment_titles	List[str]	No	Custom section titles to recognize	Predefined list

Best Practices:

- ✅ Remove acknowledgments by default for strict double-blind review requirements
- ✅ Check funding disclosure requirements: some journals require anonymized funding info in acknowledgments
- ✅ Use custom titles for non-English manuscripts or specialized formats
- ✅ Review anonymized funding numbers: grant numbers sometimes contain institutional codes

Common Issues and Solutions:

Issue: Acknowledgment section not detected

- Symptom: Acknowledgments section with non-standard title (e.g., "Gratitude", "Credits") remains in document
- Solution: Add custom acknowledgment titles or manually review document structure

Issue: Essential content in acknowledgment section

- Symptom: Data availability statements or ethical approvals mentioned in acknowledgments are removed
- Solution: Move essential content to appropriate sections before sanitization; use keep_acknowledgments with manual redaction
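
The heading-based removal described above can be sketched as a line filter. This is a minimal illustration, not the tool's code: the title tuples, the marker text, and the standalone function `remove_acknowledgments` are assumptions for this example. Everything from a recognized acknowledgment heading up to the next known section heading is dropped.

```python
from typing import List, Sequence

# Hypothetical defaults; the tool's predefined title list is not shown here.
DEFAULT_TITLES = ("acknowledgments", "acknowledgements", "funding")
# Headings assumed to end the acknowledgment block.
NEXT_SECTIONS = ("references", "bibliography", "appendix", "data availability")


def _heading(line: str) -> str:
    """Normalize a heading: drop Markdown '#', a trailing ':', and case."""
    return line.strip().lstrip("#").strip().rstrip(":").lower()


def remove_acknowledgments(lines: Sequence[str],
                           titles: Sequence[str] = DEFAULT_TITLES,
                           marker: str = "[ACKNOWLEDGMENTS REMOVED]") -> List[str]:
    """Drop lines from an acknowledgment heading to the next section."""
    out: List[str] = []
    skipping = False
    for line in lines:
        head = _heading(line)
        if head in titles:
            skipping = True
            out.append(marker)      # leave a visible trace of the removal
            continue
        if skipping and head in NEXT_SECTIONS:
            skipping = False        # the next section starts here
        if not skipping:
            out.append(line)
    return out


doc = ["Results", "It works.", "Gratitude", "We thank Prof. X.", "References"]
print(remove_acknowledgments(doc, titles=DEFAULT_TITLES + ("gratitude",)))
```

Without the extra "gratitude" title the sample document passes through unchanged, which mirrors the non-standard-title issue. Keeping "data availability" in the list of closing headings means a separately headed statement survives removal.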

4. Self-Citation Detection and Neutralization

Identify excessive self-citations and first-person references to previous work that could deanonymize the submission, replacing them with neutral language.

CODEBLOCK4

Parameters:

Parameter	Type	Required	Description	Default
INLINECODE26	bool	No	Only highlight self-citations without replacing	False
INLINECODE27	Dict[str, str]	No	Custom replacement phrases	Default mappings

Best Practices:

- ✅ Use highlight mode first to review all self-citations before final replacement
- ✅ Maintain citation integrity: ensure numbered references remain valid after text changes
- ✅ Check context carefully: some self-references may be essential for narrative flow
- ✅ Balance anonymity with scholarship: excessive neutralization may make the paper less clear

Common Issues and Solutions:

Issue: Legitimate references flagged as self-citations

- Symptom: General references to "our approach" or "our method" in methodology sections are flagged
- Solution: Review highlighted instances manually; adjust context to use passive voice before sanitization

Issue: Citations broken after text replacement

- Symptom: "As we showed [1]" becomes "As [PREVIOUS WORK] showed [1]" but reference [1] is the author's paper
- Solution: This is expected behavior. Reviewers should not see author self-citations; citations will be restored post-review
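
The highlight-first workflow recommended above can be pictured with the following sketch. The pattern table and the function `neutralize_self_citations` are hypothetical and much narrower than whatever default mappings the tool ships; they only illustrate the difference between marking a phrase for review and replacing it with neutral wording.

```python
import re
from typing import Dict, Optional

# Hypothetical mappings; the tool's default table is not shown here.
DEFAULT_REPLACEMENTS: Dict[str, str] = {
    r"\bas we (?:showed|demonstrated|reported)\b": "as shown previously",
    r"\bour (?:previous|prior|earlier) (?:work|study|paper)\b": "previous work",
}


def neutralize_self_citations(text: str, highlight_only: bool = False,
                              replacements: Optional[Dict[str, str]] = None) -> str:
    """Highlight or neutralize first-person references to earlier work."""
    rules = DEFAULT_REPLACEMENTS if replacements is None else replacements
    for pattern, neutral in rules.items():
        def repl(match: "re.Match[str]", neutral: str = neutral) -> str:
            found = match.group(0)
            if highlight_only:
                # Review mode: mark the phrase and leave its wording intact.
                return f"[[SELF-CITE: {found}]]"
            # Replace mode: keep sentence-initial capitalization.
            if found[0].isupper():
                return neutral[0].upper() + neutral[1:]
            return neutral
        text = re.sub(pattern, repl, text, flags=re.IGNORECASE)
    return text


sentence = "As we showed [1], the method works."
print(neutralize_self_citations(sentence, highlight_only=True))
print(neutralize_self_citations(sentence))
```

The patterns only match phrases that point at earlier work, so a plain "our method" in a methodology section is untouched. The citation marker `[1]` is never edited, which keeps numbering valid.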

5. Multi-Format Document Processing

Process manuscripts in DOCX, Markdown, and plain text formats with format-aware handling to preserve structure while sanitizing content.

CODEBLOCK5

Supported Formats:

Format	Extension	Features Preserved	Special Handling
Microsoft Word	.docx	Styles, tables, formatting	python-docx library required
Markdown

Best Practices:

- ✅ Prefer DOCX for complex documents with tables and formatting; it preserves structure best
- ✅ Use Markdown for version-controlled manuscripts (e.g., Git-tracked alternatives to LaTeX)
- ✅ Validate output formatting, especially for complex tables and mathematical content
- ✅ Check figure/table captions, which may contain author information in DOCX files

Common Issues and Solutions:

Issue: DOCX formatting lost after processing

- Symptom: Complex formatting, styles, or tracked changes are stripped
- Solution: Accept all changes and remove comments before sanitization; save as clean document first

Issue: Unicode/UTF-8 characters corrupted in text files

- Symptom: Special characters (accents, mathematical symbols) display incorrectly
- Solution: Ensure files are UTF-8 encoded; specify encoding explicitly if needed
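
Format-aware handling with explicit UTF-8 encoding might look like the sketch below. The function `sanitize_file` and its callback argument are assumptions for illustration, not the tool's API. The DOCX branch uses python-docx's `Document`, `paragraphs`, `runs`, and `save`, and is only imported when a `.docx` file is actually processed.

```python
from pathlib import Path
from typing import Callable


def sanitize_file(src: str, dst: str, sanitize: Callable[[str], str]) -> None:
    """Apply a text sanitizer to a TXT, MD, or DOCX file."""
    suffix = Path(src).suffix.lower()
    if suffix in {".txt", ".md"}:
        # Explicit UTF-8 avoids platform-default encodings corrupting
        # accents and mathematical symbols.
        text = Path(src).read_text(encoding="utf-8")
        Path(dst).write_text(sanitize(text), encoding="utf-8")
    elif suffix == ".docx":
        try:
            from docx import Document  # provided by python-docx
        except ImportError as exc:
            raise RuntimeError(
                "python-docx not installed; run: pip install python-docx"
            ) from exc
        doc = Document(src)
        # Editing run by run keeps character formatting. doc.paragraphs
        # covers body paragraphs only; tables, headers, and footers need
        # their own traversal, and a name split across two runs is missed.
        for para in doc.paragraphs:
            for run in para.runs:
                run.text = sanitize(run.text)
        doc.save(dst)
    else:
        raise ValueError(f"Unsupported format: {suffix or '(none)'}")
```

A call such as `sanitize_file("paper.md", "paper_blinded.md", my_sanitizer)` would pass the whole file through `my_sanitizer`, where both file names and the callback are placeholders for this example.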

6. Audit Trail and Verification Reporting

Generate comprehensive logs of all sanitization actions for transparency, quality assurance, and compliance verification.

CODEBLOCK6

Audit Information Captured:

Information Type	Description	Use Case
INLINECODE28	List of author names redacted	Verification, re-identification post-review
INLINECODE29

Best Practices:

- ✅ Save audit logs for compliance verification and post-review re-identification
- ✅ Review audit trail before submission to ensure nothing was missed
- ✅ Use logs for quality improvement: identify patterns in missed redactions
- ✅ Share logs with co-authors for multi-author verification

Common Issues and Solutions:

Issue: Audit log too verbose

- Symptom: Every instance of common words (e.g., "University") is logged separately
- Solution: Use summary mode for large documents; filter by category for review

Issue: Sensitive information in audit logs

- Symptom: Audit logs themselves contain the sensitive data that was removed
- Solution: Store logs securely; consider encrypting or limiting access to audit data
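
A possible shape for such an audit trail, including a summary view for large documents, is sketched below. The class and field names (`Redaction`, `AuditTrail`, `summary_only`) are hypothetical; the tool's actual log schema is not shown in this document.

```python
import json
from collections import Counter
from dataclasses import asdict, dataclass, field
from typing import Dict, List


@dataclass
class Redaction:
    category: str       # e.g. "author", "institution"
    original: str       # sensitive: the text that was removed
    replacement: str
    line: int


@dataclass
class AuditTrail:
    entries: List[Redaction] = field(default_factory=list)

    def record(self, category: str, original: str,
               replacement: str, line: int) -> None:
        self.entries.append(Redaction(category, original, replacement, line))

    def summary(self) -> Dict[str, int]:
        """Count redactions per category instead of listing each one."""
        return dict(Counter(e.category for e in self.entries))

    def to_json(self, summary_only: bool = False) -> str:
        payload: Dict[str, object] = {"summary": self.summary()}
        if not summary_only:
            # The full log holds the removed text, so store it securely.
            payload["entries"] = [asdict(e) for e in self.entries]
        return json.dumps(payload, ensure_ascii=False, indent=2)


trail = AuditTrail()
trail.record("author", "Jane Smith", "[AUTHOR]", 3)
trail.record("institution", "Example University", "[INSTITUTION]", 4)
print(trail.to_json(summary_only=True))
```

The summary-only output contains counts but none of the removed strings, so it can be shared more freely than the full log.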

Complete Workflow Example

From input to output for double-blind journal submission:

CODEBLOCK7

Python API Usage:

CODEBLOCK8

Expected Output Files:

CODEBLOCK9

Common Patterns

Pattern 1: Standard Double-Blind Journal Submission

Scenario: Preparing a research paper for submission to a journal such as Nature, Science, or IEEE Transactions when the submission will go through strict double-blind review.

CODEBLOCK10

Workflow:

1. Run sanitization with full author list and strict settings
2. Manually verify title page, headers, and footers
3. Check supplementary materials for author metadata
4. Submit blinded version with separate cover letter

Output Example:
CODEBLOCK11

Pattern 2: Conference Submission with Anonymization Period

Scenario: Submitting to computer science conference (e.g., ICML, NeurIPS) that requires anonymization during review but allows deanonymization after acceptance.

CODEBLOCK12

Workflow:

1. Use highlight mode to identify all potential identity leaks
2. Review highlighted sections manually
3. Replace GitHub URLs with anonymous placeholder
4. Keep general acknowledgments but remove personal thanks
5. Create separate deanonymization key for post-acceptance
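
The URL replacement and deanonymization key steps in this workflow can be combined in one pass. The sketch below is illustrative: the function `anonymize_repo_urls`, the placeholder format, and the simplified GitHub URL pattern are assumptions, not part of the tool.

```python
import re
from typing import Dict, Tuple

# Simplified pattern: path made of word characters, '.', '/', and '-',
# not ending in '.', so sentence punctuation stays outside the match.
REPO_URL_RE = re.compile(r"https?://(?:www\.)?github\.com/[\w./-]*[\w/-]")


def anonymize_repo_urls(text: str) -> Tuple[str, Dict[str, str]]:
    """Replace GitHub URLs with placeholders and return a restore key."""
    key: Dict[str, str] = {}    # placeholder -> original URL
    seen: Dict[str, str] = {}   # original URL -> placeholder

    def repl(match: "re.Match[str]") -> str:
        url = match.group(0)
        if url not in seen:
            placeholder = f"[ANONYMIZED-REPO-{len(seen) + 1}]"
            seen[url] = placeholder
            key[placeholder] = url
        return seen[url]        # repeated URLs share one placeholder

    return REPO_URL_RE.sub(repl, text), key


blinded, key = anonymize_repo_urls(
    "Code is available at https://github.com/jsmith/blind-model.")
print(blinded)
print(key)
```

The returned key is what gets stored privately until acceptance; replacing each placeholder with its mapped URL restores the original text.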

Output Example:
CODEBLOCK13

Pattern 3: Medical Journal with Funding Disclosure Requirements

Scenario: Submitting to medical journal requiring funding disclosure but author anonymity during review.

CODEBLOCK14

Workflow:

1. Keep acknowledgments section but redact investigator names
2. Retain funding information with anonymized grant numbers
3. Remove all institutional identifiers
4. Maintain IRB/ethics committee references
5. Add note about funding disclosure availability

Output Example:
CODEBLOCK15

Pattern 4: Resubmission After Previous Rejection

Scenario: Revising and resubmitting a previously rejected manuscript to a new journal, requiring fresh anonymization.

CODEBLOCK16

Workflow:

1. Create clean copy without previous submission metadata
2. Update author list to include new collaborators
3. Remove all references to previous submission or reviews
4. Sanitize with updated author information
5. Verify no "response to reviewers" content remains in main text

Output Example:

Before: "We have revised the manuscript based on Nature Medicine reviewer comments..."

After: Complete removal of all previous submission references

Quality Checklist

Pre-sanitization Checks:

- [ ] CRITICAL: Verify you have permission to anonymize all authors' contributions
- [ ] Confirm target journal/conference anonymization requirements
- [ ] Compile complete list of all author names and name variants
- [ ] Identify all institutional affiliations including secondary appointments
- [ ] Check for author photos or bios in supplementary materials
- [ ] Review document properties/metadata for author information
- [ ] Verify no tracked changes or comments contain author identity

During Sanitization:

- [ ] Run initial scan with highlight mode to preview all changes
- [ ] Verify author list is complete (check initials, surnames, full names)
- [ ] Confirm acknowledgment handling matches journal policy
- [ ] Check self-citation replacement maintains citation flow
- [ ] Review institution masking in complex affiliations
- [ ] Validate contact information removal (emails, phone, addresses)
- [ ] Ensure figure captions and table notes are processed

Post-sanitization Verification:

- [ ] CRITICAL: Search for author surnames in output document
- [ ] Check headers, footers, and page numbers for institutional branding
- [ ] Verify document properties are cleared (in Word: File → Info → Check for Issues → Inspect Document)
- [ ] Review all figures for embedded author metadata or watermarks
- [ ] Check PDF metadata if converting to PDF for submission
- [ ] Validate that citations remain properly numbered after text changes
- [ ] Ensure mathematical notation and symbols are preserved correctly

Before Submission:

- [ ] CRITICAL: Have a non-author colleague verify anonymity
- [ ] Confirm acknowledgments handling meets journal ethical guidelines
- [ ] Check that funding disclosure requirements are met
- [ ] Verify supplementary materials are also sanitized
- [ ] Test submission system upload with sanitized file
- [ ] Create deanonymization key mapping for post-acceptance
- [ ] Save audit trail securely for potential post-review verification
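
The surname search in the post-sanitization checks above is easy to script. The helper below is a hypothetical sketch, independent of the tool, that reports every line of the sanitized text where a listed surname still appears as a whole word.

```python
import re
from typing import Iterable, List, Tuple


def find_residual_identifiers(text: str,
                              surnames: Iterable[str]) -> List[Tuple[int, str, str]]:
    """Return (line number, surname, line) for each surviving surname."""
    names = list(surnames)
    hits: List[Tuple[int, str, str]] = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for surname in names:
            # Whole-word, case-insensitive match: catches "SMITH" in a
            # running header but not "blacksmith".
            pattern = rf"(?<!\w){re.escape(surname)}(?!\w)"
            if re.search(pattern, line, flags=re.IGNORECASE):
                hits.append((lineno, surname, line.strip()))
    return hits


blinded = "Title\nAs Smith et al. showed\nblacksmith tools"
for lineno, surname, line in find_residual_identifiers(blinded, ["Smith"]):
    print(f"line {lineno}: '{surname}' still present -> {line}")
```

An empty result only covers body text; figures, metadata, and supplementary files still need the manual checks listed above.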

Common Pitfalls

Input Preparation Issues:

- ❌ Processing tracked changes without accepting → Hidden revision marks reveal author identity

- ✅ Accept all changes and remove comments before sanitization

- ❌ Ignoring document metadata → File properties contain author name and institution

- ✅ Clear document properties: File → Info → Check for Issues → Inspect Document

- ❌ Forgetting supplementary materials → Author info in supplementary PDFs not sanitized

- ✅ Process all supplementary files through sanitizer with same settings

- ❌ Incomplete author lists → Co-author names appear in text unrecognized

- ✅ Include all authors, middle names, and common name variants

Sanitization Strategy Issues:

- ❌ Over-aggressive replacement → Legitimate citations and references damaged

- ✅ Use highlight mode first; review each replacement context

- ❌ Under-sanitization → Subtle identifiers remain (e.g., "our previous work")

- ✅ Enable all detection modules; manually review after automated processing

- ❌ Inconsistent handling → Some instances replaced, others missed

- ✅ Use case-insensitive matching; verify regex patterns cover all variants

- ❌ Context-insensitive replacement → "University research" becomes "[INSTITUTION] research"

- ✅ Review outputs carefully; consider strict mode for ambiguous terms

Output Validation Issues:

- ❌ Assuming perfect automation → Automated tools miss edge cases

- ✅ Always perform manual verification pass

- ❌ Submitting without verification → Undetected author info reaches reviewers

- ✅ Use search function to look for author surnames before submission

- ❌ Losing audit trail → No record of what was changed for post-review

- ✅ Save and securely store sanitization reports

- ❌ Forgetting downstream effects → Citations broken, cross-references lost

- ✅ Verify document integrity after sanitization

Troubleshooting

Problem: Author names still appear in output

- Symptoms: Specific author names visible after sanitization
- Causes:

- Author name not in provided list
- Different spelling or name variant used
- Name embedded in image/figure (not text)
- Name in document metadata, not body text

- Solutions:

- Add all name variants to authors list
- Search for partial matches (surnames only)
- Check document properties and clear metadata
- Manually review figures and images
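
For the metadata cause listed above, a DOCX file can be inspected without opening Word, because it is a ZIP archive whose core properties live in `docProps/core.xml`. The sketch below uses only the standard library; the function name `docx_identity_metadata` is an assumption, and it checks just the creator and last-modified-by fields.

```python
import xml.etree.ElementTree as ET
import zipfile
from typing import Dict

# Namespaces used by the core properties part of an OOXML package.
NS = {
    "dc": "http://purl.org/dc/elements/1.1/",
    "cp": "http://schemas.openxmlformats.org/package/2006/metadata/core-properties",
}


def docx_identity_metadata(path: str) -> Dict[str, str]:
    """Return identity-bearing core properties still present in a DOCX."""
    with zipfile.ZipFile(path) as zf:
        if "docProps/core.xml" not in zf.namelist():
            return {}
        root = ET.fromstring(zf.read("docProps/core.xml"))
    found: Dict[str, str] = {}
    for tag in ("dc:creator", "cp:lastModifiedBy"):
        node = root.find(tag, NS)
        if node is not None and node.text:
            found[tag] = node.text
    return found
```

An empty dictionary means neither field carries a value. Other parts of the package, such as comments or custom properties, are not covered by this check.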

Problem: Excessive false positives

- Symptoms: Common words like "University" or "Center" replaced throughout document
- Causes:

- Overly broad pattern matching
- Generic terms in specialized vocabulary
- Institution keywords used in non-institutional contexts

- Solutions:

- Enable strict mode for more precise matching
- Add context requirements (e.g., must be capitalized)
- Use explicit author/institution lists instead of pattern matching
- Post-process to restore legitimate usage

Problem: Document formatting corrupted

- Symptoms: Styles lost, fonts changed, layout broken after DOCX processing
- Causes:

- Complex formatting not supported by python-docx
- Tracked changes or comments interfering
- Corrupted original document

- Solutions:

- Accept all changes and remove comments before processing
- Save document in compatibility mode if using advanced features
- Use plain text or Markdown for highly formatted documents
- Manually verify and correct formatting post-processing

Problem: Self-citations not detected

- Symptoms: First-person references to previous work remain in text
- Causes:

- Non-standard phrasing not matching patterns
- Citations in different format (e.g., "our Nature 2020 paper")
- Citations in footnotes not processed

- Solutions:

- Add custom patterns for specific phrasing
- Use highlight mode to identify missed citations
- Check all document sections including footnotes/endnotes
- Manually review for unique self-reference formulations

Problem: Acknowledgment section not removed

- Symptoms: Acknowledgments section remains in output
- Causes:

- Non-standard section title
- Section formatted as body text, not heading
- Acknowledgments integrated into other sections

- Solutions:

- Add custom acknowledgment titles for your document style
- Manually identify and mark acknowledgment boundaries
- Use section-based processing if document has clear structure
- Review document outline to identify acknowledgment location

Problem: References/citations broken

- Symptoms: Citation numbers wrong, references missing, cross-references broken
- Causes:

- Text replacement affecting citation indices
- Reference list entries removed that are still cited
- Self-citation replacement removing reference context

- Solutions:

- Use neutral replacement that preserves citation markers
- Verify reference list integrity after processing
- Consider citations as protected text
- Manually correct citation numbering if needed
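
Treating citations as protected text, as suggested above, can be done by swapping numeric markers for sentinels before any replacement runs and restoring them afterwards. The sketch below is an assumption-laden illustration: the function `apply_with_protected_citations`, the sentinel format, and the bracketed-number pattern are not taken from the tool.

```python
import re
from typing import Callable, List

# Numeric markers such as [1], [2, 3], or [4-6].
CITATION_RE = re.compile(r"\[\d+(?:\s*[,\-–]\s*\d+)*\]")


def apply_with_protected_citations(text: str,
                                   transform: Callable[[str], str]) -> str:
    """Run a text transform while shielding numeric citation markers."""
    markers: List[str] = []

    def stash(match: "re.Match[str]") -> str:
        markers.append(match.group(0))
        # NUL-delimited sentinel; the transform must leave it untouched.
        return f"\x00CITE{len(markers) - 1}\x00"

    shielded = CITATION_RE.sub(stash, text)
    transformed = transform(shielded)
    restored = re.sub(r"\x00CITE(\d+)\x00",
                      lambda m: markers[int(m.group(1))], transformed)
    # Fail loudly if a marker was lost, duplicated, or reordered.
    if CITATION_RE.findall(restored) != markers:
        raise ValueError("citation markers changed during sanitization")
    return restored


def strip_brackets(s: str) -> str:
    """An aggressive transform that deletes any bracketed text."""
    return re.sub(r"\s*\[[^\]]*\]", "", s)


print(apply_with_protected_citations(
    "As we showed [1], see [NOTE] and [2-4].", strip_brackets))
```

Here the bracketed note is deleted while `[1]` and `[2-4]` survive, and the final check turns a silently dropped citation into an explicit error.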

Problem: python-docx import error

- Symptoms: "Error: python-docx not installed" when processing DOCX files
- Causes:

- Required dependency not installed
- Virtual environment not activated

- Solutions:

- Install dependency: pip install python-docx
- Check Python environment and activate if needed
- Consider using text-based formats if dependencies unavailable
- Verify pip and Python installation

References

Available in references/ directory:

- (No reference files currently available for this skill)

External Resources:

- COPE Guidelines for Peer Review: https://publicationethics.org/resources/guidelines
- IEEE Anonymization Guidelines: https://www.ieee.org/publications/rights/index.html
- ACM Policy on Authorship: https://www.acm.org/publications/policies/authorship

Scripts

Located in scripts/ directory:

- main.py - Main sanitization engine with document processing logic

Limitations and Considerations

⚠️ Important Limitations:

1. Not Foolproof: Automated sanitization cannot guarantee complete anonymity. Always perform manual verification.

2. Context Blindness: Pattern matching may miss context-dependent identifiers or incorrectly flag legitimate content.

3. Image Processing: This tool processes text only. Images, figures, and embedded objects may contain identifying information not detected.

4. LaTeX Support: Limited support for LaTeX source files. Consider using LaTeX-specific tools for LaTeX manuscripts.

5. Language Support: Optimized for English and Chinese. Other languages may have reduced accuracy.

⚠️ Ethical and Legal Considerations:

- Author Consent: Ensure all authors consent to anonymization before submission
- Copyright: Anonymization does not change copyright ownership
- Data Availability: Some journals require non-anonymized versions for data/code availability statements
- Post-Acceptance: Plan for deanonymization process after paper acceptance

Last Updated: 2026-02-09 | Skill ID: 162 | Version: 2.0 (K-Dense Standard)

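Blind Review Anonymization Tool

Automatically anonymize academic manuscripts for double-blind peer review by removing author identifiers, institutional affiliations, acknowledgments, and excessive self-citations while preserving document formatting and the integrity of the scholarly content.

Core Features:

- Author Identity Removal: Automatically detect and remove author names, institutional affiliations, and contact information using pattern matching and customizable rules
- Acknowledgment Section Sanitization: Identify and remove or flag acknowledgment sections that may reveal author identity through funding sources or personal thanks
- Self-Citation Detection and Neutralization: Identify first-person citations and excessive self-citations that could expose the submitter's identity
- Multi-Format Document Support: Process DOCX, Markdown, and plain text files with format-aware anonymization strategies
- Audit Trail Generation: Create detailed logs of all edits for verification and transparency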

Parameters

Parameter	Type	Required	Default	Description
--input	str	Yes	-	Path to input manuscript file (DOCX, MD, or TXT)
--output

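When to Use

✅ Use this skill when:

- Preparing a manuscript for double-blind peer review at journals requiring author anonymity
- Submitting to conferences or journals with anonymization requirements (e.g., NeurIPS, ICML, ACL, or major medical journals)
- Performing a final compliance check before submission to ensure no identifying information remains
- Re-anonymizing a previously rejected manuscript for submission to a new journal with different anonymization standards
- Creating anonymized versions for public preprints while maintaining citation integrity
- Processing collaborative manuscripts where some authors need to remain anonymous for specific submissions

❌ Do NOT use this skill when:

- Preparing for open peer review or journals with transparent review processes → Use cover-letter-drafter instead
- The manuscript contains patent-pending innovations requiring author identification → Consult legal counsel first
- You need to add author information rather than remove it → Use citation-formatter for bibliography management
- Handling highly sensitive clinical data requiring HIPAA compliance → Use hipaa-compliance-auditor for medical data
- The document uses complex LaTeX formatting with embedded author macros → Manual review required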

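Related Skills:

- Upstream: cover-letter-drafter, citation-formatter, conflict-of-interest-checker
- Downstream: journal-club-presenter, conference-abstract-adaptor

Integration with Other Skills

Upstream Skills:

- cover-letter-drafter: Generate cover letters AFTER manuscript anonymization to avoid including anonymized content in correspondence
- citation-formatter: Format citations BEFORE anonymization to ensure correct numbering and formatting
- conflict-of-interest-checker: Check co-author conflicts of interest BEFORE anonymization to maintain disclosure accuracy

Downstream Skills:

- journal-club-presenter: Create presentation materials from anonymized versions for external review
- conference-abstract-adaptor: Adapt abstracts for conferences that may have different anonymization requirements

Complete Workflow:

Manuscript Writing → citation-formatter → conflict-of-interest-checker → blind-review-sanitizer → cover-letter-drafter → Submission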

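Core Capabilities

1. Author Identity Detection and Removal

Systematically identify and remove author names, institutional affiliations, and contact information from manuscripts using pattern recognition and user-specified rules.

```python
from scripts.main import BlindReviewSanitizer

# Initialize the sanitizer with known author names
sanitizer = BlindReviewSanitizer(
    authors=["Zhang San", "Li Si", "Wang Wu"],
    keep_acknowledgments=False,
    highlight_self_cites=False,
)

# Process text content
text = """
Zhang San¹, Li Si²
¹Department of Computer Science, Tsinghua University
²School of Information, Peking University

Email: zhangsan@tsinghua.edu.cn
"""

sanitized = sanitizer.sanitize_text(text)
print(sanitized)
```

Parameters:

Parameter	Type	Required	Description	Default
authors	List[str]	No	List of author names to redact. Improves accuracy when specified.	None
case_sensitive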

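Best Practices:

- ✅ Always provide explicit author names when known to improve detection accuracy and reduce false positives
- ✅ Test with sample documents before processing important manuscripts to verify redaction patterns
- ✅ Include all name variants (full names, initials, anglicized versions) in the authors list
- ✅ Review output carefully for author names that may appear in figures, tables, or supplementary materials

Common Issues and Solutions:

Issue: Common words flagged as author names

- Symptom: Words such as "Wang" (meaning "king") or common English names appearing in text are incorrectly redacted
- Solution: Use an explicit author list with full names; disable partial matching for documents with many common name words

Issue: Author names in citations not detected

- Symptom: "As Smith et al. (2023) showed..." retains the author name when Smith is an author
- Solution: Use self-citation detection mode, which specifically targets author names in citation contexts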

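2. Institutional Affiliation Masking

Automatically detect institutional identifiers, including universities, research institutes, departments, and laboratories, and replace them with generic placeholders.

```python
from scripts.main import BlindReviewSanitizer

sanitizer = BlindReviewSanitizer()

# Institution detection uses pattern matching
text_with_institutions = """
Department of Computer Science, Stanford University
Max Planck Institute for Informatics
MIT CSAIL Laboratory
"""

# Process institution information
result = sanitizer.remove_institutions(text_with_institutions)
print(result)
# Output: [INSTITUTION], [INSTITUTION], [INSTITUTION]
```

Parameters:

Parameter	Type	Required	Description	Default
institution_keywords	List[str]	No	Custom keywords for institution detection	Predefined list
strict_mode	bool	No	Only match explicit institutional patterns; reduces false positives	False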

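Best Practices:

- ✅ Add custom institution keywords for specialized domains (e.g., "Consortium", "Network") if they are not in the default list
- ✅ Use strict mode for documents with many false positives (e.g., medical texts that mention hospitals frequently)
- ✅ Check institutional abbreviations, which full-name patterns may not catch (e.g., "JHU" for Johns Hopkins University)
- ✅ Verify geographical references, as some may indirectly reveal institutions (e.g., "Silicon Valley campus")

Common Issues and Solutions:

Issue: Generic words flagged as institutions

- Symptom: "Research group" or "technical institute" in generic contexts are replaced
- Solution: Enable strict mode or add negative context patterns to exclude generic usage

Issue: Multi-campus institutions not fully masked

- Symptom: "University of California, Berkeley" is partially masked as "[INSTITUTION], Berkeley"
- Solution: Pre-process to combine multi-part institutional names before anonymization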

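3. Acknowledgment Section Management

Intelligently identify and handle acknowledgment sections, funding disclosures, and personal thanks that may reveal author identity or institutional affiliations.

```python
from scripts.main import BlindReviewSanitizer

# Initialize so that the acknowledgment section is not kept
sanitizer = BlindReviewSanitizer(keep_acknowledgments=False)

# Sample acknowledgment section
acknowledgment_text = """
Acknowledgments

We thank Professor Johnson for helpful discussions and NSF Grant #12345 for funding.
This work was carried out at the Advanced Computing Center.

References
"""

lines = acknowledgment_text.split("\n")
processed_lines = sanitizer.remove_acknowledgments(lines)

print("\n".join(processed_lines))
# Output: [Acknowledgments removed] followed by the References section
```

Parameters:

Parameter	Type	Required	Description	Default
keep_acknowledgments	bool	No	Retain acknowledgment section instead of removing	False
acknowledgment_titles	List[str]	No	Custom section titles to recognize	Predefined list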

Best Practices:
