Ashley Lavandaer
AshleyLL
Wondering About Coverage and Ad Removal
#4 opened 3 months ago by AshleyLL
Seeking Help on NewsWire OCR Text Cleaning
2
#5 opened 3 months ago by AshleyLL
Does Ultra-FineWeb Include Timestamps or Temporal Metadata?
4
#15 opened 4 months ago by AshleyLL
Inquiry About Timestamp Information in the monology/pile-uncopyrighted Dataset
1
#11 opened 4 months ago by AshleyLL
Guidance Needed: Converting DCLM-Baseline-1.0 to DCLM-Baseline+StarCoder+ProofPile2 (2.6T)
#15 opened 4 months ago by AshleyLL
Inquiry About Timestamp Information in the BAAI/IndustryCorpus2 Dataset
#9 opened 4 months ago by AshleyLL
Seeking Insights on the Composition of the dots.llm1 Pre-training Data
#10 opened 4 months ago by AshleyLL
Question About Expanding Ultra-FineWeb to 8T Tokens for MiniCPM 4
1
#8 opened 4 months ago by AshleyLL
Question Regarding the Usage of 'timeframe'
#1 opened 4 months ago by AshleyLL
Inquiry for Guidance on Accessing Pre-trained Datasets
#2 opened 4 months ago by AshleyLL
Is the full dataset of the-stack-v2-train-full-ids now available for download?
1
#11 opened 5 months ago by AshleyLL
Looking for Alternatives to The Pile Dataset with Timestamps
#17 opened 5 months ago by AshleyLL
Looking for More Extensive Earnings Calls Data
#3 opened 5 months ago by AshleyLL
Overlap between Proof-Pile-2's open-web-math and open-web-math/open-web-math dataset
#8 opened 5 months ago by AshleyLL
Inquiry About 0-byte Files in data/arxiv/0.90-1.00 Directory
1
#3 opened 5 months ago by AshleyLL
Inquiry Regarding Article Aggregation and Timestamp Retention in the Dataset
1
#4 opened 5 months ago by AshleyLL
Inquiry on the Composition of Pre-training Dataset for Qwen2-Math-7B-Instruct and How to Replicate
#3 opened 5 months ago by AshleyLL
Inquiry on the Composition of Pre-training Dataset for CodeQwen1.5-7B-Chat and How to Replicate
#29 opened 5 months ago by AshleyLL
Inquiry on the Composition of Pre-training Dataset for Qwen-2.5-math and How to Replicate
#4 opened 5 months ago by AshleyLL
Inquiry on the Composition of Pre-training Dataset for Qwen-2.5-Coder and How to Replicate
#10 opened 5 months ago by AshleyLL