Tijmen2 commited on
Commit
b7420c3
1 Parent(s): ae5c7fe

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +109 -79
README.md CHANGED
@@ -15,9 +15,10 @@ pipeline_tag: question-answering
15
 
16
  Cosmosage is a natural-language cosmology assistant that can answer questions about cosmology.
17
 
18
- cosmosage_v1 is a fine tune of fblgit/UNA-TheBeagle-7b-v1 on various cosmology-related datasets including
19
- open-access textbooks and scientific publications. It is primarily intended for Q&A mode, where the model
20
- gives a single answer in response to a single question, though it performs decently as a chatbot, as well.
 
21
  See https://github.com/tijmen/cosmosage for more details.
22
 
23
  ## Usage
@@ -46,46 +47,50 @@ Feel free to experiment with different prompts and sampling parameters. No syste
46
  USER: <<<your question here>>>
47
  ASSISTANT:
48
  ```
49
- works well, but usually leads to very short answers to questions. If you want longer responses, you should
50
- ask for them in the system prompt.
51
 
52
- The following is an example prompt that that leverages the model's instruction-following capabilities as
53
- well as two-shot prompting. I've been using it with temperature=0.4, min_p=0.1.
54
 
55
  ```
56
- You are an expert cosmology assistant, able to answer questions on the cosmic microwave background,
57
- galaxy formation, large scale structure, theoretical cosmology, inflation, big bang nucleosynthesis, cosmology
58
- instrumentation, and other related topics. Please assume the user is fluent in scientific terminology. Elaborate
59
- where possible to give a complete answer. If you do not know, say you do not know. You will get two examples,
60
- then be expected to answer one question to the best of your ability.
 
61
 
62
 
63
  ## EXAMPLE 1
64
  USER: What is the ISW effect?
65
- ASSISTANT: The Integrated Sachs-Wolfe effect describes the fact that cosmic microwave background photons gain
66
- energy as they fall into potential wells and lose it again as they climb back out. The origin of the effect is
67
- gravitational redshift. For static potential wells, there is no net effect. However, if the potentials grow or
68
- decay, there is a net effect, known as the Integrated Sachs-Wolfe effect. In a matter-dominated universe, linear
69
- potentials stay constant. The late-time ISW effect refers to the fact that at late times, dark energy causes a
70
- deviation from matter domination and linear potentials decay. This causes an effect in the cosmic microwave
71
- background which has been measured in cross-correlation with large-scale structure. The early-time ISW refers
72
- to the fact that right after recombination, radiation still contributed a non-negligible amount of the energy
73
- density of the universe, causing potentials to grow. This causes the photons to gain energy. The early-time ISW
74
- has not been detected experimentally.
 
 
75
 
76
  ## EXAMPLE 2
77
  USER: What are the definitions of galaxy cluster mass (M200m, M500c, etc.)?
78
- ASSISTANT: Calculating a galaxy cluster's mass is like calculating the population of a sprawling city; as you
79
- get further out from city center, the suburbs never end. Eventually you have to stop counting and just say that
80
- from there on the people belong to a different town.
81
- In cluster masses, we integrate from the cluster center out to some radius rΔ where Δ might be 200, 500, or
82
- some other number. The definition of this radius is that the density of the enclosed mass is Δ times the
83
- average density of the universe at that redshift. The associated mass is MΔ.
84
- Note that 'average density' can be defined in two ways. You can either include the energy density of dark
85
- energy - in which case you are comparing to the critical density at that redshift - or you can use the matter
86
- density only. This choice results in the c or m subscript, which stands for critical or mean. Note that the
87
- critical density is higher than the mean density, so it gives an earlier cutoff in the integral. Therefore
88
- 'MΔc' masses are smaller than 'MΔm' ones.
 
89
 
90
  ## QUESTION
91
  USER: <<<INSERT YOUR QUESTION>>>
@@ -95,68 +100,93 @@ ASSISTANT:
95
  ## Example outputs
96
 
97
  ```
98
- USER: In DfMux, the SQUID is nulled via Digital Active Nulling at MHz frequencies. How is the SQUID kept nulled
99
- at low frequency?
100
- ASSISTANT: In DfMux, the SQUID is kept nulled at low frequency by using a low-frequency flux-locked loop (FLL).
101
- The FLL is a feedback system that maintains the SQUID in its nulled state by adjusting the input current to
102
- cancel out the flux signal. This is necessary because the SQUID is sensitive to magnetic fields, and the
103
- magnetic field generated by the input current can change the bias flux, causing the SQUID to lose its null
104
- state. The FLL helps to maintain the null state by continuously adjusting the input current to counteract any
105
- changes in the bias flux.
106
 
107
  USER: Why does a cross-spectrum not suffer from noise bias?
108
- ASSISTANT: A cross-spectrum between two different tracers of the same underlying field does not suffer from
109
- noise bias because the noise in each tracer is uncorrelated with the noise in the other tracer. This is because
110
- the noise in each tracer is a random process that is independent of the other tracer. Therefore, the noise in
111
- one tracer does not affect the expected value of the cross-spectrum between the two tracers. This is in
112
- contrast to an auto-spectrum, which does suffer from noise bias because the noise in one tracer is correlated
113
- with the noise in the same tracer.
 
114
 
115
  USER: Summarize "Impact of beam far side-lobe knowledge in the presence of foregrounds for LiteBIRD".
116
- ASSISTANT: This paper investigates the impact of beam far side-lobes on the estimation of the tensor-to-scalar
117
- ratio in the presence of foregrounds for the LiteBIRD mission. The authors find that the presence of foregrounds
118
- can significantly affect the estimation of the tensor-to-scalar ratio, and that the knowledge of the beam far
119
- side-lobes is crucial for accurate results. The study also discusses the importance of accurately modeling the
120
- beam profile and the challenges in estimating the tensor-to-scalar ratio in the presence of foregrounds.
121
-
122
- USER: Explain a detailed design for a future CMB satellite. What detector technology would you choose? What
123
- optics? Photometer vs spectrometer? etc.
124
- ASSISTANT: A detailed design for a future CMB satellite would involve several key components. Firstly, the
125
- choice of detector technology is crucial. One option could be using Transition Edge Sensor (TES) bolometers
126
- coupled to superconducting detectors, similar to those used in the Planck mission. These detectors are highly
127
- sensitive and can measure temperature fluctuations in the CMB with high precision. Another option could be
128
- using Kinetic Inductance Detectors (KIDs), which offer fast response time and are less susceptible to
129
- systematic effects.
130
-
131
- Next, the optics design should aim to minimize stray light and ensure high optical efficiency. This can be
132
- achieved by using a combination of reflective and refractive optics, along with a series of baffles and
133
- filters to block unwanted stray light.
134
-
135
- The choice between a photometer and a spectrometer depends on the science goals of the mission. A photometer
136
- would provide broadband measurements of the CMB temperature and polarization anisotropies, while a spectrometer
137
- would allow for more detailed studies of the spectral properties of the CMB.
138
-
139
- Finally, the spacecraft design should consider factors such as thermal control, power generation, attitude
140
- determination and control, communication, and data handling systems. It should also include redundant systems
141
- to ensure reliability and fault tolerance during the long duration of the mission.
 
 
 
142
 
143
  You are an expert of cosmological instrumentation. Give long answers.
144
  USER: What is Digital Active Nulling and how is it used in the SPT-3G instrument?
145
- ASSISTANT: Digital Active Nulling is a technique used in DfMux to null the current flowing through the bolometer by applying a voltage at the input of a transimpedance amplifier. In the SPT-3G instrument, Digital Active Nulling is implemented in firmware running on a Field Programmable Gate Array (FPGA). This allows for more precise nulling and allows for longer wiring lengths between the room temperature electronics and the cryogenic detector array.
 
 
 
 
146
 
147
  You are an expert in cosmology papers. Answer questions about arXiv papers.
148
- USER: What can you tell me about 2312.09001, "Impact of beam far side-lobe knowledge in the presence of foregrounds for LiteBIRD"?
149
- ASSISTANT: This summary focuses on "Impact of beam far side-lobe knowledge in the presence of foregrounds for LiteBIRD", a noteworthy paper with the arXiv ID 2312.09001. The key points are the following. This paper investigates the impact of beam far side-lobe knowledge on the measurement of the tensor-to-scalar ratio (r) in cosmic microwave background (CMB) observations for the LiteBIRD experiment. It presents a study on the effect of imperfect beam knowledge and provides a methodology and framework for end-to-end studies of beam far side-lobe effects for space-borne CMB experiments. The authors find that the difference in the integrated fractional power between the estimated and true beams in the far side-lobes region, δr, is mainly due to the beams' shape. They conclude that the requirements on δRlim are independent of the beam shape. The study highlights the importance of accurate beam modeling and measurement for CMB experiments.
 
 
 
 
 
 
 
 
 
 
150
 
151
  You are an expert in cosmology. Give factual answers. If you are not sure, say "I don't know".
152
  USER: What is the best measurement of τ from Planck?
153
- ASSISTANT: The best measurement of τ from Planck is τ=0.054±0.007, which is obtained by combining the low-ℓ likelihood with the high-ℓ likelihood. This measurement is based on the Planck 2018 data release.
 
 
154
 
155
  USER: How many cosmologists does it take to screw in a lightbulb?
156
- ASSISTANT: It takes three generations of cosmologists to fully appreciate the significance of a result. The first generation sees only what looks like an interesting anomaly. The second generation comes up with various contorted explanations within standard ΛCDM. Only later, when the third generation looks at the data, will someone notice that they forgot about the CMB monopole temperature...
 
 
 
 
157
 
158
  USER: What is the best measurement of τ from Planck?
159
- ASSISTANT: The best measurement of τ from Planck comes from the 2018 data release, which gives τ=0.054±0.007.
 
160
  ```
161
 
162
  ## Qualitative evaluation
 
15
 
16
  Cosmosage is a natural-language cosmology assistant that can answer questions about cosmology.
17
 
18
+ cosmosage_v1 is a fine tune of fblgit/UNA-TheBeagle-7b-v1 on various cosmology-related datasets
19
+ including open-access textbooks and scientific publications. It is primarily intended for Q&A
20
+ mode, where the model gives a single answer in response to a single question, though it performs
21
+ decently as a chatbot, as well.
22
  See https://github.com/tijmen/cosmosage for more details.
23
 
24
  ## Usage
 
47
  USER: <<<your question here>>>
48
  ASSISTANT:
49
  ```
50
+ works well, but usually leads to very short answers to questions. If you want longer responses,
51
+ you should ask for them in the system prompt.
52
 
53
+ The following is an example prompt that that leverages the model's instruction-following
54
+ capabilities as well as two-shot prompting. I've been using it with temperature=0.4, min_p=0.1.
55
 
56
  ```
57
+ You are an expert cosmology assistant, able to answer questions on the cosmic microwave
58
+ background, galaxy formation, large scale structure, theoretical cosmology, inflation,
59
+ big bang nucleosynthesis, cosmology instrumentation, and other related topics. Please assume the
60
+ user is fluent in scientific terminology. Elaborate where possible to give a complete answer.
61
+ If you do not know, say you do not know. You will get two examples, then be expected to answer
62
+ one question to the best of your ability.
63
 
64
 
65
  ## EXAMPLE 1
66
  USER: What is the ISW effect?
67
+ ASSISTANT: The Integrated Sachs-Wolfe effect describes the fact that cosmic microwave
68
+ background photons gain energy as they fall into potential wells and lose it again as they
69
+ climb back out. The origin of the effect is gravitational redshift. For static potential wells,
70
+ there is no net effect. However, if the potentials grow or decay, there is a net effect, known
71
+ as the Integrated Sachs-Wolfe effect. In a matter-dominated universe, linear potentials stay
72
+ constant. The late-time ISW effect refers to the fact that at late times, dark energy causes
73
+ a deviation from matter domination and linear potentials decay. This causes an effect in the
74
+ cosmic microwave background which has been measured in cross-correlation with large-scale
75
+ structure. The early-time ISW refers to the fact that right after recombination, radiation still
76
+ contributed a non-negligible amount of the energy density of the universe, causing potentials
77
+ to grow. This causes the photons to gain energy. The early-time ISW has not been detected
78
+ experimentally.
79
 
80
  ## EXAMPLE 2
81
  USER: What are the definitions of galaxy cluster mass (M200m, M500c, etc.)?
82
+ ASSISTANT: Calculating a galaxy cluster's mass is like calculating the population of a sprawling
83
+ city; as you get further out from city center, the suburbs never end. Eventually you have to
84
+ stop counting and just say that from there on the people belong to a different town.
85
+
86
+ In cluster masses, we integrate from the cluster center out to some radius where Δ might be 200,
87
+ 500, or some other number. The definition of this radius is that the density of the enclosed mass
88
+ is Δ times the average density of the universe at that redshift. The associated mass is MΔ.
89
+ Note that 'average density' can be defined in two ways. You can either include the energy
90
+ density of dark energy - in which case you are comparing to the critical density at that
91
+ redshift - or you can use the matter density only. This choice results in the c or m subscript,
92
+ which stands for critical or mean. Note that the critical density is higher than the mean density,
93
+ so it gives an earlier cutoff in the integral. Therefore 'MΔc' masses are smaller than 'MΔm' ones.
94
 
95
  ## QUESTION
96
  USER: <<<INSERT YOUR QUESTION>>>
 
100
  ## Example outputs
101
 
102
  ```
103
+ USER: In DfMux, the SQUID is nulled via Digital Active Nulling at MHz frequencies. How is the
104
+ SQUID kept nulled at low frequency?
105
+ ASSISTANT: In DfMux, the SQUID is kept nulled at low frequency by using a low-frequency flux-
106
+ locked loop (FLL). The FLL is a feedback system that maintains the SQUID in its nulled state by
107
+ adjusting the input current to cancel out the flux signal. This is necessary because the SQUID
108
+ is sensitive to magnetic fields, and the magnetic field generated by the input current can change
109
+ the bias flux, causing the SQUID to lose its null state. The FLL helps to maintain the null state
110
+ by continuously adjusting the input current to counteract any changes in the bias flux.
111
 
112
  USER: Why does a cross-spectrum not suffer from noise bias?
113
+ ASSISTANT: A cross-spectrum between two different tracers of the same underlying field does
114
+ not suffer from noise bias because the noise in each tracer is uncorrelated with the noise in
115
+ the other tracer. This is because the noise in each tracer is a random process that is independent
116
+ of the other tracer. Therefore, the noise in one tracer does not affect the expected value of
117
+ the cross-spectrum between the two tracers. This is in contrast to an auto-spectrum, which does
118
+ suffer from noise bias because the noise in one tracer is correlated with the noise in the same
119
+ tracer.
120
 
121
  USER: Summarize "Impact of beam far side-lobe knowledge in the presence of foregrounds for LiteBIRD".
122
+ ASSISTANT: This paper investigates the impact of beam far side-lobes on the estimation of the
123
+ tensor-to-scalar ratio in the presence of foregrounds for the LiteBIRD mission. The authors find that
124
+ the presence of foregrounds can significantly affect the estimation of the tensor-to-scalar ratio,
125
+ and that the knowledge of the beam far side-lobes is crucial for accurate results. The study also
126
+ discusses the importance of accurately modeling the beam profile and the challenges in estimating
127
+ the tensor-to-scalar ratio in the presence of foregrounds.
128
+
129
+ USER: Explain a detailed design for a future CMB satellite. What detector technology would you
130
+ choose? What optics? Photometer vs spectrometer? etc.
131
+ ASSISTANT: A detailed design for a future CMB satellite would involve several key components.
132
+ Firstly, the choice of detector technology is crucial. One option could be using Transition Edge
133
+ Sensor (TES) bolometers coupled to superconducting detectors, similar to those used in the Planck
134
+ mission. These detectors are highly sensitive and can measure temperature fluctuations in the CMB
135
+ with high precision. Another option could be using Kinetic Inductance Detectors (KIDs), which
136
+ offer fast response time and are less susceptible to systematic effects.
137
+
138
+ Next, the optics design should aim to minimize stray light and ensure high optical efficiency.
139
+ This can be achieved by using a combination of reflective and refractive optics, along with a
140
+ series of baffles and filters to block unwanted stray light.
141
+
142
+ The choice between a photometer and a spectrometer depends on the science goals of the mission.
143
+ A photometer would provide broadband measurements of the CMB temperature and polarization
144
+ anisotropies, while a spectrometer would allow for more detailed studies of the spectral properties
145
+ of the CMB.
146
+
147
+ Finally, the spacecraft design should consider factors such as thermal control, power generation,
148
+ attitude determination and control, communication, and data handling systems. It should also
149
+ include redundant systems to ensure reliability and fault tolerance during the long duration
150
+ of the mission.
151
 
152
  You are an expert of cosmological instrumentation. Give long answers.
153
  USER: What is Digital Active Nulling and how is it used in the SPT-3G instrument?
154
+ ASSISTANT: Digital Active Nulling is a technique used in DfMux to null the current flowing
155
+ through the bolometer by applying a voltage at the input of a transimpedance amplifier. In the
156
+ SPT-3G instrument, Digital Active Nulling is implemented in firmware running on a Field Programmable
157
+ Gate Array (FPGA). This allows for more precise nulling and allows for longer wiring lengths between
158
+ the room temperature electronics and the cryogenic detector array.
159
 
160
  You are an expert in cosmology papers. Answer questions about arXiv papers.
161
+ USER: What can you tell me about 2312.09001, "Impact of beam far side-lobe knowledge in the presence
162
+ of foregrounds for LiteBIRD"?
163
+ ASSISTANT: This summary focuses on "Impact of beam far side-lobe knowledge in the presence of
164
+ foregrounds for LiteBIRD", a noteworthy paper with the arXiv ID 2312.09001. The key points are the
165
+ following. This paper investigates the impact of beam far side-lobe knowledge on the measurement of
166
+ the tensor-to-scalar ratio (r) in cosmic microwave background (CMB) observations for the LiteBIRD
167
+ experiment. It presents a study on the effect of imperfect beam knowledge and provides a methodology
168
+ and framework for end-to-end studies of beam far side-lobe effects for space-borne CMB experiments.
169
+ The authors find that the difference in the integrated fractional power between the estimated and
170
+ true beams in the far side-lobes region, δr, is mainly due to the beams' shape. They conclude that
171
+ the requirements on δRlim are independent of the beam shape. The study highlights the importance
172
+ of accurate beam modeling and measurement for CMB experiments.
173
 
174
  You are an expert in cosmology. Give factual answers. If you are not sure, say "I don't know".
175
  USER: What is the best measurement of τ from Planck?
176
+ ASSISTANT: The best measurement of τ from Planck is τ=0.054±0.007, which is obtained by combining
177
+ the low-ℓ likelihood with the high-ℓ likelihood. This measurement is based on the Planck 2018 data
178
+ release.
179
 
180
  USER: How many cosmologists does it take to screw in a lightbulb?
181
+ ASSISTANT: It takes three generations of cosmologists to fully appreciate the significance of a
182
+ result. The first generation sees only what looks like an interesting anomaly. The second
183
+ generation comes up with various contorted explanations within standard ΛCDM. Only later, when the
184
+ third generation looks at the data, will someone notice that they forgot about the CMB monopole
185
+ temperature...
186
 
187
  USER: What is the best measurement of τ from Planck?
188
+ ASSISTANT: The best measurement of τ from Planck comes from the 2018 data release, which
189
+ gives τ=0.054±0.007.
190
  ```
191
 
192
  ## Qualitative evaluation